For 1, that’s why you say “Format your answer in this exact sentence: ‘The number of bytes required (rounded up) is exactly # bytes.’, where # is the number of bytes.” And then regex for that sentence. What could go wrong?
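For what it's worth, the regex half of that could look roughly like the sketch below. The pattern and the `extract_byte_count` helper are my own hypothetical names, not anything from a library, and I'm assuming the model might add thousands separators or say "byte" for a count of 1. It returns `None` when the sentence isn't there, so the caller has to handle the case where the model ignored the format.

```python
import re
from typing import Optional

# Hypothetical pattern for the exact sentence requested in the prompt.
# Assumption: the number may contain thousands separators such as "1,024",
# and the model may write "byte" instead of "bytes".
ANSWER_PATTERN = re.compile(
    r"The number of bytes required \(rounded up\) is exactly (\d[\d,]*) bytes?\."
)


def extract_byte_count(reply: str) -> Optional[int]:
    """Return the byte count from the model's reply, or None if the
    expected sentence is missing."""
    match = ANSWER_PATTERN.search(reply)
    if match is None:
        return None  # the model did not follow the requested format
    # Drop any thousands separators before converting to an integer.
    return int(match.group(1).replace(",", ""))


if __name__ == "__main__":
    reply = "Sure! The number of bytes required (rounded up) is exactly 1,024 bytes."
    print(extract_byte_count(reply))
    print(extract_byte_count("It needs about 1024 bytes."))
```

Note that this only checks the sentence was produced, not that the number in it is right.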
Also, it can do math somewhat consistently if you let it show its work, but I still wouldn’t rely on it as a cog in code execution. It’s not nearly reliable enough for that.
Linux machines don’t crash unexpectedly, because if they do, it’s your fault for configuring them wrong and you should have expected it.