Character encoding issues will never go away.
You’re never dealing directly with “characters” or “text”; you’re always dealing with bits as seen through several layers of abstraction. Incorrect results are a sign that one of those layers has failed.
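To make that concrete, here is a minimal Python sketch of one layer failing. The sample string and the variable names are chosen purely for illustration; the only library calls used are the built-in `str.encode` and `bytes.decode`. The same bytes are read back twice, once with the encoding they were written in and once with a different one.

```python
# Illustrative sample text; any string with a non-ASCII character would do.
text = "café"

# Layer 1: characters become bytes under a chosen encoding (here UTF-8).
raw = text.encode("utf-8")       # b'caf\xc3\xa9' (the "é" takes two bytes)

# Layer 2: bytes become characters again. The bytes themselves carry no
# label saying which encoding produced them, so the reader has to know.
right = raw.decode("utf-8")      # 'café'
wrong = raw.decode("latin-1")    # 'cafÃ©' (each byte read as its own character)

print(repr(raw))
print(right)
print(wrong)
```

Nothing in `raw` changed between the two reads. Only the assumption about how to interpret it did, which is why this kind of bug tends to show up as garbled output rather than as an error.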
One reply on “Encodings and Character Sets”
Basically, you can visualise this by assuming that all characters are stored in computers using a code, like the ciphers used in espionage. A character encoding provides the key to unlock (i.e. decode) that code: it is a set of mappings between the byte values stored in the computer and the characters in the coded character set. Without the right key, the data looks like garbage.