Categories
Posts

Encodings and Character Sets

Character encoding issues will never go away.

What Every Programmer Absolutely, Positively Needs to Know About Encodings and Character Sets to Work With Text:

You’re never actually directly dealing with “characters” or “text”, you’re always dealing with bits as seen through several layers of abstractions. Incorrect results are a sign of one of the abstraction layers failing.

One reply on “Encodings and Character Sets”

Basically, you can visualise this by assuming that all characters are stored in computers using a code, like the ciphers used in espionage. A character encoding provides a key to unlock (ie. crack) the code. It is a set of mappings between the bytes representing numbers in the computer and characters in the coded character set. Without the key, the data looks like garbage.

Leave a Reply

Your email address will not be published. Required fields are marked *