Around IT in 256 seconds

#41: Unicode: can you see these: ร†, ็ˆฑ and ๐Ÿš€?

May 25, 2021 | 4 Minute Read

Computers speak bits and bytes. Numbers in general. They donโ€™t understand images, poems and JSON. When we say โ€œhelloโ€, it needs to be encoded to numbers. Conveniently, each character becomes one number. A number can then be stored, transfered and rendered on another computer. Therefore, everyone needs to agree which numbers represent which characters. The first commonly used attempt was called ASCII. American Standard Code for Information Interchange. In short, itโ€™s a table of 127 symbols and their respective numbers. For example, lower-case h is 104, whereas exclamation mark is 33. Thereโ€™s one problem here. 127 symbols. 7 bits. Of course, itโ€™s an American Standard. So it ignores the existence of any other country and alphabet.

More materials

Tags: ascii, unicode

Be the first to listen to new episodes!

To get exclusive content: