#41: Unicode: can you see these: Æ, 爱 and 🚀?
by Tomasz Nurkiewicz
Computers speak bits and bytes. Numbers in general. They don’t understand images, poems and JSON. When we say “hello”, it needs to be encoded to numbers. Conveniently, each character becomes one number. A number can then be stored, transfered and rendered on another computer. Therefore, everyone needs to agree which numbers represent which characters. The first commonly used attempt was called ASCII. American Standard Code for Information Interchange. In short, it’s a table of 127 symbols and their respective numbers. For example, lower-case h is 104, whereas exclamation mark is 33. There’s one problem here. 127 symbols. 7 bits. Of course, it’s an American Standard. So it ignores the existence of any other country and alphabet.
- Characters, Symbols and the Unicode Miracle - Computerphile
- GeeCON 2019: Tomasz Nurkiewicz - CharBusters - 10 Unicode Myths - my talk about Unicode slides are available here
- Unicode® Chart: Total Characters by Year
Be the first to listen to new episodes!
To get exclusive content:
- Unedited, longer content
- More extra materials to learn
- Your user voice ideas are prioritized