HTML Character Sets


HTML Character Sets


https://www.w3schools.com/charsets/default.asp

ASCII : https://www.w3schools.com/charsets/ref_html_ascii.asp
ISO-8859-1 : https://www.w3schools.com/charsets/ref_html_8859.asp
UTF-8 : https://www.w3schools.com/charsets/ref_html_utf8.asp

https://www.w3schools.com/charsets/ref_html_symbols.asp
https://www.w3schools.com/charsets/ref_html_ansi.asp



The Difference Between Unicode and UTF-8



Unicode is a character set. UTF-8 is encoding.
Unicode is a list of characters with unique decimal numbers (code points). A = 65, B = 66, C = 67, ....
This list of decimal numbers represent the string "hello": 104 101 108 108 111
Encoding is how these numbers are translated into binary numbers to be stored in a computer:
UTF-8 encoding will store "hello" like this (binary): 01101000 01100101 01101100 01101100  01101111

Comments