HTML Character Codes

Character codes (eg ASCII, ISO8859-1, Unicode) can be specified in decimal or hex notation. Similarly, HTML numeric character references may take two forms:

  • The syntax "&#D;", where D stands for a decimal number.
  • The syntax "&#xH;" or "&#XH;", where H stands for a hexadecimal number. Hexadecimal numbers in numeric character references are not case-sensitive.

HTML character entity references use symbolic names so that authors need not remember HTML character codes. It's most likely easier to remember & than & for the ampersand (&) sign.

HTML does not define a character entity reference for every character. Also character entity references are case-sensitive. Thus, Å refers to a different character than å.

The HTML character codes and names on this page are drawn from Character entity references in HTML 4, The HTML Coded Character Set, and ISO Latin 1 Character Entity Set.

Quick jump to:

ISO8859-1 Latin Characters

This group is for characters with codes up to decimal 255, and include the ASCII character codes. The set of Latin-1 entities was extended according to RFC1866. Names of entities are taken from ISO8879 (SGML - Standard Generalized Markup Language).

Show Table

 

UCS (Universal Multiple-Octet Coded Character Set) Characters

This group comprises of characters defined in ISO10646. ISO10646 defines both the character codes and the character entity references. ISO10646 names do not clash with ISO8879 entity names.

Mathematical Symbols/Characters

Show Table

 

General Characters

Show Table

 

Greek Characters

Show Table

 

Internationalisation characters

Show Table

 

Back to top