Teaching and Learning with Technology

Computing With Accents and Foreign Scripts

Skip Menu

Unicode Numeric Codes

Thanks to Michael Czepiel for his technical input.

If you need to only insert a few special symbols or non-English words onto a mostly English page, you may find that you can insert it with a special entity code using the Unicode codepoint.

Page Contents

  1. What's a Codepoint?
  2. Finding a Codepoint
  3. Convert Codepoint to HTML Entity Code
  4. Hex Calculators

What's a Code Point?

A codepoint is the numeric position of a character within an encoding system. Or more simply put, it's the number that a computer can translate into a particular character.

In Unicode, code points are given in the form of U+ a four or six hexadecimal (Base 16) number. For example, the codepoint for capital A macron (Ā) is U+0100.

You can read more about the structure of codepoints and hexadecimal numbers in the Encoding Tutorial.

Finding a Codepoint

Codepoint lists can be found at Unicode and other locations. Here is a partial list of sites

Convert Codepoint to HTML Code

Below are some steps for finding and inserting a numeric code.

1. Look Up the Character Code

The Unicode Consortium and other sources provide Unicode charts.

Determine if the number is Hexadecimal or Decimal

Either hexadecimal (base 16) or decimal numbers (Base 10) can be used, but it is important to know which version the source uses. Please note.

2. Place in entity code template

Hexadecimal Template.

The hexadecimal template is &#x<hex-number>; All codes begin with "&#" and end with a semicolon (;). A hexadecimal number is signaled by the use of "x" after the "#" sign.

Since the codepoint for capital A macron (Ā) is U+0100, the entity code would be &‌#x100; or &‌#x0100; (either version works in many browsers).

The Decimal template

The decimal template is &#<dec-number>; without the x. However, you must make sure that your code is in a Base 10 numbner.

The hex number x100 for Ā is 16 cubed (163) or 16 × 16 × 16 or 256. The decimal entity code for capital A macron (Ā) is &‌#256;.

Example Entity Codes

Entity Codes for Latin Capital Long A with Macron
Capital A with Macron (Ā) Numeric Code HTML Entity Code Result
Hexadecimal x100 &‌#x0100; or &‌#x100;
Ā
Decimal 256 &‌#256; Ā

 

3. Declare Unicode encoding on the page 

Use the following code to declare that your HTML page is using

<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
...
<head>

4. Test on Multiple Browsers/Platforms

Browsers and content platforms may use prefer different forms of an entity code, so it's wise to test a code in different platforms.

Also, so codepoint can be displayed unless the browser has access to the correct font. See the Languages page for information about each script and language.

Hexadecimal Number Conversion

The following tools are available to convert between hexadecimal and decimal numbers.

Windows Calculator App

You can use the free Calculator utility to convert hexadecimal number to decimal numbers. To use this tool:

  1. Open the Calculator utility from the Start menu.
  2. Under the view options, switch from Standard to Programmer (Win 7/10) or Scientific (XP) to reveal more functions.
  3. Input a number and use the Dec and Hex controls to convert numbers back and forth.

Macintosh Calculator App

The Calculator application includes hex conversion and Unicode conversion calculator.

  1. Open the Calculator utility in the Applications folder.
  2. Under the View menu, switch from Basic to Programming (Command+3) to reveal conversion functions.
  3. Input a number and use the Dec and Hex toggles buttons to convert numbers back and forth.
  4. Click the Unicode button to display its equivalent Unicode glyph.

Top of Page

Last Modified: Tuesday, 28-Mar-2017 14:58:41 EDT