Computing With Accents and Foreign Scripts

Encoding on the Internet

7: Unicode

What is Unicode?

Many encoding standards have been developed and implemented for individual scripts, but ideally a single encoding scheme would cover all the scripts of the world in a standard fashion.

Unicode (www.unicode.org) is a global encoding scheme which seeks to include every character of every script in a single system. Unicode 4 includes most current national scripts and many CJK (Chinese, Japanese, Korean) characters, but the most recent versions of the standard may not be incorporated into all software packages.
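To make the idea of one encoding for all scripts concrete, here is a minimal Python sketch. It uses only the standard library; the sample characters and the helper name describe are chosen for illustration and are not part of the original page. Each character, whatever its script, has its own Unicode code point and name, and can be stored as UTF-8 bytes.

```python
import unicodedata

# Illustrative sample characters (an assumption, not from the original page):
# Latin, accented Latin, Greek, Cyrillic and a CJK ideograph.
SAMPLES = ["A", "\u00e9", "\u03a9", "\u044f", "\u4e2d"]


def describe(ch):
    """Return (code point label, Unicode name, UTF-8 bytes) for one character."""
    code_point = f"U+{ord(ch):04X}"    # e.g. U+00E9
    name = unicodedata.name(ch)        # official Unicode character name
    utf8_bytes = ch.encode("utf-8")    # the bytes actually stored or sent
    return code_point, name, utf8_bytes


for ch in SAMPLES:
    code_point, name, utf8_bytes = describe(ch)
    hex_bytes = " ".join(f"{b:02X}" for b in utf8_bytes)
    print(ch, code_point, name, hex_bytes)
```

Characters from different scripts never share a code point, so a single document can mix them freely. Whether they display correctly still depends on the fonts and software in use.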

Unicode is structured as follows:


Unicode allows:

Test Pages and Progress

The most recent operating systems support Unicode, although not all software does. Font and software support for Unicode is still being developed, but you can see some Unicode test pages at:

See the Unicode Setup page for more information on viewing Unicode pages.
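If you would like a small test page of your own, the following Python sketch builds one. It is only an illustration: the sample text, the function name build_test_page and the output file name unicode_test.html are all assumptions, not part of the pages listed on this site. The page declares its encoding as UTF-8 and is saved as UTF-8, so a browser with suitable fonts should display each row correctly.

```python
import html

# Hypothetical sample rows (label, text); replace with the scripts you need.
SAMPLES = [
    ("Latin with accents", "caf\u00e9, se\u00f1or"),
    ("Greek", "\u03a9\u03bc\u03ad\u03b3\u03b1"),
    ("Cyrillic", "\u041f\u0440\u0438\u0432\u0435\u0442"),
    ("CJK", "\u4e2d\u6587"),
]


def build_test_page(samples):
    """Return the text of a minimal HTML page showing each sample in a table row."""
    rows = "\n".join(
        f"<tr><td>{html.escape(label)}</td><td>{html.escape(text)}</td></tr>"
        for label, text in samples
    )
    return (
        "<!DOCTYPE html>\n<html>\n<head>\n"
        '<meta charset="utf-8">\n'
        "<title>Unicode test page</title>\n"
        "</head>\n<body>\n<table>\n"
        f"{rows}\n"
        "</table>\n</body>\n</html>\n"
    )


if __name__ == "__main__":
    # Save as UTF-8 so the bytes on disk match the declared charset.
    with open("unicode_test.html", "w", encoding="utf-8") as f:
        f.write(build_test_page(SAMPLES))
```

If a row appears as empty boxes or question marks when the file is opened in a browser, the usual cause is a missing font for that script rather than a problem with the encoding.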

Unicode Resources

Unicode Operating Systems

Advanced Unicode

