Encoding on the Internet
Although 256 characters can support most Western European languages, it is not enough to handle non-Roman characters or languages with non-standard Roman characters. Therefore, other 8-bit encodings were developed for languages outside Western Europe.
To accommodate both English and the other script, many 8-bit encodings are structured as follows:
|Arabic||ISO-8859-6* (rarely used)||
*External links to Wikipedia
On the Internet, if you switch the encoding View of your browser (View » Character Set/Encoding) for an English site, in most cases, you will still see English because the encoding supports it.
Because non-Roman encodings include ASCII, if you switch to a properly encoded font in word-processor font and begin to type, you will see English characters. It is not until you switch your keyboard, that the non-Roman letters appear.
For many scripts, there is a competing Windows encoding standard and a non-Windows standard, typically one registered at the ISO as an ISO-8859-x set. For instance Hebrew Web pages can be encoded as either ISO-8859-8 ("Visual Hebrew") or as Windows-1255.
|Hebrew||ISO-8859-8 ("Visual Hebrew")||Windows-1255|
|Central Europe||ISO-8859-2 ("Latin 2")||Windows-1250|
These are links which show the specifications for different encoding systems and the languages they are associated with. However, most languages can also be encoded as Unicode (utf-8).
NOTE: "C.P." (Codepage) is the same as "Windows". For instance CP1252 is Windows-1252.
©Penn State University, 2000-2013.
This Web page maintained by Teaching and Learning with Technology, a unit of Information Technology Services. For questions or comments on this Web page, please contact Elizabeth J. Pyatt (firstname.lastname@example.org).
This site uses Unicode to display non-English characters. This site is best viewed in the most recent versions of your browser.
Unicode character names and hexadecimal entity codes are taken from the public Unicode Character Charts.
This publication is available in alternate media upon request.
Last Modified: Tuesday, 04-Jun-2013 12:41:29 EDT