This page focuses on languages other than Russian which are written in the Cyrillic script. Please see the Russian page for information on Russian language utilities.
Not surprisingly, many languages that were part of the former Soviet Union are written in Cyrillic or use Cyrillic as one of the possible available scripts. In terms of internationalization, the languages can be divided into two classes.
These include languages related to Russian like Ukrainian, Belarussian, Bulgarian, Macedonian and others. Support for these languages is generally available on both Windows and Mac.
These included non-Slavic languages like Uzbek, Kyrghiz, Azer, Tatar, and others. Many of these are actually Turkic languages (related to Turkish) or other languages not related to Russian and use unique Cyrillic letters not commonly needed for the Slavic languages. Some languages in this region can be written in either Cyrillic, the Roman alphabet or even the Arabic script depending on the location of a particular speaker community.
Keyboard support for several Central Asian Cyrillic languages is available on Windows, but are not well-supported on the Macintosh. However, encoded fonts for the Mac are available and some third-party keyboarding utilities may be available for older systems of Mac.
Follow the instructions for Activating Keyboard Locales to activate and switch Microsoft keyboards to the appropriate layout. Keyboards for both Eastern European and Central Asian languages are available.
Follow the instructions for Activating Macintosh Keyboards to activate and switch Macintosh keyboards. You need to have the Central European items installed. If they are not available on your system, you should reinstall them from a System CD.
The fonts are installed for Mac OS X , but no keyboards are available from Apple at this time. One suggestion is to type most content with one of the Eastern European language Keyboards, then use the Character Pallete to input special Central Asian Cyrillic characters.
Note: This works only for Unicode aware editors such as TextEdit, Micorsoft Office 2004 and Nisus Writer Express.
Please note which fonts are needed for each platform before viewing instructions to configure your browsers in the Preferences or Tools menu. Most browsers are recommended, but older browsers like Netscape 4.7 may need more adjustments.
Fonts with links are freeware fonts can be downloaded and placed on Windows or Mac OS X . These fonts are designed for medieval or ancient scholars.
These fonts include many supplementary Cyrillic and historic characters
Browsers which fully support Unicode are strongly recommended. Click link in list to view configuration instructions. You will be asked to match a script with a font.
If you have all the utilities set up correctly, then your browser should be able to see the following Cyrillic test pages correctly.
If you see some unusual letters instead of the appropriate Cyrillic letters, you will need to manually switch from Western encoding to one of the Cyrillic encodings or Unicode under the View menu of your browser.
These are the codes which allow browsers and screen readers to process data as the appropriate language. All letters in codes are lower case.
If you are desigining a new page, Unicode encoding is recommended since it supports all Cyrillic languages.
Computers process text by assuming a certain encoding or a system of matching electronic data with visual text characters. Whenever you develop a Web site you need to make sure the proper encoding is specified in the header tags; otherwise the browser may default to U.S. settings and not display the text properly.
To declare an encoding, insert or inspect the following meta-tag at the top of your HTML file, then replace "???" with one of the encoding codes listed above. If you are not sure, use utf-8 as the encoding.
Generic Encoding Template
<meta http-equiv="Content-Type" content="text/html; charset=??? ">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8 ">
The final close slash must be included after the final quote mark in the encoding header tag if you are using XHTML
Declare Unicode in XHTML
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
If no encoding is declared, then the browser uses the default setting, which in the U.S. is typically Latin-1. Some display errors may occur.
Language tags are also suggested so that search engines and screen readers parse the language of a page. These are metadata tags which indicate the language of a page, not devices to trigger translation. Visit the Language Tag page to view information on where to insert it.
One option is to use Dreamweaver, Microsoft Expression or other Web editor and change the keyboard to the correct script. This will allow you to type content in directly with the appropriate script. However, it is important to verify that the correct encoding is specified in the Web page header.
Another option is to compose the basic text in an international or foreign language text editor or word processor and export the content as an HTML or text file with the appropriate encoding. This file could be opened in another HTML editor such as Dreamweaver or Microsoft Expression, and edited for formatting.
For Web tools such as Blogs at Penn State, Facebook, Twitter, del.icio.us, Flicker, and others, users can typically change the keyboard and input text. In most cases, this content will be encoded as Unicode.
In some cases, your best options may be to use PDF files or image files. See the Web Development Tips section for more details.
University of Arizona Slavic Information Literacy Font Tools - Includes PDAs
These fonts include many supplementary Cyrillic characters
©Penn State University, 2000-2013.
This Web page maintained by Teaching and Learning with Technology, a unit of Information Technology Services. For questions or comments on this Web page, please contact Elizabeth J. Pyatt (email@example.com).
This site uses Unicode to display non-English characters. This site is best viewed in the most recent versions of your browser.
Unicode character names and hexadecimal entity codes are taken from the public Unicode Character Charts.
This publication is available in alternate media upon request.
Last Modified: Tuesday, 04-Jun-2013 12:39:40 EDT