Teaching and Learning with Technology

Computing With Accents and Foreign Scripts

Skip Menu

Arabic

This Page

  1. About the Arabic Script
  2. Windows Utilities for Arabic
  3. Macintosh Utilities for Arabic
  4. Browser and Font Recommendations
  5. Web Development
    1. Language Code: ar (Arabic)
    2. See additional country code variants
    3. See additional ISO-639-3 language code
  6. Colloquial Arabic Forms
  7. Links
  8. Typing Right-To-Left (RTL) Languages in Word for Windows New Page
  9. Typing Right-To-Left (RTL) Languages on the Mac New Page
  10. Arabic Unicode Chart New Page

Other Languages Using the Arabic Script

  1. Farsi/Persian
  2. Kurdish
  3. Pashto
  4. Urdu & Sindhi
  5. Turkic (Uzbek/Ottoman Turkish)
  6. Maltese Arabic (Roman alphabet)

About the Arabic Script

The Arabic script has two features which make it unique in terms of encoding. One is that it is written from right to left (or RTL). The other feature is that the shapes of individual letters change forms depending on whether the letter is alone, at the beginning of a word, the middle of a word or at the end.

In order to process Arabic correctly, a software must be able to display text from right to left and make sure the letter forms are displayed correctly depending on their positions within a word. Unfortunately, there is incomplete implementation of creating correct letter forms in many software packages.

Other Languages in Arabic Script

The Arabic script is used for other languages besides Arabic, but their local alphabets may include special characters for sounds not found in Arabic. Supporting these languages may require special fonts for extra keyboards and special keyboards.

See the following pages for more details.

Top of Page

Windows Utilities for Arabic

Basic Setup

In order to integrate foreign scripts into your computer, you must set up "keyboard" or input utilities in your operating system. These utilities will allow you to switch between typing English and other languages in word processors and Web tools. This process will also make sure the correct fonts are installed and available on your operating system.

See instructions for Setting up Keyboards for details.

Microsoft Keyboard Utilities

Microsoft provides a variety of free keyboard utlities, but they must be installed from the disk, then activated from the Regional Control Panel. Both standard and AZERTY layouts are available.

In AZERTY layouts, the Arabic letters mapped similar to a French keyboard (where in French keyboards, the upper right key is A, not Q).

Step 1 - Install Utilities

Student Computing Labs - The utilities are installed in the University Park Student Computing Labs, but students must install the utlities by going to the Start menu then International Language Support » Microsoft » Office Microsoft Office Asian Character Input Support.

Home Computers - Several Asian and Middle Eastern keyboards are available in Windows, but you may have to install it from the Windows System disk because it is a complex script. After that you can activate the keyboards from the Regional Control Panel.

See Windows East Asian and Complex Script Keyboards for detailed instructions with screen captures.

Step 2 - Activate from Control Panel

Once the keyboards have been installed, they must be activated in the Regional Control Panel. Read the summary instructions below or go to East Asian and Complex Scripts Keyboards for detailed instructions with screen captures.

  1. Go to Start then Control Panels then Regional and Language Options. Follow the instructions for Activating the Language Bar
  2. While in the Regional and Language Options control panel, click on the Languages tab, then the Details button.
  3. Click the Add button and select the appropriate language from the Input Language pull down menu.
  4. Click the OK buttons until you have exited the control panels - this will save the changes in your Profile.
  5. Open any Windows application then make sure the Language Bar menu on top is set to the correct language.

Outside the Lab

Microsoft provides an Arabic keyboard, but it may need to be installed from the Windows System disk. See the Windows Complex Scripts Keyboard Instructions for details on how to activate the keyboard. To see where the critical keys are, go to the Microsoft Keyboard Layouts Page.

Vista Instructions

See the Microsoft Arabic Support Page for instructions for Vista with screencaptures.

RTL in Word

Right to Left Typing in Word for Windows

See instructions for configuring right to left typing in Word for Windows for tips on how to type RTL languages.

Windows Vista Updates

Microsoft has developed new fonts and utilites for Arabic. See the Microsoft Arabic Support page for additional details.

Top of Page

Macintosh Utilites for Arabic

Both Arabic and QWERTY Arabic keyboard are available in Macintosh. On a QWERTY keyboard, Arabic letters are arranged in a manner similar to the Western QWERTY keyboard. See instructions below for information on how to activate the utlities.

Student Computing Labs - Many language keyboards have been activated in the labs and are available through the flag menu on the upper right. Skip to Step #4 in the instructions below.

Home Computers - A variety of keyboards are available from Apple, but you may have to install it from the Macintosh System disk then they can be activated through the International System Preferences. See details below.

To Use Keyboards

  1. Go to the Apple menu and open Systems Preferences.
  2. Click the U.N. flag icon on the first row of the Systems Preferences panel which is either the Language & Text settings (System 10.6/Snow Leopard) or the International settings (System 10.2-10.5).
  3. Click the tab for Input Sources (System 10.6/Snow Leopard), Input Menu (OS X 10.5-3) or the Keyboard Menu (OS X 10.2) tab and check the keyboards you wish to be activated.
  4. Close the System Preferences window.
  5. Open a software application such as a word processor, spread-sheet or any other application in which you need to enter text.
  6. On the upper right portion of the screen, click on the American flag icon (U.S. Flag Icon). Use the dropdown menu to select a script or language.
  7. The keyboard will be switched and an appropriate font will be selected within the application. A flag icon corresponding to the keyboard will be displayed on the upper right.
  8. To switch back to the U.S. keyboard or to some other keyboard, click on the flag icon on the upper right and select a keyboard from the dropdown menu.

See the Macintosh Keyboard Activation for complete instructions with screen captures.

Additional Macintosh RTL Tips

See tips for creating Mac Right-to-Left documents (including alternatives to Microsoft Office) for more information.

Top of Page

Browser and Font Recommendations

Test Sites

If you have your browser configured correctly, the Web sites above should display the correct characters. If you have difficulties, see list below for font and browser configuration instructions.

www.linux4arab.com/

Fonts by Platform

Additional freeware fonts can be downloaded from the sites below.

See also

Recommended Browsers

Browsers which fully support Unicode are strongly recommended. Click link in list to view configuration instructions. You will be asked to match a script with a font.

Manually Switch Encoding

If you see Roman character gibberish instead of Arabic you will need to manually switch from Western encoding view to the Arabic or Unicode encoding under the View menu of your browser.

Top of Page

Web Development

Arabic Encoding and Language Tags

See Using Encoding and Language Codes for more information on the meaning and implementation of these codes.

Suggested Encodings

Selected Language Tags

See Using Encoding and Language Codes for more information on the meaning and implementation of these codes.

Inputting and Editing Text in an HTML Editor

One option is to use Dreamweaver, Microsoft Expression or other Web editor and change the keyboard to the correct script. This will allow you to type content in directly with the appropriate script. However, it is important to verify that the correct encoding is specified in the Web page header.

Another option is to compose the basic text in an international or foreign language text editor or word processor and export the content as an HTML or text file with the appropriate encoding. This file could be opened in another HTML editor such as Dreamweaver or Microsoft Expression, and edited for formatting.

Other Web Tools

For Web tools such as Blogs at Penn State, Facebook, Twitter, del.icio.us, Flicker, and others, users can typically change the keyboard and input text. In most cases, this content will be encoded as Unicode.

Specifying Text Direction

Some HTML editors set the direction of the text automatically. but it can also be set manually by using the newer <dir> and <bdo> attributes. See the Right-to-Left Alignment Tips page for more details.

PDF and Image Files

In some cases, your best options may be to use PDF files or image files. See the Web Development Tips section for more details.

Using Encoding and Language Codes

Computers process text by assuming a certain encoding or a system of matching electronic data with visual text characters. Whenever you develop a Web site you need to make sure the proper encoding is specified in the header tags; otherwise the browser may default to U.S. settings and not display the text properly.

To declare an encoding, insert or inspect the following meta-tag at the top of your HTML file, then replace "???" with one of the encoding codes listed above. If you are not sure, use utf-8 as the encoding.

Generic Encoding Template

<head>
<meta http-equiv="Content-Type" content="text/html; charset=??? ">
...
<head>

Declare Unicode

<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8 ">
...
<head>

XHTML

The final close slash must be included after the final quote mark in the encoding header tag if you are using XHTML

Declare Unicode in XHTML

<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
...
<head>

No Encoding Declared

If no encoding is declared, then the browser uses the default setting, which in the U.S. is typically Latin-1. In that case many Unicode characters could be displayed incorrectly. Also, older browsers such as Netscape 4.7 may not be able to process the entity codes correctly without the "utf-8" declaration.

Language Tags

Language tags are also suggested so that search engines and screen readers parse the language of a page. These are metadata tags which indicate the language of a page, not devices to trigger translation. Visit the Language Tag page to view information on where to insert it.

Top of Page

Colloquial Arabic Forms

About Colloquial Arabic

Although most educated speakers learn to read, write and speak in the same Modern Standard Arabic or fusha (a form similar to the Classical Arabic used in the Koran), the spoken or colloquial varieties from each region vary widely. The variation is enough that linguists classify the totality of these forms as a series of closely related languages.

Colloquial Forms include

Maltese

One local form which is written is Maltese Arabic from the island of Malta. Malta is formally a part of the European Union and uniquely uses the Western Roman alphabet. It's grammar has diverged further than some of the other forms of Arabic, but shares similarities with North African Arabic forms.

See the Maltese page for additional information writing the accents.

Language Codes

The use of Colloquial Arabic in formal writing is used for most formal publications, but can be found in some television dramas, local poems and local plays. If you are transcribing one of these forms, then it is recommended that one of the following language tags be used.

By Country Code

The following codes, as implemented by Microsoft, assume there is one variety of Arabic per country. The advantage of these codes is that there are utilities such as spell checkers which may be available, however there are colloquial forms which do not conform to national boundaries.

Note: The Subsaharan African codes have not been implemented by Microsoft.

ISO-639-3 "Dialect" Language Codes

These ISO-639-3 codes from SIL codes are meant to match linguistic classification of the Colloquial Arabic forms and do not always match national boundaries.

Colloquial Arabic "Dialect" Codes"
Name Code Location
Algerian arq Algeria
Algerian Saharan aao Algeria (minority language)
Babalia Creole Arabic bbz Chad (minority language)
Baharna Arabic abv Bahrain
Chadian Arabic shu Chad
Cypriot Arabic acy Cyprus (minority language)
Dhofari Arabic adf Oman (Salala)
Egyptian arz Egypt
Egyptian (Eastern) Bedawi avl Sinai (Bedouin), Gaza, Red Sea coast
Gulf Arabic afb Persian Gulf (Bahrain, Iran, Kuwait, Oman, Qatar, Saudi Arabia, United Arab Emirates, Yemen. Zubair & Fau Peninsula
Hadrami ayh Yemen, minority language
Hijazi Arabic acw Saudi Arabia, Red Sea coast
Levantine (South) ajp Jordan, Palestine, Israel, Kuwait
Levantine (North) apc Syria, Lebanon
Mesopotamian acm Iraq, eastern Jordan/Syria, western Iran, Turkey
Mesopotamian (North) ayp Tigris & Euphrates north of Baghdad
Moroccan ary Morocco
Nadji ars Saudi Arabia
Omani Arabic acx Oman, Hajar Mountains
Saidi Arabic aec Southern Egypt
Sanaani ayn Northern Yemen
Standard arb Formal written standard
Sudanese apd Sudan
Sudanese Creole Arabic pga Southern Sudan (minority language)
Tai'izzi-Adeni acq Yemen, except east
Tajiki Arabic abh Afghanistan (minority language)
Tunisian Arabic aeb Tunisia
Uzbeki Arabic auz Uzbekistan (Central Asia), minority language

 

Links

Arabic Computing

Windows

Macintosh

Mobile

Unix and Linux

Encodings

Arabic Script

Script Basics

Additional Arabic Script Fonts

Create Right To Left Web Pages

These Web sites provide additional information on developing Right-to-Left Pages.

Top of Page

Last Modified: Tuesday, 04-Jun-2013 12:39:37 EDT