The following feature is required to be implemented for this class in
accordance with the specification given in $TEXT which is a sub-type of the class $ENUMS{SCRIPTS} :-
This enumeration provides values which may be given the
interpretation indicated :-
Arabic - this value models all of the various
forms of Arabic character except for the presentation forms which
are allocated as a separate script (see below). It is used for writing,
among others, Arabic, Persian, Urdu, Pashto, Sindhi and Kurdish.
Arabic_Presentation - this value models the
very fancy form of writing religious and other important documents in
Arabic communities. See also the everyday Arabic script above.
Armenian - this value models the forms of
character used in Armenian speaking countries, including Armenia
itself.
Bengali - this value models the characters used
in the language of the same name. In addition this script is used to write
Assamese and a number of other languages - such as Daphla, Garo, Hallam,
Khasi, etc in northern India.
Bopomofo - this value models the characters
used to teach/describe the phonetic sounds of all forms of Chinese
ideograph; in mainland China it has been generally superceded by the use
of PinYin, based on latin letters with diacritics.
Canadian_Aboriginal - this value models the
unified form of script for writing languages used widely in Canada. It
was originally invented for the Algonquian language family, but is now
used for Inuktitut and Athapascan language families.
Cherokee - this value models the written script
used principally for the Cherokee language - a member of the Iroquian
language family.
CJK_Unified - this value models all of the
characters called - in China Hanzi - in Japan Kanji, in
Korea Hanza. The currently standardised ideographs do not yet
include older forms still found in many documents. See also the Yi value
below. There are many other script forms for the many minority languages
in China - but these have yet to be standardised.
Cyrillic - this value models the forms of
character used in Russia and Slavic languages. It has been extended to
write some non-Slavic language used in the former Soviet Union.
Devanagari - this value is used throughout the
Indian sub-continent for writing Hindi, Sanskrit and over twenty other
languages including Nepali, Bihari, Jaipuri and Santali.
Ethiopic - this value models the script used
for writing the Semitic language Ge'ez (now limited principally to
liturgical use) and has been extended to write Amharic, Tigre, Oromo and
other languages in the central east Africa region.
Georgian - this value models the forms of
character used in the Asian republic of Georgia.
Greek - this value models the forms of
character used in Greek speaking cultures, including some additions and
variants only used in some of these eg in Coptic (although this may
eventually be treated as a separete script.
Gujarati - this value models the written form
of the language used in the state of Gujarat in India.
Gurmukhi - this value models the written form
of the language Punjabi used in the state of Punjab in India.
Hangul - this value models a superset of the
Johab syllables used in writing Korean.
Hangul_Jamo - this value models the basic Jamo
consonant and vowel letters of which the pre-composed syllable form is
referred to above as Hangul.
Hebrew - this value models the characters used
in cultures where Hebrew is spoken and written. It is also used when
writing Yiddiah, Ladino and a number of other Middle-Eastern
languages.
Hiragana - this value models the forms used in
writing both Japanese language inflections and writing Japanese word
pronunciations.
IPA - this value models the forms of character
which form the International Phonetic Alphabet extension to the basic
Latin character set.
Kannada - this value models the south Indian
script used to write the Kannada (Kanarese) language as well as a number
of regional minority langauges such as Tulu.
Katakana - this value models the phonetic forms
of writing mainly foreign names used in Japan.
Khmer - this value models the official language
of the same name used in Kampuchea; there is a number of dialect variants
which can also be written using this script.
Lao - this value models the written form of
characters used in Laos.
Latin - this value models the forms of
alphabetic scripts used in many European countries, some African extension
characters, US/Canada(en),Australia, New Zealand and others.
Malayalam - this value models the character
form used in writing the language of the same name in Kerala state in
India..
Mongolian - this value models the script used
in writing the language of the same name and also for writing several
other Altaic languages.
Myanmar - this value models the written form of
the Burmese language - as well as Shan, Mon, Pali, Sanskrit and other
regional languages.
Ogham - this value models the written form used
for Irish and Pictish inscriptions. It is no longer in regular use.
Oriya - this value models the written form of
the language of the same name used in Orisda state in India as well some
additional languages of the region including Khondi and Santali.
Runic - this value models the script
historically used to write the languages of early/mediaeval languages in
German, Scandinavian and Anglo-Saxon areas of Europe. It is the first
historical script to be defined in the standard.
Sinhala - this value models the written form of
the language of the same name - the majority language in Sri Lanka. It
may also be used in writing Pali and Sanskrit.
Syriac - this value models the written script
used liturgically in many Middle Easter and South-east Indian communities.
It is also the principal form of written/spoken communication for several
Aramaic languages.
Tamil - this value models the characters used
in the language of the same name used in Tamil Nadu state in India and
parts of Sri Lanka, Singapore and patts of Malaysia. It is also used for
several minority languages such as Badaga.
Telugu - this value models the written form of
the language of the same name used in Andhra Pradesh state in India, as
well as being used for several minority languages such as Gondi and
Lambadi.
Thaana - this value models the written form of
the Dhivehi language used in the Republic of the Maldives. It is also
used to write Arabic, particularly dates.
Thai - this value models the form of script
used in writing Thai and a number of other South-east Asian languages such
as Kuy, Lavna and Pali.
Tibetan - this value models the script form
used in writing Tibetan in several countries and regions around the
Himalayas. It is also the written form used in writing the language
Dzongkha in Bhutan.
Yi - this value models the syllabary used in
writing the Yi language (a Sino-Tibetan language) in China. It has
recently been declared an official language in China.
Numerics - this value models those characters
with the numeric property which are script independent.
Future Script standardisation
International experts are in the course of defining encodings
for the characters/pictographs of the following scripts - Avestan,
Blissymbolics, Buhid, Cham, Cirth, Coptic, Cypriot, Hanunoo, Kirat, Lepcha,
Manipuri, Moso, Old_Hungarian, Old_Permic, Pahawh, Pollard, Shavian,
South_Arabian, Soyombo, Tagalog, Tagbanwa, Tengwar, Tai_Lu, Tai_Mau, Tifinagh,
TlhIngan_Hol.
The following scripts are awaiting resources before they can be added to
the standard - Ahom, Aramaic, Balinese, Balti, Batak, Brahmi, Buginese, Chola,
Cypro-Minoan, Glagolitic, Egyptian_HieroGlyphs, Hittite_Hieroglyphs, Javanese,
Kaithi, Kawi, Khamti, Kharoshthi, Khotanese, Lahndu, Linear_B, Mardaic,
Mangyan, Meroitic, Modi, Numidian, Pahlavi, Phugs_Pa, Pyu, Persian_Cuneiform,
Phoenician, Satavahana, Siddham, Sumero_Akkadian_Cuneiform, Tircul,
Ugaritic_Cuneiform - among others.