Script (Unicode)

In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems.[1] Some scripts support one and only one writing system and language, for example, Armenian. Other scripts support many different writing systems; for example, the Latin script supports English, French, German, Italian, Vietnamese, Latin itself, and several other languages. Some languages make use of multiple alternate writing systems, and thus also use several scripts; for example, in Turkish, the Arabic script was used before the 20th century, but transitioned to Latin in the early part of the 20th century. For a list of languages supported by each script see the list of languages by writing system. More or less complementary to scripts are symbols and Unicode control characters.

Armenian script

The unified diacritical characters and unified punctuation characters frequently have the "common" or "inherited" script property. However, the individual scripts often have their own punctuation and diacritics, so that many scripts include not only letters, but also diacritic and other marks, punctuation, numerals and even their own idiosyncratic symbols and space characters.

Unicode 13.0 defines 154 separate scripts, including 91 modern scripts and 63 ancient or historic scripts.[2][3] More scripts are in the process for encoding or have been tentatively allocated for encoding in roadmaps.[4]

Definition and classification

When multiple languages make use of the same script, there are frequently some differences: particularly in diacritics and other marks. For example, Swedish and English both use the Latin script. However, Swedish includes the character ‘å’ (sometimes called a "Swedish O") while English has no such character. Nor does English make use of the diacritic combining circle above for any character. In general the languages sharing the same scripts share many of the same characters. Despite these peripheral differences in the Swedish and English writing systems they are said to use the same Latin script. So the Unicode abstraction of scripts is a basic organizing technique. The differences between different alphabets or writing systems remain and are supported through Unicode’s flexible scripts, combining marks and collation algorithms.

Script versus writing system

"Writing system" is sometimes treated as a synonym for script. However it also can be used as the specific concrete writing system supported by a script. For example, the Vietnamese writing system is supported by the Latin script. A writing system may also cover more than one script, for example the Japanese writing system makes use of the Han, Hiragana and Katakana scripts.

Most writing systems can be broadly divided into several categories: logographic, syllabic, alphabetic (or segmental), abugida, abjad and featural; however, all features of any of these may be found in any given writing system in varying proportions, often making it difficult to purely categorize a system. The term complex system is sometimes used to describe those where the admixture makes classification problematic.

Unicode supports all of these types of writing systems through its numerous scripts. Unicode also adds further properties to characters to help differentiate the various characters and the ways they behave within Unicode text processing algorithms.

Special script property values

In addition to explicit or specific script properties Unicode uses three special values:[5]

Common: Unicode can assign a character in the UCS to a single script only. However, many characters — those that are not part of a formal natural language writing system or are unified across many writing systems may be used in more than one script. For example, currency signs, symbols, numerals and punctuation marks. In these cases Unicode defines them as belonging to the "common" script (ISO 15924 code "Zyyy").
Inherited: Many diacritics and non-spacing combining characters may be applied to characters from more than one script. In these cases Unicode assigns them to the "inherited" script (ISO 15924 code Zinh), which means that they have the same script class as the base character with which they combine, and so in different contexts they may be treated as belonging to different scripts. For example, U+0308 ̈ COMBINING DIAERESIS may combine with either U+0065 e LATIN SMALL LETTER E to create a Latin "ë", or with U+0435 е CYRILLIC SMALL LETTER IE for the Cyrillic "ё". In the former case it inherits the Latin script of the base character whereas in the latter case it inherits the Cyrillic script of the base character.
Unknown: The value of "unknown" script (ISO 15924 code Zzzz) is given to unassigned, private use, noncharacter, and surrogate code points.

Character categories within scripts

Unicode provides a general category property for each character. So in addition to belonging to a script every character also has a general category. Typically scripts include letter characters including: uppercase letters, lowercase letter and modifier letters. Some characters are considered titlecase letters for a few precomposed ligatures such as ǲ (U+01F2). Such titlecase ligatures are all in the Latin and Greek scripts and are all compatibility characters, and therefore Unicode discourages their use by authors. It is unlikely that new titlecase letters will be added in the future.

Most writing systems do not differentiate between uppercase and lowercase letters. For those scripts all letters are categorized as "other letter" or "modifier letter". Ideographs such as Unihan ideographs are also categorized as "other letters". A few scripts do differentiate between uppercase and lowercase however: Latin, Cyrillic, Greek, Armenian, Georgian, and Deseret. Even for these scripts there are some letters that are neither uppercase nor lowercase.

Scripts can also contain any other general category character such as marks (diacritic and otherwise), numbers (numerals), punctuation, separators (word separators such as spaces), symbols and non-graphical format characters. These are included in a particular script when they are unique to that script. Other such characters are generally unified and included in the punctuation or diacritic blocks. However, the bulk of characters in any script (other than the common and inherited scripts) are letters.

List of scripts in Unicode

Unicode defines over a hundred script names (called "Alias" or "Property value alias"), based on the ISO 15924 list. Unicode uses the "Common" script name for ISO 15924's Zyyy (code for undetermined script), "Inherited" for ISO 15924's Zinh (code for inherited script), and "Unknown" for ISO 15924's Zzzz (code for uncoded script). Not used are, among others, the ISO 15924 script codes: Zsym (Symbols) and Zmth (Mathematical notation). These are considered not to be scripts in Unicode sense.

Scripts in ISO 15924^[a]^[b] and in Unicode^[c]^[d]
ISO 15924				Script in Unicode^[e]
Code	ISO formal name	Directionality	Unicode Alias^[f]	Version	Characters	Notes	Description

Adlm	Adlam	R-to-L	Adlam	9.0	88		Ch 19.9
Afak	Afaka	Varies	ZZ— Not in Unicode, proposal is explored[6]
Aghb	Caucasian Albanian	L-to-R	Caucasian Albanian	7.0	53	Ancient/historic	Ch 8.10
Ahom	Ahom, Tai Ahom	L-to-R	Ahom	8.0	58	Ancient/historic	Ch 15.15
Arab	Arabic	R-to-L	Arabic	1.0	1,291		Ch 9.2
Aran	Arabic (Nastaliq variant)	Mixed	ZZ— Typographic variant of Arabic (§ Arab)
Armi	Imperial Aramaic	R-to-L	Imperial Aramaic	5.2	31	Ancient/historic	Ch 10.4
Armn	Armenian	L-to-R	Armenian	1.0	96		Ch 7.6
Avst	Avestan	R-to-L	Avestan	5.2	61	Ancient/historic	Ch 10.7
Bali	Balinese	L-to-R	Balinese	5.0	121		Ch 17.3
Bamu	Bamum	L-to-R	Bamum	5.2	657		Ch 19.6
Bass	Bassa Vah	L-to-R	Bassa Vah	7.0	36	Ancient/historic	Ch 19.7
Batk	Batak	L-to-R	Batak	6.0	56		Ch 17.6
Beng	Bengali (Bangla)	L-to-R	Bengali	1.0	96		Ch 12.2
Bhks	Bhaiksuki	L-to-R	Bhaiksuki	9.0	97	Ancient/historic	Ch 14.3
Blis	Blissymbols	Varies	ZZ— Not in Unicode, proposal is explored[6]
Bopo	Bopomofo	L-to-R	Bopomofo	1.0	77		Ch 18.3
Brah	Brahmi	L-to-R	Brahmi	6.0	109	Ancient/historic	Ch 14.1
Brai	Braille	L-to-R	Braille	3.0	256		Ch 21.1
Bugi	Buginese	L-to-R	Buginese	4.1	30		Ch 17.2
Buhd	Buhid	L-to-R	Buhid	3.2	20		Ch 17.1
Cakm	Chakma	L-to-R	Chakma	6.1	71		Ch 13.11
Cans	Unified Canadian Aboriginal Syllabics	L-to-R	Canadian Aboriginal	3.0	710		Ch 20.2
Cari	Carian	L-to-R	Carian	5.1	49	Ancient/historic	Ch 8.4
Cham	Cham	L-to-R	Cham	5.1	83		Ch 16.10
Cher	Cherokee	L-to-R	Cherokee	3.0	172		Ch 20.1
Chrs	Chorasmian	Mixed	Chorasmian	13.0	28	Ancient/historic	Ch 10.8
Cirt	Cirth	Varies	ZZ— Not in Unicode
Copt	Coptic	L-to-R	Coptic	1.0	137	Ancient/historic, Disunified from Greek in 4.1	Ch 7.3
Cpmn	Cypro-Minoan	L-to-R	ZZ— Not in Unicode
Cprt	Cypriot syllabary	R-to-L	Cypriot	4.0	55	Ancient/historic	Ch 8.3
Cyrl	Cyrillic	L-to-R	Cyrillic	1.0	443	Includes typographic variant Old Church Slavonic (§ Cyrs)	Ch 7.4
Cyrs	Cyrillic (Old Church Slavonic variant)	Varies	ZZ— Typographic variant of Cyrillic (§ Cyrl)			Ancient/historic
Deva	Devanagari (Nagari)	L-to-R	Devanagari	1.0	154		Ch 12.1
Diak	Dives Akuru	L-to-R	Dives Akuru	13.0	72	Ancient/historic	Ch 15.14
Dogr	Dogra	L-to-R	Dogra	11.0	60	Ancient/historic	Ch 15.17
Dsrt	Deseret (Mormon)	L-to-R	Deseret	3.1	80		Ch 20.4
Dupl	Duployan shorthand, Duployan stenography	L-to-R	Duployan	7.0	143		Ch 21.5
Egyd	Egyptian demotic	Mixed	ZZ— Not in Unicode
Egyh	Egyptian hieratic	Mixed	ZZ— Not in Unicode
Egyp	Egyptian hieroglyphs	L-to-R	Egyptian Hieroglyphs	5.2	1,080	Ancient/historic	Ch 11.4
Elba	Elbasan	L-to-R	Elbasan	7.0	40	Ancient/historic	Ch 8.9
Elym	Elymaic	R-to-L	Elymaic	12.0	23	Ancient/historic	Ch 10.9
Ethi	Ethiopic (Geʻez)	L-to-R	Ethiopic	3.0	495		Ch 19.1
Geok	Khutsuri (Asomtavruli and Nuskhuri)	L-to-R	Georgian			Unicode groups "Khutsori", "Asomtavruli" and "Nuskhuri" into 'Georgian' (§ Geok). Also "Mkhedruli" and "Mtavruli" are 'Georgian' (§ Geor)	Ch 7.7
Geor	Georgian (Mkhedruli and Mtavruli)	L-to-R	Georgian	1.0	173	In Unicode, also includes Geok (Nuskhuri)	Ch 7.7
Glag	Glagolitic	L-to-R	Glagolitic	4.1	132	Ancient/historic	Ch 7.5
Gong	Gunjala Gondi	L-to-R	Gunjala Gondi	11.0	63		Ch 13.15
Gonm	Masaram Gondi	L-to-R	Masaram Gondi	10.0	75		Ch 13.14
Goth	Gothic	L-to-R	Gothic	3.1	27	Ancient/historic	Ch 8.8
Gran	Grantha	L-to-R	Grantha	7.0	85	Ancient/historic	Ch 15.13
Grek	Greek	L-to-R	Greek	1.0	518	Directionality sometimes as boustrophedon	Ch 7.2
Gujr	Gujarati	L-to-R	Gujarati	1.0	91		Ch 12.4
Guru	Gurmukhi	L-to-R	Gurmukhi	1.0	80		Ch 12.3
Hanb	Han with Bopomofo (alias for Han + Bopomofo)	Varies	ZZ— See § Hani, § Bopo
Hang	Hangul (Hangŭl, Hangeul)	L-to-R	Hangul	1.0	11,739	Hangul syllables relocated in 2.0	Ch 18.6
Hani	Han (Hanzi, Kanji, Hanja)	L-to-R	Han	1.0	94,204		Ch 18.1
Hano	Hanunoo (Hanunóo)	L-to-R	Hanunoo	3.2	21		Ch 17.1
Hans	Han (Simplified variant)	Varies	ZZ— Subset of Han (Hanzi, Kanji, Hanja) (§ Hani)
Hant	Han (Traditional variant)	Varies	ZZ— Subset of § Hani
Hatr	Hatran	R-to-L	Hatran	8.0	26	Ancient/historic	Ch 10.12
Hebr	Hebrew	R-to-L	Hebrew	1.0	134		Ch 9.1
Hira	Hiragana	L-to-R	Hiragana	1.0	379		Ch 18.4
Hluw	Anatolian Hieroglyphs (Luwian Hieroglyphs, Hittite Hieroglyphs)	L-to-R	Anatolian Hieroglyphs	8.0	583	Ancient/historic	Ch 11.6
Hmng	Pahawh Hmong	L-to-R	Pahawh Hmong	7.0	127		Ch 16.11
Hmnp	Nyiakeng Puachue Hmong	L-to-R	Nyiakeng Puachue Hmong	12.0	71		Ch 16.12
Hrkt	Japanese syllabaries (alias for Hiragana + Katakana)	Varies	Katakana or Hiragana			See § Hira, § Kana	Ch 18.4
Hung	Old Hungarian (Hungarian Runic)	R-to-L	Old Hungarian	8.0	108	Ancient/historic	Ch 8.7
Inds	Indus (Harappan)	Mixed	ZZ— Not in Unicode, proposal is explored[6]
Ital	Old Italic (Etruscan, Oscan, etc.)	L-to-R	Old Italic	3.1	39	Ancient/historic	Ch 8.5
Jamo	Jamo (alias for Jamo subset of Hangul)	Varies	ZZ— Subset of § Hang
Java	Javanese	L-to-R	Javanese	5.2	90		Ch 17.4
Jpan	Japanese (alias for Han + Hiragana + Katakana)	Varies	ZZ— See § Hani, § Hira and § Kana
Jurc	Jurchen	L-to-R	ZZ— Not in Unicode
Kali	Kayah Li	L-to-R	Kayah Li	5.1	47		Ch 16.9
Kana	Katakana	L-to-R	Katakana	1.0	304		Ch 18.4
Khar	Kharoshthi	R-to-L	Kharoshthi	4.1	68	Ancient/historic	Ch 14.2
Khmr	Khmer	L-to-R	Khmer	3.0	146		Ch 16.4
Khoj	Khojki	L-to-R	Khojki	7.0	62	Ancient/historic	Ch 15.7
Kitl	Khitan large script	L-to-R	ZZ— Not in Unicode
Kits	Khitan small script	T-to-B	Khitan Small Script	13.0	471	Ancient/historic	Ch 18.12
Knda	Kannada	L-to-R	Kannada	1.0	89		Ch 12.8
Kore	Korean (alias for Hangul + Han)	L-to-R	ZZ— See § Hani, § Hang
Kpel	Kpelle	L-to-R	ZZ— Not in Unicode, proposal is explored[6]
Kthi	Kaithi	L-to-R	Kaithi	5.2	67	Ancient/historic	Ch 15.2
Lana	Tai Tham (Lanna)	L-to-R	Tai Tham	5.2	127		Ch 16.7
Laoo	Lao	L-to-R	Lao	1.0	82		Ch 16.2
Latf	Latin (Fraktur variant)	Varies	ZZ— Typographic variant of Latin (§ Latn)
Latg	Latin (Gaelic variant)	L-to-R	ZZ— Typographic variant of Latin (§ Latn)
Latn	Latin	L-to-R	Latin	1.0	1,374	See also: Latin script in Unicode	Ch 7.1
Leke	Leke	L-to-R	ZZ— Not in Unicode
Lepc	Lepcha (Róng)	L-to-R	Lepcha	5.1	74		Ch 13.12
Limb	Limbu	L-to-R	Limbu	4.0	68		Ch 13.6
Lina	Linear A	L-to-R	Linear A	7.0	341	Ancient/historic	Ch 8.1
Linb	Linear B	L-to-R	Linear B	4.0	211	Ancient/historic	Ch 8.2
Lisu	Lisu (Fraser)	L-to-R	Lisu	5.2	49		Ch 18.9
Loma	Loma	L-to-R	ZZ— Not in Unicode, proposal is explored[6]
Lyci	Lycian	L-to-R	Lycian	5.1	29	Ancient/historic	Ch 8.4
Lydi	Lydian	R-to-L	Lydian	5.1	27	Ancient/historic	Ch 8.4
Mahj	Mahajani	L-to-R	Mahajani	7.0	39	Ancient/historic	Ch 15.6
Maka	Makasar	L-to-R	Makasar	11.0	25	Ancient/historic	Ch 17.8
Mand	Mandaic, Mandaean	R-to-L	Mandaic	6.0	29		Ch 9.5
Mani	Manichaean	R-to-L	Manichaean	7.0	51	Ancient/historic	Ch 10.5
Marc	Marchen	L-to-R	Marchen	9.0	68	Ancient/historic	Ch 14.5
Maya	Mayan hieroglyphs	Mixed	ZZ— Not in Unicode
Medf	Medefaidrin (Oberi Okaime, Oberi Ɔkaimɛ)	L-to-R	Medefaidrin	11.0	91		Ch 19.10
Mend	Mende Kikakui	R-to-L	Mende Kikakui	7.0	213		Ch 19.8
Merc	Meroitic Cursive	R-to-L	Meroitic Cursive	6.1	90	Ancient/historic	Ch 11.5
Mero	Meroitic Hieroglyphs	R-to-L	Meroitic Hieroglyphs	6.1	32	Ancient/historic	Ch 11.5
Mlym	Malayalam	L-to-R	Malayalam	1.0	118		Ch 12.9
Modi	Modi, Moḍī	L-to-R	Modi	7.0	79	Ancient/historic	Ch 15.11
Mong	Mongolian	T-to-B	Mongolian	3.0	167	Mong includes Clear and Manchu scripts	Ch 13.5
Moon	Moon (Moon code, Moon script, Moon type)	Mixed	ZZ— Not in Unicode, proposal is explored[6]
Mroo	Mro, Mru	L-to-R	Mro	7.0	43		Ch 13.8
Mtei	Meitei Mayek (Meithei, Meetei)	L-to-R	Meetei Mayek	5.2	79		Ch 13.7
Mult	Multani	L-to-R	Multani	8.0	38	Ancient/historic	Ch 15.9
Mymr	Myanmar (Burmese)	L-to-R	Myanmar	3.0	223		Ch 16.3
Nand	Nandinagari	L-to-R	Nandinagari	12.0	65	Ancient/historic	Ch 15.12
Narb	Old North Arabian (Ancient North Arabian)	R-to-L	Old North Arabian	7.0	32	Ancient/historic	Ch 10.1
Nbat	Nabataean	R-to-L	Nabataean	7.0	40	Ancient/historic	Ch 10.10
Newa	Newa, Newar, Newari, Nepāla lipi	L-to-R	Newa	9.0	97		Ch 13.3
Nkdb	Naxi Dongba (na²¹ɕi³³ to³³ba²¹, Nakhi Tomba)	L-to-R	ZZ— Not in Unicode
Nkgb	Nakhi Geba (na²¹ɕi³³ gʌ²¹ba²¹, 'Na-'Khi ²Ggŏ-¹baw, Nakhi Geba)	L-to-R	ZZ— Not in Unicode, proposal is explored[6]
Nkoo	N’Ko	R-to-L	NKo	5.0	62		Ch 19.4
Nshu	Nüshu	L-to-R	Nushu	10.0	397		Ch 18.8
Ogam	Ogham	Mixed	Ogham	3.0	29	Ancient/historic	Ch 8.12
Olck	Ol Chiki (Ol Cemet’, Ol, Santali)	L-to-R	Ol Chiki	5.1	48		Ch 13.10
Orkh	Old Turkic, Orkhon Runic	R-to-L	Old Turkic	5.2	73	Ancient/historic	Ch 14.8
Orya	Oriya (Odia)	L-to-R	Oriya	1.0	91		Ch 12.5
Osge	Osage	L-to-R	Osage	9.0	72		Ch 20.3
Osma	Osmanya	L-to-R	Osmanya	4.0	40		Ch 19.2
Ougr	Old Uyghur	Mixed	ZZ— Not in Unicode
Palm	Palmyrene	R-to-L	Palmyrene	7.0	32	Ancient/historic	Ch 10.11
Pauc	Pau Cin Hau	L-to-R	Pau Cin Hau	7.0	57		Ch 16.13
Pcun	Proto-Cuneiform	L-to-R	ZZ— Not in Unicode
Pelm	Proto-Elamite	L-to-R	ZZ— Not in Unicode
Perm	Old Permic	L-to-R	Old Permic	7.0	43	Ancient/historic	Ch 8.11
Phag	Phags-pa	T-to-B	Phags-pa	5.0	56	Ancient/historic	Ch 14.4
Phli	Inscriptional Pahlavi	R-to-L	Inscriptional Pahlavi	5.2	27	Ancient/historic	Ch 10.6
Phlp	Psalter Pahlavi	R-to-L	Psalter Pahlavi	7.0	29	Ancient/historic	Ch 10.6
Phlv	Book Pahlavi	Mixed	ZZ— Not in Unicode
Phnx	Phoenician	R-to-L	Phoenician	5.0	29	Ancient/historic^[g]	Ch 10.3
Piqd	Klingon (KLI pIqaD)	L-to-R	ZZ— Rejected for inclusion in Unicode[7][8]
Plrd	Miao (Pollard)	L-to-R	Miao	6.1	149		Ch 18.10
Prti	Inscriptional Parthian	R-to-L	Inscriptional Parthian	5.2	30	Ancient/historic	Ch 10.6
Psin	Proto-Sinaitic	Mixed	ZZ— Not in Unicode
Qaaa-Qabx	Reserved for private use (range)		ZZ— Not in Unicode
Ranj	Ranjana	L-to-R	ZZ— Not in Unicode
Rjng	Rejang (Redjang, Kaganga)	L-to-R	Rejang	5.1	37		Ch 17.5
Rohg	Hanifi Rohingya	R-to-L	Hanifi Rohingya	11.0	50		Ch 16.14
Roro	Rongorongo	Mixed	ZZ— Not in Unicode, proposal is explored[6]
Runr	Runic	L-to-R	Runic	3.0	86	Ancient/historic	Ch 8.6
Samr	Samaritan	R-to-L	Samaritan	5.2	61		Ch 9.4
Sara	Sarati	Mixed	ZZ— Not in Unicode
Sarb	Old South Arabian	R-to-L	Old South Arabian	5.2	32	Ancient/historic	Ch 10.2
Saur	Saurashtra	L-to-R	Saurashtra	5.1	82		Ch 13.13
Sgnw	SignWriting	T-to-B	SignWriting	8.0	672		Ch 21.6
Shaw	Shavian (Shaw)	L-to-R	Shavian	4.0	48		Ch 8.13
Shrd	Sharada, Śāradā	L-to-R	Sharada	6.1	96		Ch 15.3
Shui	Shuishu	L-to-R	ZZ— Not in Unicode
Sidd	Siddham, Siddhaṃ, Siddhamātṛkā	L-to-R	Siddham	7.0	92	Ancient/historic	Ch 15.5
Sind	Khudawadi, Sindhi	L-to-R	Khudawadi	7.0	69		Ch 15.8
Sinh	Sinhala	L-to-R	Sinhala	3.0	111		Ch 13.2
Sogd	Sogdian	R-to-L	Sogdian	11.0	42	Ancient/historic	Ch 14.10
Sogo	Old Sogdian	R-to-L	Old Sogdian	11.0	40	Ancient/historic	Ch 14.9
Sora	Sora Sompeng	L-to-R	Sora Sompeng	6.1	35		Ch 15.16
Soyo	Soyombo	L-to-R	Soyombo	10.0	83	Ancient/historic	Ch 14.7
Sund	Sundanese	L-to-R	Sundanese	5.1	72		Ch 17.7
Sylo	Syloti Nagri	L-to-R	Syloti Nagri	4.1	45	Ancient/historic	Ch 15.1
Syrc	Syriac	R-to-L	Syriac	3.0	88	Includes variants Estrangelo (§ Syre), Western (§ Syrj), and Eastern (§ Syrn)	Ch 9.3
Syre	Syriac (Estrangelo variant)	Mixed	ZZ— Typographic variant of Syriac (§ Syrc)
Syrj	Syriac (Western variant)	Mixed	ZZ— Typographic variant of Syriac (§ Syrc)
Syrn	Syriac (Eastern variant)	Mixed	ZZ— Typographic variant of Syriac (§ Syrc)
Tagb	Tagbanwa	L-to-R	Tagbanwa	3.2	18		Ch 17.1
Takr	Takri, Ṭākrī, Ṭāṅkrī	L-to-R	Takri	6.1	67		Ch 15.4
Tale	Tai Le	L-to-R	Tai Le	4.0	35		Ch 16.5
Talu	New Tai Lue	L-to-R	New Tai Lue	4.1	83		Ch 16.6
Taml	Tamil	L-to-R	Tamil	1.0	123		Ch 12.6
Tang	Tangut	L-to-R	Tangut	9.0	6,914	Ancient/historic	Ch 18.11
Tavt	Tai Viet	L-to-R	Tai Viet	5.2	72		Ch 16.8
Telu	Telugu	L-to-R	Telugu	1.0	98		Ch 12.7
Teng	Tengwar	L-to-R	ZZ— Not in Unicode
Tfng	Tifinagh (Berber)	L-to-R	Tifinagh	4.1	59		Ch 19.3
Tglg	Tagalog (Baybayin, Alibata)	L-to-R	Tagalog	3.2	20		Ch 17.1
Thaa	Thaana	R-to-L	Thaana	3.0	50		Ch 13.1
Thai	Thai	L-to-R	Thai	1.0	86		Ch 16.1
Tibt	Tibetan	L-to-R	Tibetan	2.0	207	Added in 1.0, removed in 1.1 and reintroduced in 2.0	Ch 13.4
Tirh	Tirhuta	L-to-R	Tirhuta	7.0	82		Ch 15.10
Toto	Toto	L-to-R	ZZ— Not in Unicode
Ugar	Ugaritic	L-to-R	Ugaritic	4.0	31	Ancient/historic	Ch 11.2
Vaii	Vai	L-to-R	Vai	5.1	300		Ch 19.5
Visp	Visible Speech	L-to-R	ZZ— Not in Unicode
Wara	Warang Citi (Varang Kshiti)	L-to-R	Warang Citi	7.0	84		Ch 13.9
Wcho	Wancho	L-to-R	Wancho	12.0	59		Ch 13.16
Wole	Woleai	Mixed	ZZ— Not in Unicode, proposal is explored[6]
Xpeo	Old Persian	L-to-R	Old Persian	4.1	50	Ancient/historic	Ch 11.3
Xsux	Cuneiform, Sumero-Akkadian	L-to-R	Cuneiform	5.0	1,234	Ancient/historic	Ch 11.1
Yezi	Yezidi	R-to-L	Yezidi	13.0	47	Ancient/historic	Ch 9.6
Yiii	Yi	L-to-R	Yi	3.0	1,220		Ch 18.7
Zanb	Zanabazar Square (Zanabazarin Dörböljin Useg, Xewtee Dörböljin Bicig, Horizontal Square Script)	L-to-R	Zanabazar Square	10.0	72	Ancient/historic	Ch 14.6
Zinh	Code for inherited script	Inherited	Inherited		573
Zmth	Mathematical notation	L-to-R	ZZ— Not a 'script' in Unicode
Zsym	Symbols		ZZ— Not a 'script' in Unicode
Zsye	Symbols (emoji variant)		ZZ— Not a 'script' in Unicode
Zxxx	Code for unwritten documents		ZZ— Not a 'script' in Unicode
Zyyy	Code for undetermined script	Mixed	Common		8,087
Zzzz	Code for uncoded script		Unknown		970,188	In Unicode: All other code points
Notes ^ ISO 15924 publications As of 25 January 2021 ^ ISO 15924 Normative text file As of 25 January 2021 ^ ISO 15924 Changes (including Aliases for Unicode; as of 25 January 2021) ^ Unicode version 13.0 ^ Unicode charts ^ Unicode uses the "Property Value Alias" (Alias) as the script-name. These Alias names are part of Unicode and are published informatively next to ISO 15924. An alias script name may be used in a character name: `Palm`, Palmyrene → U+10860 𐡠 PALMYRENE LETTER ALEPH. ^ In Unicode, the Phoenician script is intended for the representation of text in Paleo-Hebrew, Archaic Phoenician, Phoenician, Early Aramaic, Late Phoenician cursive, Phoenician papyri, Siloam Hebrew, Hebrew seals, Ammonite, Moabite, and Punic.[9]

References

"Glossary". unicode.org.
"Unicode Character Database: Scripts". unicode.org.
"Chapter 14: Additional Ancient and Historic Scripts". The Unicode Standard, Version 6.2 (PDF). Mountain View, CA: Unicode, Inc. September 2012. p. 473. ISBN 978-1-936213-07-8.
https://www.unicode.org/roadmaps/ Roadmaps to Unicode
"UAX #24: Unicode Script Property". www.unicode.org.
"Proposed New Scripts". Unicode Consortium. 2018-05-25. Retrieved 2019-09-12.
Michael Everson (1997-09-18). "Proposal to encode Klingon in Plane 1 of ISO/IEC 10646-2".
The Unicode Consortium (2001-08-14). "Approved Minutes of the UTC 87 / L2 184 Joint Meeting".
"Middle East-II, Ancient Scripts" (PDF). 13.0.0. The Unicode Consortiumtitle=Middle-East scripts II. Retrieved 2021-01-28.

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[cnote_a_grp_ISO_Unicode] 
ISO 15924 publications As of 25 January 2021

[cnote_b_grp_ISO_list] 
ISO 15924 Normative text file As of 25 January 2021

[cnote_c_grp_ISO_changes] 
ISO 15924 Changes (including Aliases for Unicode; as of 25 January 2021)

[cnote_d_grp_Asof_Unicode_version] 
Unicode version 13.0

[cnote_e_grp_Unicode_charts] 
Unicode charts

[cnote_f_grp_Aliases_for_Unicode] 
Unicode uses the "Property Value Alias" (Alias) as the script-name. These Alias names are part of Unicode and are published informatively next to ISO 15924. An alias script name may be used in a character name: Palm, Palmyrene → U+10860 𐡠 PALMYRENE LETTER ALEPH.

[cnote_g_grp_Scripts] 
In Unicode, the Phoenician script is intended for the representation of text in Paleo-Hebrew, Archaic Phoenician, Phoenician, Early Aramaic, Late Phoenician cursive, Phoenician papyri, Siloam Hebrew, Hebrew seals, Ammonite, Moabite, and Punic.[9]

[1] "Glossary". unicode.org.

[2] "Unicode Character Database: Scripts". unicode.org.

[3] "Chapter 14: Additional Ancient and Historic Scripts". The Unicode Standard, Version 6.2 (PDF). Mountain View, CA: Unicode, Inc. September 2012. p. 473. ISBN 978-1-936213-07-8.

[4] ttps://www.unicode.org/roadmaps/ Roadmaps to Unicode

[Unicode_script_property-5] "UAX #24: Unicode Script Property". www.unicode.org.

[uniproposed-6] "Proposed New Scripts". Unicode Consortium. 2018-05-25. Retrieved 2019-09-12.

[7] Michael Everson (1997-09-18). "Proposal to encode Klingon in Plane 1 of ISO/IEC 10646-2".

[8] The Unicode Consortium (2001-08-14). "Approved Minutes of the UTC 87 / L2 184 Joint Meeting".

[9] "Middle East-II, Ancient Scripts" (PDF). 13.0.0. The Unicode Consortiumtitle=Middle-East scripts II. Retrieved 2021-01-28.

[a]

[b]

[c]

[d]

[e]

[f]

[g]

Writing systems
Index of language articles
Overview	History of writing History of the alphabet Graphemes Scripts in Unicode
Lists	Writing systems Languages by writing system / by first written account Undeciphered writing systems Creators of writing systems
Types	Featural Alphabets Abjads Alphasyllabaries / Abugidas Semi-syllabaries Syllabaries Ideogrammic Pictographic Logographic Numeral