Unicode block

A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new characters are discussed and evaluated by considering the relevant block or blocks as a whole.

Each block is generally, but not always, meant to supply characters used by one or more specific languages, or in some general application area such as mathematics, surveying, decorative typesetting, or social forums.

Design and implementation

Unicode blocks are identified by unique names, which use only ASCII characters and usually describe the nature of the symbols in English, such as "Tibetan" or "Supplemental Arrows-A". (When comparing block names, one is supposed to equate uppercase with lowercase letters, and to ignore any whitespace, hyphens, and underscores; so the last name is equivalent to "supplemental_arrows__a" and "SUPPLEMENTALARROWSA".[1])
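The comparison rule above can be sketched in a few lines of Python. The helper names `normalize_block_name` and `same_block_name` are made up for this illustration and are not part of any Unicode library; the sketch only implements the folding described in the text (case, whitespace, hyphens, underscores).

```python
import re

def normalize_block_name(name):
    """Fold case and drop whitespace, hyphens, and underscores."""
    return re.sub(r"[\s_-]+", "", name).lower()

def same_block_name(a, b):
    """Return True if two spellings refer to the same block name."""
    return normalize_block_name(a) == normalize_block_name(b)

print(same_block_name("Supplemental Arrows-A", "supplemental_arrows__a"))  # True
print(same_block_name("Supplemental Arrows-A", "SUPPLEMENTALARROWSA"))     # True
print(same_block_name("Supplemental Arrows-A", "Supplemental Arrows-B"))   # False
```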

Blocks are pairwise disjoint, that is, they do not overlap. The starting code point and the size (number of code points) of each block are always multiples of 16; therefore, in the hexadecimal notation, the starting (smallest) point is U+xxx0 and the ending (largest) point is U+yyyF, where xxx and yyy are three or more hexadecimal digits. (These constraints are intended to simplify the display of glyphs in Unicode Consortium documents, as tables with 16 columns labeled with the last hexadecimal digit of the code point.[1]) The size of a block may range from the minimum of 16 to a maximum of 65,536 code points.
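As an illustration, the constraints in this paragraph can be checked mechanically. The sketch below uses a handful of ranges copied from the list of blocks in this article; the names `SAMPLE_BLOCKS`, `is_well_formed`, and `are_disjoint` are hypothetical and chosen only for this example.

```python
# (start, end, name) triples copied from the list of blocks in this article.
SAMPLE_BLOCKS = [
    (0x0000, 0x007F, "Basic Latin"),
    (0x0F00, 0x0FFF, "Tibetan"),
    (0x27F0, 0x27FF, "Supplemental Arrows-A"),
    (0xF0000, 0xFFFFF, "Supplementary Private Use Area-A"),
]

def is_well_formed(start, end):
    """Start and size are multiples of 16; size lies between 16 and 65,536."""
    size = end - start + 1
    return start % 16 == 0 and size % 16 == 0 and 16 <= size <= 65536

def are_disjoint(blocks):
    """After sorting by start, each block must end before the next one begins."""
    ordered = sorted(blocks)
    return all(prev[1] < nxt[0] for prev, nxt in zip(ordered, ordered[1:]))

for start, end, name in SAMPLE_BLOCKS:
    print(f"U+{start:04X}..U+{end:04X} {name}: {is_well_formed(start, end)}")
print("disjoint:", are_disjoint(SAMPLE_BLOCKS))
```

Because the start is a multiple of 16 and the size is too, the last hexadecimal digit of the start is always 0 and that of the end is always F, which is what the two modulo tests express.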

Every assigned code point has a property called "Block", whose value is a character string naming the unique block that owns that point.[2] However, a block may also contain unassigned code points, usually reserved for future additions of characters that "logically" should belong to that block. Code points not belonging to any of the named blocks, e.g. in the unassigned planes 4–13, have the value Block="No_Block".[1]
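Python's standard `unicodedata` module does not expose the Block property, so the following is only a minimal sketch of how a lookup could work, assuming a sorted table of block ranges. The table `BLOCKS` here is a small hand-copied excerpt, and `block_of` is a hypothetical helper; with the excerpt, code points in blocks that were left out are also reported as "No_Block", which a full table would not do.

```python
from bisect import bisect_right

# Excerpt of the block list, sorted by starting code point (illustrative only).
BLOCKS = [
    (0x0000, 0x007F, "Basic Latin"),
    (0x0080, 0x00FF, "Latin-1 Supplement"),
    (0x0370, 0x03FF, "Greek and Coptic"),
    (0x0F00, 0x0FFF, "Tibetan"),
    (0x27F0, 0x27FF, "Supplemental Arrows-A"),
]
STARTS = [start for start, _, _ in BLOCKS]

def block_of(code_point):
    """Binary-search the last block starting at or before the code point."""
    i = bisect_right(STARTS, code_point) - 1
    if i >= 0 and code_point <= BLOCKS[i][1]:
        return BLOCKS[i][2]
    return "No_Block"

print(block_of(ord("A")))   # Basic Latin
print(block_of(0x0378))     # Greek and Coptic (an unassigned code point inside the block)
print(block_of(0x40000))    # No_Block (plane 4)
```

Binary search works here because blocks are disjoint: at most one block can start at or before a given code point and still contain it.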

Other classifications

Each Unicode code point also has a property called "General Category", which attempts to describe the role of the corresponding symbol in the languages or applications for whose sake it was included in the system. Examples of General Categories are "Lu" (meaning upper-case letter), "Nd" (decimal digit), "Pi" (open-quote punctuation), and "Mn" (non-spacing mark, i.e. a diacritic for the preceding glyph). This division is completely independent of blocks: the code points with a given General Category generally span many blocks, and do not have to be consecutive, not even within each block.[3]

Each code point also has a Script property, specifying which writing system it is intended for, or whether it is intended for multiple writing systems. This property is also independent of the block.
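The General Category can be inspected with the `unicodedata.category` function from Python's standard library. The short sketch below picks a few sample characters (the selection is arbitrary) to show that one category occurs in several blocks.

```python
import unicodedata

# Sample characters from different blocks: Basic Latin, Greek and Coptic,
# Cyrillic, Arabic, Latin-1 Supplement, Combining Diacritical Marks.
SAMPLES = ["A", "\u03A9", "\u042F", "7", "\u0663", "\u00AB", "\u0301"]

for ch in SAMPLES:
    print(f"U+{ord(ch):04X} {unicodedata.category(ch)}")
```

The three upper-case letters (U+0041, U+03A9, U+042F) all report "Lu" although they live in three different blocks, and the two digits (U+0037, U+0663) both report "Nd".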

In descriptions of the Unicode system, a block may be subdivided into more specific subgroups, such as the "Chess symbols" in the block "Miscellaneous Symbols". Those subgroups are not "blocks" in the technical sense used by the Unicode Consortium, and are named only for the convenience of users.

List of blocks

Unicode 13.0 defines 308 blocks:[1]

  • 163 in plane 0, the Basic Multilingual Plane (BMP)
  • 134 in plane 1, the Supplementary Multilingual Plane (SMP)
  • 6 in plane 2, the Supplementary Ideographic Plane (SIP)
  • 1 in plane 3, the Tertiary Ideographic Plane (TIP)
  • 2 in plane 14 (E in hexadecimal), the Supplementary Special-purpose Plane (SSP)
  • One each in planes 15 (F in hexadecimal) and 16 (10 in hexadecimal), called Supplementary Private Use Area-A and -B
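Since each plane holds 65,536 (2^16) code points, the plane number of a code point is simply its value shifted right by 16 bits. The sketch below shows this and re-adds the per-plane counts from the list above; `plane_of` and `BLOCKS_PER_PLANE` are names invented for this example.

```python
# Block counts per plane, as listed above for Unicode 13.0.
BLOCKS_PER_PLANE = {0: 163, 1: 134, 2: 6, 3: 1, 14: 2, 15: 1, 16: 1}

def plane_of(code_point):
    """Return the plane number (0..16) of a code point."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("not a Unicode code point")
    return code_point >> 16

print(plane_of(0x0041))    # 0  (BMP)
print(plane_of(0x1F600))   # 1  (SMP)
print(plane_of(0xE0001))   # 14 (SSP)
print(plane_of(0x10FFFF))  # 16
print(sum(BLOCKS_PER_PLANE.values()))  # 308
```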
Plane | Block range | Block name | Code points [note 1] | Assigned characters | Scripts [note 2][note 3][note 4][note 5][note 6]
0 BMP | U+0000..U+007F | Basic Latin [note 7] | 128 | 128 | Latin (52 characters), Common (76 characters)
0 BMP | U+0080..U+00FF | Latin-1 Supplement [note 8] | 128 | 128 | Latin (64 characters), Common (64 characters)
0 BMP | U+0100..U+017F | Latin Extended-A | 128 | 128 | Latin
0 BMP | U+0180..U+024F | Latin Extended-B | 208 | 208 | Latin
0 BMP | U+0250..U+02AF | IPA Extensions | 96 | 96 | Latin
0 BMP | U+02B0..U+02FF | Spacing Modifier Letters | 80 | 80 | Bopomofo (2 characters), Latin (14 characters), Common (64 characters)
0 BMP | U+0300..U+036F | Combining Diacritical Marks | 112 | 112 | Inherited
0 BMP | U+0370..U+03FF | Greek and Coptic | 144 | 135 | Coptic (14 characters), Greek (117 characters), Common (4 characters)
0 BMP | U+0400..U+04FF | Cyrillic | 256 | 256 | Cyrillic (254 characters), Inherited (2 characters)
0 BMP | U+0500..U+052F | Cyrillic Supplement | 48 | 48 | Cyrillic
0 BMP | U+0530..U+058F | Armenian | 96 | 91 | Armenian
0 BMP | U+0590..U+05FF | Hebrew | 112 | 88 | Hebrew
0 BMP | U+0600..U+06FF | Arabic | 256 | 255 | Arabic (237 characters), Common (6 characters), Inherited (12 characters)
0 BMP | U+0700..U+074F | Syriac | 80 | 77 | Syriac
0 BMP | U+0750..U+077F | Arabic Supplement | 48 | 48 | Arabic
0 BMP | U+0780..U+07BF | Thaana | 64 | 50 | Thaana
0 BMP | U+07C0..U+07FF | NKo | 64 | 62 | Nko
0 BMP | U+0800..U+083F | Samaritan | 64 | 61 | Samaritan
0 BMP | U+0840..U+085F | Mandaic | 32 | 29 | Mandaic
0 BMP | U+0860..U+086F | Syriac Supplement | 16 | 11 | Syriac
0 BMP | U+08A0..U+08FF | Arabic Extended-A | 96 | 84 | Arabic (83 characters), Common (1 character)
0 BMP | U+0900..U+097F | Devanagari | 128 | 128 | Devanagari (122 characters), Common (2 characters), Inherited (4 characters)
0 BMP | U+0980..U+09FF | Bengali | 128 | 96 | Bengali
0 BMP | U+0A00..U+0A7F | Gurmukhi | 128 | 80 | Gurmukhi
0 BMP | U+0A80..U+0AFF | Gujarati | 128 | 91 | Gujarati
0 BMP | U+0B00..U+0B7F | Oriya | 128 | 91 | Oriya
0 BMP | U+0B80..U+0BFF | Tamil | 128 | 72 | Tamil
0 BMP | U+0C00..U+0C7F | Telugu | 128 | 98 | Telugu
0 BMP | U+0C80..U+0CFF | Kannada | 128 | 89 | Kannada
0 BMP | U+0D00..U+0D7F | Malayalam | 128 | 118 | Malayalam
0 BMP | U+0D80..U+0DFF | Sinhala | 128 | 91 | Sinhala
0 BMP | U+0E00..U+0E7F | Thai | 128 | 87 | Thai (86 characters), Common (1 character)
0 BMP | U+0E80..U+0EFF | Lao | 128 | 82 | Lao
0 BMP | U+0F00..U+0FFF | Tibetan | 256 | 211 | Tibetan (207 characters), Common (4 characters)
0 BMP | U+1000..U+109F | Myanmar | 160 | 160 | Myanmar
0 BMP | U+10A0..U+10FF | Georgian | 96 | 88 | Georgian (87 characters), Common (1 character)
0 BMP | U+1100..U+11FF | Hangul Jamo | 256 | 256 | Hangul
0 BMP | U+1200..U+137F | Ethiopic | 384 | 358 | Ethiopic
0 BMP | U+1380..U+139F | Ethiopic Supplement | 32 | 26 | Ethiopic
0 BMP | U+13A0..U+13FF | Cherokee | 96 | 92 | Cherokee
0 BMP | U+1400..U+167F | Unified Canadian Aboriginal Syllabics | 640 | 640 | Canadian Aboriginal
0 BMP | U+1680..U+169F | Ogham | 32 | 29 | Ogham
0 BMP | U+16A0..U+16FF | Runic | 96 | 89 | Runic (86 characters), Common (3 characters)
0 BMP | U+1700..U+171F | Tagalog | 32 | 20 | Tagalog
0 BMP | U+1720..U+173F | Hanunoo | 32 | 23 | Hanunoo (21 characters), Common (2 characters)
0 BMP | U+1740..U+175F | Buhid | 32 | 20 | Buhid
0 BMP | U+1760..U+177F | Tagbanwa | 32 | 18 | Tagbanwa
0 BMP | U+1780..U+17FF | Khmer | 128 | 114 | Khmer
0 BMP | U+1800..U+18AF | Mongolian | 176 | 157 | Mongolian (154 characters), Common (3 characters)
0 BMP | U+18B0..U+18FF | Unified Canadian Aboriginal Syllabics Extended | 80 | 70 | Canadian Aboriginal
0 BMP | U+1900..U+194F | Limbu | 80 | 68 | Limbu
0 BMP | U+1950..U+197F | Tai Le | 48 | 35 | Tai Le
0 BMP | U+1980..U+19DF | New Tai Lue | 96 | 83 | New Tai Lue
0 BMP | U+19E0..U+19FF | Khmer Symbols | 32 | 32 | Khmer
0 BMP | U+1A00..U+1A1F | Buginese | 32 | 30 | Buginese
0 BMP | U+1A20..U+1AAF | Tai Tham | 144 | 127 | Tai Tham
0 BMP | U+1AB0..U+1AFF | Combining Diacritical Marks Extended | 80 | 17 | Inherited
0 BMP | U+1B00..U+1B7F | Balinese | 128 | 121 | Balinese
0 BMP | U+1B80..U+1BBF | Sundanese | 64 | 64 | Sundanese
0 BMP | U+1BC0..U+1BFF | Batak | 64 | 56 | Batak
0 BMP | U+1C00..U+1C4F | Lepcha | 80 | 74 | Lepcha
0 BMP | U+1C50..U+1C7F | Ol Chiki | 48 | 48 | Ol Chiki
0 BMP | U+1C80..U+1C8F | Cyrillic Extended-C | 16 | 9 | Cyrillic
0 BMP | U+1C90..U+1CBF | Georgian Extended | 48 | 46 | Georgian
0 BMP | U+1CC0..U+1CCF | Sundanese Supplement | 16 | 8 | Sundanese
0 BMP | U+1CD0..U+1CFF | Vedic Extensions | 48 | 43 | Common (16 characters), Inherited (27 characters)
0 BMP | U+1D00..U+1D7F | Phonetic Extensions | 128 | 128 | Cyrillic (2 characters), Greek (15 characters), Latin (111 characters)
0 BMP | U+1D80..U+1DBF | Phonetic Extensions Supplement | 64 | 64 | Greek (1 character), Latin (63 characters)
0 BMP | U+1DC0..U+1DFF | Combining Diacritical Marks Supplement | 64 | 63 | Inherited
0 BMP | U+1E00..U+1EFF | Latin Extended Additional | 256 | 256 | Latin
0 BMP | U+1F00..U+1FFF | Greek Extended | 256 | 233 | Greek
0 BMP | U+2000..U+206F | General Punctuation | 112 | 111 | Common (109 characters), Inherited (2 characters)
0 BMP | U+2070..U+209F | Superscripts and Subscripts | 48 | 42 | Latin (15 characters), Common (27 characters)
0 BMP | U+20A0..U+20CF | Currency Symbols | 48 | 32 | Common
0 BMP | U+20D0..U+20FF | Combining Diacritical Marks for Symbols | 48 | 33 | Inherited
0 BMP | U+2100..U+214F | Letterlike Symbols | 80 | 80 | Greek (1 character), Latin (4 characters), Common (75 characters)
0 BMP | U+2150..U+218F | Number Forms | 64 | 60 | Latin (41 characters), Common (19 characters)
0 BMP | U+2190..U+21FF | Arrows | 112 | 112 | Common
0 BMP | U+2200..U+22FF | Mathematical Operators | 256 | 256 | Common
0 BMP | U+2300..U+23FF | Miscellaneous Technical | 256 | 256 | Common
0 BMP | U+2400..U+243F | Control Pictures | 64 | 39 | Common
0 BMP | U+2440..U+245F | Optical Character Recognition | 32 | 11 | Common
0 BMP | U+2460..U+24FF | Enclosed Alphanumerics | 160 | 160 | Common
0 BMP | U+2500..U+257F | Box Drawing | 128 | 128 | Common
0 BMP | U+2580..U+259F | Block Elements | 32 | 32 | Common
0 BMP | U+25A0..U+25FF | Geometric Shapes | 96 | 96 | Common
0 BMP | U+2600..U+26FF | Miscellaneous Symbols | 256 | 256 | Common
0 BMP | U+2700..U+27BF | Dingbats | 192 | 192 | Common
0 BMP | U+27C0..U+27EF | Miscellaneous Mathematical Symbols-A | 48 | 48 | Common
0 BMP | U+27F0..U+27FF | Supplemental Arrows-A | 16 | 16 | Common
0 BMP | U+2800..U+28FF | Braille Patterns | 256 | 256 | Braille
0 BMP | U+2900..U+297F | Supplemental Arrows-B | 128 | 128 | Common
0 BMP | U+2980..U+29FF | Miscellaneous Mathematical Symbols-B | 128 | 128 | Common
0 BMP | U+2A00..U+2AFF | Supplemental Mathematical Operators | 256 | 256 | Common
0 BMP | U+2B00..U+2BFF | Miscellaneous Symbols and Arrows | 256 | 253 | Common
0 BMP | U+2C00..U+2C5F | Glagolitic | 96 | 94 | Glagolitic
0 BMP | U+2C60..U+2C7F | Latin Extended-C | 32 | 32 | Latin
0 BMP | U+2C80..U+2CFF | Coptic | 128 | 123 | Coptic
0 BMP | U+2D00..U+2D2F | Georgian Supplement | 48 | 40 | Georgian
0 BMP | U+2D30..U+2D7F | Tifinagh | 80 | 59 | Tifinagh
0 BMP | U+2D80..U+2DDF | Ethiopic Extended | 96 | 79 | Ethiopic
0 BMP | U+2DE0..U+2DFF | Cyrillic Extended-A | 32 | 32 | Cyrillic
0 BMP | U+2E00..U+2E7F | Supplemental Punctuation | 128 | 83 | Common
0 BMP | U+2E80..U+2EFF | CJK Radicals Supplement | 128 | 115 | Han
0 BMP | U+2F00..U+2FDF | Kangxi Radicals | 224 | 214 | Han
0 BMP | U+2FF0..U+2FFF | Ideographic Description Characters | 16 | 12 | Common
0 BMP | U+3000..U+303F | CJK Symbols and Punctuation | 64 | 64 | Han (15 characters), Hangul (2 characters), Common (43 characters), Inherited (4 characters)
0 BMP | U+3040..U+309F | Hiragana | 96 | 93 | Hiragana (89 characters), Common (2 characters), Inherited (2 characters)
0 BMP | U+30A0..U+30FF | Katakana | 96 | 96 | Katakana (93 characters), Common (3 characters)
0 BMP | U+3100..U+312F | Bopomofo | 48 | 43 | Bopomofo
0 BMP | U+3130..U+318F | Hangul Compatibility Jamo | 96 | 94 | Hangul
0 BMP | U+3190..U+319F | Kanbun | 16 | 16 | Common
0 BMP | U+31A0..U+31BF | Bopomofo Extended | 32 | 32 | Bopomofo
0 BMP | U+31C0..U+31EF | CJK Strokes | 48 | 36 | Common
0 BMP | U+31F0..U+31FF | Katakana Phonetic Extensions | 16 | 16 | Katakana
0 BMP | U+3200..U+32FF | Enclosed CJK Letters and Months | 256 | 255 | Hangul (62 characters), Katakana (47 characters), Common (146 characters)
0 BMP | U+3300..U+33FF | CJK Compatibility | 256 | 256 | Katakana (88 characters), Common (168 characters)
0 BMP | U+3400..U+4DBF | CJK Unified Ideographs Extension A | 6,592 | 6,592 | Han
0 BMP | U+4DC0..U+4DFF | Yijing Hexagram Symbols | 64 | 64 | Common
0 BMP | U+4E00..U+9FFF | CJK Unified Ideographs | 20,992 | 20,989 | Han
0 BMP | U+A000..U+A48F | Yi Syllables | 1,168 | 1,165 | Yi
0 BMP | U+A490..U+A4CF | Yi Radicals | 64 | 55 | Yi
0 BMP | U+A4D0..U+A4FF | Lisu | 48 | 48 | Lisu
0 BMP | U+A500..U+A63F | Vai | 320 | 300 | Vai
0 BMP | U+A640..U+A69F | Cyrillic Extended-B | 96 | 96 | Cyrillic
0 BMP | U+A6A0..U+A6FF | Bamum | 96 | 88 | Bamum
0 BMP | U+A700..U+A71F | Modifier Tone Letters | 32 | 32 | Common
0 BMP | U+A720..U+A7FF | Latin Extended-D | 224 | 180 | Latin (175 characters), Common (5 characters)
0 BMP | U+A800..U+A82F | Syloti Nagri | 48 | 45 | Syloti Nagri
0 BMP | U+A830..U+A83F | Common Indic Number Forms | 16 | 10 | Common
0 BMP | U+A840..U+A87F | Phags-pa | 64 | 56 | Phags Pa
0 BMP | U+A880..U+A8DF | Saurashtra | 96 | 82 | Saurashtra
0 BMP | U+A8E0..U+A8FF | Devanagari Extended | 32 | 32 | Devanagari
0 BMP | U+A900..U+A92F | Kayah Li | 48 | 48 | Kayah Li (47 characters), Common (1 character)
0 BMP | U+A930..U+A95F | Rejang | 48 | 37 | Rejang
0 BMP | U+A960..U+A97F | Hangul Jamo Extended-A | 32 | 29 | Hangul
0 BMP | U+A980..U+A9DF | Javanese | 96 | 91 | Javanese (90 characters), Common (1 character)
0 BMP | U+A9E0..U+A9FF | Myanmar Extended-B | 32 | 31 | Myanmar
0 BMP | U+AA00..U+AA5F | Cham | 96 | 83 | Cham
0 BMP | U+AA60..U+AA7F | Myanmar Extended-A | 32 | 32 | Myanmar
0 BMP | U+AA80..U+AADF | Tai Viet | 96 | 72 | Tai Viet
0 BMP | U+AAE0..U+AAFF | Meetei Mayek Extensions | 32 | 23 | Meetei Mayek
0 BMP | U+AB00..U+AB2F | Ethiopic Extended-A | 48 | 32 | Ethiopic
0 BMP | U+AB30..U+AB6F | Latin Extended-E | 64 | 60 | Latin (56 characters), Greek (1 character), Common (3 characters)
0 BMP | U+AB70..U+ABBF | Cherokee Supplement | 80 | 80 | Cherokee
0 BMP | U+ABC0..U+ABFF | Meetei Mayek | 64 | 56 | Meetei Mayek
0 BMP | U+AC00..U+D7AF | Hangul Syllables | 11,184 | 11,172 | Hangul
0 BMP | U+D7B0..U+D7FF | Hangul Jamo Extended-B | 80 | 72 | Hangul
0 BMP | U+D800..U+DB7F | High Surrogates | 896 | 0 | Unknown
0 BMP | U+DB80..U+DBFF | High Private Use Surrogates | 128 | 0 | Unknown
0 BMP | U+DC00..U+DFFF | Low Surrogates | 1,024 | 0 | Unknown
0 BMP | U+E000..U+F8FF | Private Use Area | 6,400 | 6,400 | Unknown
0 BMP | U+F900..U+FAFF | CJK Compatibility Ideographs | 512 | 472 | Han
0 BMP | U+FB00..U+FB4F | Alphabetic Presentation Forms | 80 | 58 | Armenian (5 characters), Hebrew (46 characters), Latin (7 characters)
0 BMP | U+FB50..U+FDFF | Arabic Presentation Forms-A | 688 | 611 | Arabic (609 characters), Common (2 characters)
0 BMP | U+FE00..U+FE0F | Variation Selectors | 16 | 16 | Inherited
0 BMP | U+FE10..U+FE1F | Vertical Forms | 16 | 10 | Common
0 BMP | U+FE20..U+FE2F | Combining Half Marks | 16 | 16 | Cyrillic (2 characters), Inherited (14 characters)
0 BMP | U+FE30..U+FE4F | CJK Compatibility Forms | 32 | 32 | Common
0 BMP | U+FE50..U+FE6F | Small Form Variants | 32 | 26 | Common
0 BMP | U+FE70..U+FEFF | Arabic Presentation Forms-B | 144 | 141 | Arabic (140 characters), Common (1 character)
0 BMP | U+FF00..U+FFEF | Halfwidth and Fullwidth Forms | 240 | 225 | Hangul (52 characters), Katakana (55 characters), Latin (52 characters), Common (66 characters)
0 BMP | U+FFF0..U+FFFF | Specials | 16 | 5 | Common
1 SMP | U+10000..U+1007F | Linear B Syllabary | 128 | 88 | Linear B
1 SMP | U+10080..U+100FF | Linear B Ideograms | 128 | 123 | Linear B
1 SMP | U+10100..U+1013F | Aegean Numbers | 64 | 57 | Common
1 SMP | U+10140..U+1018F | Ancient Greek Numbers | 80 | 79 | Greek
1 SMP | U+10190..U+101CF | Ancient Symbols | 64 | 14 | Greek (1 character), Common (13 characters)
1 SMP | U+101D0..U+101FF | Phaistos Disc | 48 | 46 | Common (45 characters), Inherited (1 character)
1 SMP | U+10280..U+1029F | Lycian | 32 | 29 | Lycian
1 SMP | U+102A0..U+102DF | Carian | 64 | 49 | Carian
1 SMP | U+102E0..U+102FF | Coptic Epact Numbers | 32 | 28 | Common (27 characters), Inherited (1 character)
1 SMP | U+10300..U+1032F | Old Italic | 48 | 39 | Old Italic
1 SMP | U+10330..U+1034F | Gothic | 32 | 27 | Gothic
1 SMP | U+10350..U+1037F | Old Permic | 48 | 43 | Old Permic
1 SMP | U+10380..U+1039F | Ugaritic | 32 | 31 | Ugaritic
1 SMP | U+103A0..U+103DF | Old Persian | 64 | 50 | Old Persian
1 SMP | U+10400..U+1044F | Deseret | 80 | 80 | Deseret
1 SMP | U+10450..U+1047F | Shavian | 48 | 48 | Shavian
1 SMP | U+10480..U+104AF | Osmanya | 48 | 40 | Osmanya
1 SMP | U+104B0..U+104FF | Osage | 80 | 72 | Osage
1 SMP | U+10500..U+1052F | Elbasan | 48 | 40 | Elbasan
1 SMP | U+10530..U+1056F | Caucasian Albanian | 64 | 53 | Caucasian Albanian
1 SMP | U+10600..U+1077F | Linear A | 384 | 341 | Linear A
1 SMP | U+10800..U+1083F | Cypriot Syllabary | 64 | 55 | Cypriot
1 SMP | U+10840..U+1085F | Imperial Aramaic | 32 | 31 | Imperial Aramaic
1 SMP | U+10860..U+1087F | Palmyrene | 32 | 32 | Palmyrene
1 SMP | U+10880..U+108AF | Nabataean | 48 | 40 | Nabataean
1 SMP | U+108E0..U+108FF | Hatran | 32 | 26 | Hatran
1 SMP | U+10900..U+1091F | Phoenician | 32 | 29 | Phoenician
1 SMP | U+10920..U+1093F | Lydian | 32 | 27 | Lydian
1 SMP | U+10980..U+1099F | Meroitic Hieroglyphs | 32 | 32 | Meroitic Hieroglyphs
1 SMP | U+109A0..U+109FF | Meroitic Cursive | 96 | 90 | Meroitic Cursive
1 SMP | U+10A00..U+10A5F | Kharoshthi | 96 | 68 | Kharoshthi
1 SMP | U+10A60..U+10A7F | Old South Arabian | 32 | 32 | Old South Arabian
1 SMP | U+10A80..U+10A9F | Old North Arabian | 32 | 32 | Old North Arabian
1 SMP | U+10AC0..U+10AFF | Manichaean | 64 | 51 | Manichaean
1 SMP | U+10B00..U+10B3F | Avestan | 64 | 61 | Avestan
1 SMP | U+10B40..U+10B5F | Inscriptional Parthian | 32 | 30 | Inscriptional Parthian
1 SMP | U+10B60..U+10B7F | Inscriptional Pahlavi | 32 | 27 | Inscriptional Pahlavi
1 SMP | U+10B80..U+10BAF | Psalter Pahlavi | 48 | 29 | Psalter Pahlavi
1 SMP | U+10C00..U+10C4F | Old Turkic | 80 | 73 | Old Turkic
1 SMP | U+10C80..U+10CFF | Old Hungarian | 128 | 108 | Old Hungarian
1 SMP | U+10D00..U+10D3F | Hanifi Rohingya | 64 | 50 | Hanifi Rohingya
1 SMP | U+10E60..U+10E7F | Rumi Numeral Symbols | 32 | 31 | Arabic
1 SMP | U+10E80..U+10EBF | Yezidi | 64 | 47 | Yezidi
1 SMP | U+10F00..U+10F2F | Old Sogdian | 48 | 40 | Old Sogdian
1 SMP | U+10F30..U+10F6F | Sogdian | 64 | 42 | Sogdian
1 SMP | U+10FB0..U+10FDF | Chorasmian | 48 | 28 | Chorasmian
1 SMP | U+10FE0..U+10FFF | Elymaic | 32 | 23 | Elymaic
1 SMP | U+11000..U+1107F | Brahmi | 128 | 109 | Brahmi
1 SMP | U+11080..U+110CF | Kaithi | 80 | 67 | Kaithi
1 SMP | U+110D0..U+110FF | Sora Sompeng | 48 | 35 | Sora Sompeng
1 SMP | U+11100..U+1114F | Chakma | 80 | 71 | Chakma
1 SMP | U+11150..U+1117F | Mahajani | 48 | 39 | Mahajani
1 SMP | U+11180..U+111DF | Sharada | 96 | 96 | Sharada
1 SMP | U+111E0..U+111FF | Sinhala Archaic Numbers | 32 | 20 | Sinhala
1 SMP | U+11200..U+1124F | Khojki | 80 | 62 | Khojki
1 SMP | U+11280..U+112AF | Multani | 48 | 38 | Multani
1 SMP | U+112B0..U+112FF | Khudawadi | 80 | 69 | Khudawadi
1 SMP | U+11300..U+1137F | Grantha | 128 | 86 | Grantha (85 characters), Inherited (1 character)
1 SMP | U+11400..U+1147F | Newa | 128 | 97 | Newa
1 SMP | U+11480..U+114DF | Tirhuta | 96 | 82 | Tirhuta
1 SMP | U+11580..U+115FF | Siddham | 128 | 92 | Siddham
1 SMP | U+11600..U+1165F | Modi | 96 | 79 | Modi
1 SMP | U+11660..U+1167F | Mongolian Supplement | 32 | 13 | Mongolian
1 SMP | U+11680..U+116CF | Takri | 80 | 67 | Takri
1 SMP | U+11700..U+1173F | Ahom | 64 | 58 | Ahom
1 SMP | U+11800..U+1184F | Dogra | 80 | 60 | Dogra
1 SMP | U+118A0..U+118FF | Warang Citi | 96 | 84 | Warang Citi
1 SMP | U+11900..U+1195F | Dives Akuru | 96 | 72 | Dives Akuru
1 SMP | U+119A0..U+119FF | Nandinagari | 96 | 65 | Nandinagari
1 SMP | U+11A00..U+11A4F | Zanabazar Square | 80 | 72 | Zanabazar Square
1 SMP | U+11A50..U+11AAF | Soyombo | 96 | 83 | Soyombo
1 SMP | U+11AC0..U+11AFF | Pau Cin Hau | 64 | 57 | Pau Cin Hau
1 SMP | U+11C00..U+11C6F | Bhaiksuki | 112 | 97 | Bhaiksuki
1 SMP | U+11C70..U+11CBF | Marchen | 80 | 68 | Marchen
1 SMP | U+11D00..U+11D5F | Masaram Gondi | 96 | 75 | Masaram Gondi
1 SMP | U+11D60..U+11DAF | Gunjala Gondi | 80 | 63 | Gunjala Gondi
1 SMP | U+11EE0..U+11EFF | Makasar | 32 | 25 | Makasar
1 SMP | U+11FB0..U+11FBF | Lisu Supplement | 16 | 1 | Lisu
1 SMP | U+11FC0..U+11FFF | Tamil Supplement | 64 | 51 | Tamil
1 SMP | U+12000..U+123FF | Cuneiform | 1,024 | 922 | Cuneiform
1 SMP | U+12400..U+1247F | Cuneiform Numbers and Punctuation | 128 | 116 | Cuneiform
1 SMP | U+12480..U+1254F | Early Dynastic Cuneiform | 208 | 196 | Cuneiform
1 SMP | U+13000..U+1342F | Egyptian Hieroglyphs | 1,072 | 1,071 | Egyptian Hieroglyphs
1 SMP | U+13430..U+1343F | Egyptian Hieroglyph Format Controls | 16 | 9 | Egyptian Hieroglyphs
1 SMP | U+14400..U+1467F | Anatolian Hieroglyphs | 640 | 583 | Anatolian Hieroglyphs
1 SMP | U+16800..U+16A3F | Bamum Supplement | 576 | 569 | Bamum
1 SMP | U+16A40..U+16A6F | Mro | 48 | 43 | Mro
1 SMP | U+16AD0..U+16AFF | Bassa Vah | 48 | 36 | Bassa Vah
1 SMP | U+16B00..U+16B8F | Pahawh Hmong | 144 | 127 | Pahawh Hmong
1 SMP | U+16E40..U+16E9F | Medefaidrin | 96 | 91 | Medefaidrin
1 SMP | U+16F00..U+16F9F | Miao | 160 | 149 | Miao
1 SMP | U+16FE0..U+16FFF | Ideographic Symbols and Punctuation | 32 | 7 | Han (2 characters), Khitan Small Script (1 character), Nushu (1 character), Tangut (1 character), Common (2 characters)
1 SMP | U+17000..U+187FF | Tangut | 6,144 | 6,136 | Tangut
1 SMP | U+18800..U+18AFF | Tangut Components | 768 | 768 | Tangut
1 SMP | U+18B00..U+18CFF | Khitan Small Script | 512 | 470 | Khitan small script
1 SMP | U+18D00..U+18D8F | Tangut Supplement | 144 | 9 | Tangut
1 SMP | U+1B000..U+1B0FF | Kana Supplement | 256 | 256 | Hiragana (255 characters), Katakana (1 character)
1 SMP | U+1B100..U+1B12F | Kana Extended-A | 48 | 31 | Hiragana
1 SMP | U+1B130..U+1B16F | Small Kana Extension | 64 | 7 | Hiragana (3 characters), Katakana (4 characters)
1 SMP | U+1B170..U+1B2FF | Nushu | 400 | 396 | Nüshu
1 SMP | U+1BC00..U+1BC9F | Duployan | 160 | 143 | Duployan
1 SMP | U+1BCA0..U+1BCAF | Shorthand Format Controls | 16 | 4 | Common
1 SMP | U+1D000..U+1D0FF | Byzantine Musical Symbols | 256 | 246 | Common
1 SMP | U+1D100..U+1D1FF | Musical Symbols | 256 | 231 | Common (209 characters), Inherited (22 characters)
1 SMP | U+1D200..U+1D24F | Ancient Greek Musical Notation | 80 | 70 | Greek
1 SMP | U+1D2E0..U+1D2FF | Mayan Numerals | 32 | 20 | Common
1 SMP | U+1D300..U+1D35F | Tai Xuan Jing Symbols | 96 | 87 | Common
1 SMP | U+1D360..U+1D37F | Counting Rod Numerals | 32 | 25 | Common
1 SMP | U+1D400..U+1D7FF | Mathematical Alphanumeric Symbols | 1,024 | 996 | Common
1 SMP | U+1D800..U+1DAAF | Sutton SignWriting | 688 | 672 | SignWriting
1 SMP | U+1E000..U+1E02F | Glagolitic Supplement | 48 | 38 | Glagolitic
1 SMP | U+1E100..U+1E14F | Nyiakeng Puachue Hmong | 80 | 71 | Nyiakeng Puachue Hmong
1 SMP | U+1E2C0..U+1E2FF | Wancho | 64 | 59 | Wancho
1 SMP | U+1E800..U+1E8DF | Mende Kikakui | 224 | 213 | Mende Kikakui
1 SMP | U+1E900..U+1E95F | Adlam | 96 | 88 | Adlam
1 SMP | U+1EC70..U+1ECBF | Indic Siyaq Numbers | 80 | 68 | Common
1 SMP | U+1ED00..U+1ED4F | Ottoman Siyaq Numbers | 80 | 61 | Common
1 SMP | U+1EE00..U+1EEFF | Arabic Mathematical Alphabetic Symbols | 256 | 143 | Arabic
1 SMP | U+1F000..U+1F02F | Mahjong Tiles | 48 | 44 | Common
1 SMP | U+1F030..U+1F09F | Domino Tiles | 112 | 100 | Common
1 SMP | U+1F0A0..U+1F0FF | Playing Cards | 96 | 82 | Common
1 SMP | U+1F100..U+1F1FF | Enclosed Alphanumeric Supplement | 256 | 200 | Common
1 SMP | U+1F200..U+1F2FF | Enclosed Ideographic Supplement | 256 | 64 | Hiragana (1 character), Common (63 characters)
1 SMP | U+1F300..U+1F5FF | Miscellaneous Symbols and Pictographs | 768 | 768 | Common
1 SMP | U+1F600..U+1F64F | Emoticons | 80 | 80 | Common
1 SMP | U+1F650..U+1F67F | Ornamental Dingbats | 48 | 48 | Common
1 SMP | U+1F680..U+1F6FF | Transport and Map Symbols | 128 | 114 | Common
1 SMP | U+1F700..U+1F77F | Alchemical Symbols | 128 | 116 | Common
1 SMP | U+1F780..U+1F7FF | Geometric Shapes Extended | 128 | 101 | Common
1 SMP | U+1F800..U+1F8FF | Supplemental Arrows-C | 256 | 150 | Common
1 SMP | U+1F900..U+1F9FF | Supplemental Symbols and Pictographs | 256 | 254 | Common
1 SMP | U+1FA00..U+1FA6F | Chess Symbols | 112 | 98 | Common
1 SMP | U+1FA70..U+1FAFF | Symbols and Pictographs Extended-A | 144 | 57 | Common
1 SMP | U+1FB00..U+1FBFF | Symbols for Legacy Computing | 256 | 212 | Common
2 SIP | U+20000..U+2A6DF | CJK Unified Ideographs Extension B | 42,720 | 42,718 | Han
2 SIP | U+2A700..U+2B73F | CJK Unified Ideographs Extension C | 4,160 | 4,149 | Han
2 SIP | U+2B740..U+2B81F | CJK Unified Ideographs Extension D | 224 | 222 | Han
2 SIP | U+2B820..U+2CEAF | CJK Unified Ideographs Extension E | 5,776 | 5,762 | Han
2 SIP | U+2CEB0..U+2EBEF | CJK Unified Ideographs Extension F | 7,488 | 7,473 | Han
2 SIP | U+2F800..U+2FA1F | CJK Compatibility Ideographs Supplement | 544 | 542 | Han
3 TIP | U+30000..U+3134F | CJK Unified Ideographs Extension G | 4,944 | 4,939 | Han
14 SSP | U+E0000..U+E007F | Tags | 128 | 97 | Common
14 SSP | U+E0100..U+E01EF | Variation Selectors Supplement | 240 | 240 | Inherited
15 PUA-A | U+F0000..U+FFFFF | Supplementary Private Use Area-A | 65,536 | 65,534 | Unknown
16 PUA-B | U+100000..U+10FFFF | Supplementary Private Use Area-B | 65,536 | 65,534 | Unknown
  1. The code point count includes unassigned code points (noncharacters and reserved code points).
  2. The script has one or more characters in the block, as defined by the Script property. This is independent of the block name.
  3. "Common" (Zyyy), "Unknown" (Zzzz), and "Inherited" (Zinh or Qaai) refer to script codes in ISO 15924.
  4. Unicode Blocks data file, as of Unicode version 13.0.
  5. UAX 24: Unicode Script Property (four-letter code).
  6. UAX 24: Script data file.
  7. Called "C0 Controls and Basic Latin" in ISO/IEC 10646.
  8. Called "C1 Controls and Latin-1 Supplement" in ISO/IEC 10646.
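The "Code points" column follows directly from the block range: it is the last code point minus the first, plus one. The sketch below parses the "U+XXXX..U+YYYY" notation used in the list and computes that size. It also gives a rough count of assigned code points using `unicodedata.category`, on the assumption that "assigned" here means any General Category other than "Cn" (unassigned) or "Cs" (surrogate). That count depends on the Unicode version bundled with the Python interpreter, so it need not match the Unicode 13.0 figures above. The function names are invented for this example.

```python
import unicodedata

def parse_range(text):
    """Turn 'U+0370..U+03FF' into a (start, end) pair of integers."""
    start_text, end_text = text.split("..")
    return int(start_text.replace("U+", ""), 16), int(end_text.replace("U+", ""), 16)

def block_size(text):
    """Number of code points in the range, assigned or not."""
    start, end = parse_range(text)
    return end - start + 1

def count_assigned(text):
    """Count code points that are neither unassigned (Cn) nor surrogates (Cs).

    The result reflects the Unicode version known to this Python build.
    """
    start, end = parse_range(text)
    return sum(
        1
        for cp in range(start, end + 1)
        if unicodedata.category(chr(cp)) not in ("Cn", "Cs")
    )

print(block_size("U+0370..U+03FF"))    # 144
print(block_size("U+20000..U+2A6DF"))  # 42720
print(count_assigned("U+0370..U+03FF"))
```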

Deleted blocks

The Unicode Stability Policy requires that a character, once assigned, may not be moved or removed, although it may be deprecated. This applies to Unicode 2.0 and all subsequent versions.

Prior to this, the following former blocks were removed:

Former Unicode blocks from before Unicode 2.0
Block range | Block name | Range now occupied by | Superseded by block | Code points | Assigned characters | Scripts
U+1000..U+105F | Tibetan[4] | Myanmar | Tibetan | 96 | 71 | Tibetan
U+3400..U+3D2D | Hangul[5] | CJK Unified Ideographs Extension A | Hangul Syllables | 2350 | 2350 | Hangul
U+3D2E..U+44B7 | Hangul Supplementary-A[5] | CJK Unified Ideographs Extension A | Hangul Syllables | 1930 | 1930 | Hangul
U+44B8..U+4DFF | Hangul Supplementary-B[5] | CJK Unified Ideographs Extension A, Yijing Hexagram Symbols | Hangul Syllables | 2376 | 2376 | Hangul

References

  1. "Unicode Blocks data file, Unicode version 13.0". Unicode Consortium. Retrieved 2019-04-29.
  2. Unicode glossary
  3. "Unicode Core Specification, Chapter 4: Character Properties" (PDF). Retrieved 2020-03-14.
  4. "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. Version 1.0. Unicode Consortium.
  5. "Appendix E: Block Names" (PDF). The Unicode Standard. Version 1.1. Unicode Consortium.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.