ISO-IR-153

ISO-IR-153[3] (ST SEV 358-88) is an 8-bit character set that covers the Russian and Bulgarian alphabets. Unlike the KOI encodings, it lists the Cyrillic letters in their traditional alphabetical order. It became the basis for ISO/IEC 8859-5 and the Cyrillic Unicode block.

ISO-IR-153
Language(s): Russian, Bulgarian
Standard: ST SEV 358-88, GOST R 34.303-92 (see below)
Classification: Extended ASCII
Based on: Main code page[1]
Extensions: ISO-8859-5, IBM-1124, ISO-IR-200, ISO-IR-201
Preceded by: KOI8-B[2]

Standards and Naming

ISO-IR-153 is a subset of ISO/IEC 8859-5 (synchronised with ECMA-113 since 1988).[4] The ISO-IR-153 documentation cites ST SEV 358-88 as the source standard.[3] It also cites the earlier GOST 19768-74,[3] which defines KOI-8 and was conformed to by the first version of ECMA-113 (i.e. ISO-IR-111).[4] That citation appears to be an error, because ISO-IR-153 does not follow the KOI-8 layout and instead uses a close modification of the letter layout from the Main code page.[1] The ISO-IR-153 encoding was intended to replace GOST 19768-74, and is sometimes referred to as GOST-19768-87.[2][5] This confusion has led to a common misconception that ISO-8859-5 was defined in or based on GOST 19768-74.[1]

Regardless of how accurate these names are, the IANA lists GOST_19768-74, ST_SEV_358-88 and iso-ir-153 as labels that may be used for the ISO-IR-153 encoding on the Internet, with reference to RFC 1345, which assigns it those labels.[6][7]

GOST R 34.303-92 includes the ISO-IR-153 code page and dubs it KOI-8 V1 (in addition to using KOI-8 N1 and KOI-8 N2 for two Alternative code page/Code page 866 variants).[8]

Character set

The following table shows the ISO-IR-153 encoding. Each character is shown with its equivalent Unicode code point.

The encoding closely resembles the letter subset of the Cyrillic part of the Main code page, apart from the relocation of the uppercase Ё from 0xF0 to 0xA1. ISO-8859-5 is a superset.

ISO-IR-153[3]
    _0    _1    _2    _3    _4    _5    _6    _7    _8    _9    _A    _B    _C    _D    _E    _F
8_
9_
A_  NBSP  Ё                                                                       SHY
    00A0  0401                                                                    00AD
B_  А     Б     В     Г     Д     Е     Ж     З     И     Й     К     Л     М     Н     О     П
    0410  0411  0412  0413  0414  0415  0416  0417  0418  0419  041A  041B  041C  041D  041E  041F
C_  Р     С     Т     У     Ф     Х     Ц     Ч     Ш     Щ     Ъ     Ы     Ь     Э     Ю     Я
    0420  0421  0422  0423  0424  0425  0426  0427  0428  0429  042A  042B  042C  042D  042E  042F
D_  а     б     в     г     д     е     ж     з     и     й     к     л     м     н     о     п
    0430  0431  0432  0433  0434  0435  0436  0437  0438  0439  043A  043B  043C  043D  043E  043F
E_  р     с     т     у     ф     х     ц     ч     ш     щ     ъ     ы     ь     э     ю     я
    0440  0441  0442  0443  0444  0445  0446  0447  0448  0449  044A  044B  044C  044D  044E  044F
F_        ё
          0451

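The layout in the table is regular enough to express as a small lookup table. The following Python sketch is illustrative only: the function names are hypothetical, and it assumes that bytes 0x00 to 0x7F are plain ASCII (consistent with the "Extended ASCII" classification) and that every high byte missing from the table should be rejected as undefined.

```python
# Illustrative ISO-IR-153 codec built from the code chart.
# Assumptions (not stated by the standard text quoted here):
#   * bytes 0x00-0x7F are treated as plain ASCII;
#   * high bytes absent from the chart are treated as errors.


def build_decoding_table():
    """Return a dict mapping each defined byte value to its character."""
    table = {b: chr(b) for b in range(0x80)}  # assumed ASCII lower half
    table[0xA0] = "\u00A0"  # NBSP (no-break space)
    table[0xA1] = "\u0401"  # Ё
    table[0xAD] = "\u00AD"  # SHY (soft hyphen)
    # Rows B_ to E_ hold А..я in alphabetical order, so the byte value
    # and the code point differ by a constant: 0xB0 + 0x360 == 0x0410.
    for b in range(0xB0, 0xF0):
        table[b] = chr(b + 0x360)
    table[0xF1] = "\u0451"  # ё
    return table


DECODE = build_decoding_table()
ENCODE = {ch: b for b, ch in DECODE.items()}


def decode_iso_ir_153(data):
    """Decode bytes to str, rejecting undefined byte values."""
    try:
        return "".join(DECODE[b] for b in data)
    except KeyError as exc:
        raise ValueError(
            "byte 0x%02X is undefined in ISO-IR-153" % exc.args[0]
        ) from None


def encode_iso_ir_153(text):
    """Encode str to bytes, rejecting characters outside the set."""
    try:
        return bytes(ENCODE[ch] for ch in text)
    except KeyError as exc:
        raise ValueError(
            "%r has no ISO-IR-153 code" % exc.args[0]
        ) from None
```

For example, `decode_iso_ir_153(bytes([0xBC, 0xD8, 0xE0]))` yields `"Мир"`. Because ISO-8859-5 is a superset, every byte defined here should decode to the same character under an ISO-8859-5 codec such as Python's built-in `iso8859_5`.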

References

  1. Nechayev, Valentin (2013) [2001]. "Review of 8-bit Cyrillic encodings universe". Archived from the original on 2016-12-05. Retrieved 2016-12-05.
  2. Czyborra, Roman (1998-11-30) [1998-05-25]. "The Cyrillic Charset Soup". Archived from the original on 2016-12-03. Retrieved 2016-12-03. […] in the meantime GOST had inhaled some perestroika and declared the installed base and KOI correspondence less important and revised its 19768 standard from 1974 in 1987 into an incompatible new GOST 19768-87 […]
  3. ISO-IR-153 (1 December 1989)
  4. ECMA-113. 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Cyrillic Alphabet (2nd ed., June 1988)
  5. http://czyborra.com/charsets/gost19768-87.txt.gz
  6. "Character Sets". IANA.
  7. Simonsen, Keld (1992). "Character Mnemonics & Character Sets". Requests for Comments. IETF. doi:10.17487/rfc1345. RFC 1345.
  8. (in Russian) ГОСТ Р 34.303-92. Наборы 8-битных кодированных символов. 8-битный код обмена и обработки информации. = 8-bit coded character sets. 8-bit code for information interchange.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.