ISO/IEC 8859-9

ISO/IEC 8859-9:1999, Information technology — 8-bit single-byte coded graphic character sets — Part 9: Latin alphabet No. 5, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1989. It is informally referred to as Latin-5 or Turkish. It was designed to cover the Turkish language, designed as being of more use than the ISO/IEC 8859-3 encoding. It is identical to ISO/IEC 8859-1 except for these six replacements of Icelandic characters with characters unique to the Turkish alphabet:

Position 0xD00xDD0xDE0xF00xFD0xFE
8859-9 ĞİŞğış
8859-1 ÐÝÞðýþ
ISO/IEC 8859-9
MIME / IANAISO-8859-9
Alias(es)iso-ir-148, latin5, l5, csISOLatin5[1]
StandardECMA-128, ISO/IEC 8859
ClassificationISO 8859 (extended ASCII, ISO 4873 level 1)
ExtendsUS-ASCII
Based onISO/IEC 8859-1
Preceded byISO/IEC 8859-3
Other related encoding(s)Windows-1254

ISO-8859-9 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. In modern applications Unicode and UTF-8 are preferred; authors of new web pages and the designers of new protocols are instructed to use UTF-8 instead.[2] Since August 2019, 0.1% of all web pages use ISO-8859-9,[3][4] while 3.1% of web pages located in Turkey use ISO-8859-9.[5] However, the WHATWG Encoding Standard, which specifies the character encodings which are permitted in HTML5 and which compliant browsers must support,[6] requires that web pages marked as ISO-8859-9 be handled as Windows-1254,[2] which differs from ISO-8859-9 by using the CR range which ISO-8859-9 reserves for C1 control codes for additional graphical characters instead (analogous to the relationship between ISO-8859-1 and Windows-1252).

Microsoft has assigned code page 28599 a.k.a. Windows-28599 to ISO-8859-9 in Windows. IBM has assigned code page 920 (CCSID 920) to ISO-8859-9.[7][8] It is published by Ecma International as ECMA-128.[9]

Codepage layout

ISO/IEC 8859-9[10][11][12]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
0_
0
1_
16
2_
32
SP
0020
!
0021
"
0022
#
0023
$
0024
%
0025
&
0026
'
0027
(
0028
)
0029
*
002A
+
002B
,
002C
-
002D
.
002E
/
002F
3_
48
0
0030
1
0031
2
0032
3
0033
4
0034
5
0035
6
0036
7
0037
8
0038
9
0039
:
003A
;
003B
<
003C
=
003D
>
003E
?
003F
4_
64
@
0040
A
0041
B
0042
C
0043
D
0044
E
0045
F
0046
G
0047
H
0048
I
0049
J
004A
K
004B
L
004C
M
004D
N
004E
O
004F
5_
80
P
0050
Q
0051
R
0052
S
0053
T
0054
U
0055
V
0056
W
0057
X
0058
Y
0059
Z
005A
[
005B
\
005C
]
005D
^
005E
_
005F
6_
96
`
0060
a
0061
b
0062
c
0063
d
0064
e
0065
f
0066
g
0067
h
0068
i
0069
j
006A
k
006B
l
006C
m
006D
n
006E
o
006F
7_
112
p
0070
q
0071
r
0072
s
0073
t
0074
u
0075
v
0076
w
0077
x
0078
y
0079
z
007A
{
007B
|
007C
}
007D
~
007E
8_
128
9_
144
A_
160
NBSP
00A0
¡
00A1
¢
00A2
£
00A3
¤
00A4
¥
00A5
¦
00A6
§
00A7
¨
00A8
©
00A9
ª
00AA
«
00AB
¬
00AC
SHY
00AD
®
00AE
¯
00AF
B_
176
°
00B0
±
00B1
²
00B2
³
00B3
´
00B4
µ
00B5

00B6
·
00B7
¸
00B8
¹
00B9
º
00BA
»
00BB
¼
00BC
½
00BD
¾
00BE
¿
00BF
C_
192
À
00C0
Á
00C1
Â
00C2
Ã
00C3
Ä
00C4
Å
00C5
Æ
00C6
Ç
00C7
È
00C8
É
00C9
Ê
00CA
Ë
00CB
Ì
00CC
Í
00CD
Î
00CE
Ï
00CF
D_
208
Ğ
011E
Ñ
00D1
Ò
00D2
Ó
00D3
Ô
00D4
Õ
00D5
Ö
00D6
×
00D7
Ø
00D8
Ù
00D9
Ú
00DA
Û
00DB
Ü
00DC
İ
0130
Ş
015E
ß
00DF
E_
224
à
00E0
á
00E1
â
00E2
ã
00E3
ä
00E4
å
00E5
æ
00E6
ç
00E7
è
00E8
é
00E9
ê
00EA
ë
00EB
ì
00EC
í
00ED
î
00EE
ï
00EF
F_
240
ğ
011F
ñ
00F1
ò
00F2
ó
00F3
ô
00F4
õ
00F5
ö
00F6
÷
00F7
ø
00F8
ù
00F9
ú
00FA
û
00FB
ü
00FC
ı
0131
ş
015F
ÿ
00FF

  Letter  Number  Punctuation  Symbol  Other  Undefined   Differences from ISO-8859-1

See also

References

  1. Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12
  2. van Kesteren, Anne. "Names and labels". Encoding Standard. WHATWG.
  3. "Historical trends in the usage of character encodings for websites". w3techs.com.
  4. "Frequently Asked Questions". w3techs.com.
  5. "Distribution of character encodings among websites that use Turkey". w3techs.com.
  6. "8.2.2.3. Character encodings". HTML 5.1 2nd Edition. W3C. User agents must support the encodings defined in the WHATWG Encoding standard, including, but not limited to […]
  7. "Code page 920 information document". Archived from the original on 2017-01-16.
  8. "CCSID 920 information document". Archived from the original on 2016-03-27.
  9. Standard ECMA-128: 8-Bit Single-Byte Coded Graphic Character Sets - Latin Alphabet No. 5 (2nd ed.). 1999. This Ecma publication is also approved as ISO 8859-9.
  10. Code Page CPGID 00920 (pdf) (PDF), IBM
  11. Code Page CPGID 00920 (txt), IBM
  12. International Components for Unicode (ICU), ibm-920_P100-1995.ucm, 2002-12-03
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.