Chinese Character Code for Information Interchange

CCCII / EACC
	The character 圓 (circle, Japanese yen, Chinese yuan) in four layers of CCCII.
Language(s)	Chinese, Japanese, Korean
Standard	MARC-8, ANSI/NISO Z39.64 (both EACC version)
Current status	Used mainly by library systems
Classification	TBCS for CJK based on the ISO 2022 structure, JACKPHY component of MARC

The Chinese Character Code for Information Interchange (Chinese: 中文資訊交換碼) or CCCII is a character set developed by the Chinese Character Analysis Group in Taiwan. It was first published in 1980, and significantly expanded in 1982 and 1987.[1]

It is used mostly by library systems.[2][3] It is one of the earliest established and most sophisticated encodings for Traditional Chinese (predating the establishment of Big5 in 1984 and CNS 11643 in 1986).[2] It is distinguished by its unique system for encoding simplified versions and other variants of its main set of hanzi characters.[1]

A variant of an earlier version of CCCII is used by the Library of Congress as part of MARC-8, under the name East Asian Character Code (EACC, ANSI/NISO Z39.64),[4] where it comprises part of MARC 21's JACKPHY support. However, EACC contains fewer characters than the most recent versions of CCCII.[5][1]

Design

Nine characters from CCCII or EACC implementations, which are encoded as variants of 劍 (double-edged sword). The ninth of these, 釖, is usually a variant of 刀.[6]

Byte ranges

CCCII is designed as an 94ⁿ set, as defined by ISO/IEC 2022.[1] Each Chinese character is represented by a 3-byte code in which each byte is 7-bit, between 0x21 and 0x7E inclusive. Thus, the maximum number of Chinese characters representable in CCCII is 94×94×94 = 830584. In practice the number of characters encodable by CCCII would be less than this number, because variant characters are encoded in related ISO 2022 planes under CCCII, so most of the code points would have to be reserved for variants.

In practice, however, bytes outside of these ranges are sometimes used. The code 0x212320 is used by some implementations as an ideographic space.[7] A CCCII specification used by libraries in Hong Kong uses codes starting with 0x2120 for punctuation and symbols.[8] The first byte 0x7F is used by some variants to encode codes for some otherwise unavailable Unified Repertoire and Ordering or CJK Unified Ideographs Extension A hanzi (e.g. 0x7F3449 for U+3449 or 0x7F796E for U+796E;[8] notice how the continuation bytes match the UCS-2BE code), and this may include bytes outside of the 0x21–0x7E or even 0x20–0x7F range, e.g. 0x7F551C for U+551C,[9] 0x7F5AA4 for U+5AA4[9] or 0x7F8EDA for U+8EDA.[8]

Interaction with ISO 2022

CCCII/EACC is not registered in the International Registry of Coded Character Sets to be Used with Escape Sequences,[10] and as such, does not have a standard designation escape for use with ISO 2022. MARC-8 assigns EACC the private-use F-byte 0x31 (1) in its implementation of ANSI X3.41 (ISO 2022).[11]

Layers and variant characters

The 94 ISO 2022 planes are grouped into 16 layers of 6 planes each (except for layer 16, which contains the four planes 91–94).[1] Layer 1 contains both non-hanzi and hanzi characters, with the non-hanzi and most frequently used hanzi being placed in plane 1, and with the remaining five planes consisting of less common hanzi.[1] Layer 2 contains Simplified Chinese characters, with their row and cell numbers being the same as their Traditional Chinese equivalents in layer 1. Layers 3 through 12 contain further variant forms, at row and cell numbers homologous to the first two layers.[12]

The last four layers are used for other purposes. Specifically, layer 13 contains additional characters for Japanese language support (kana and Japanese kokuji), and layer 14 contains additional characters for Korean language support (hangul).[12] Layer 15 is unused (reserved), while layer 16 is used for other characters.[1]

This distinctive design has been criticised by Christian Wittern of the International Research Institute for Zen Buddhism at Hanazono University, who asserts that the relationship of character variants "is very complex and can not be expressed in a fixed, one-dimensional, hard-wired codetable".[3] Ken Lunde describes it as "one of the most well thought-out character set standards from Taiwan", describing its structure as "to be truly admired", but concluding that OpenType variant form substitution can provide the same level of functionality.[1]

CCCII defines roughly 53940 code points as of its 1987 edition, although a more recent draft from 1989 extends this to 75684 code points (comprising 44167 unique characters and 31517 variants). EACC, the variant used by the Library of Congress, includes only a smaller set of 15686 characters.[1]

Adoption

As of 1995, CCCII or EACC was used mostly in libraries in the United States, Hong Kong and Taiwan. Although CCCII promised pan-CJK coverage, its support was limited to specialised hardware; difficulty ascertaining when the root versus variant character should be used, exacerbated by a lack of firmly established reference glyphs, further limited its adoption, resulting in Big5 being more commonly used for Chinese in those territories outside of library use (since Unicode had yet to become widely adopted at the time).[3]

As of 2009, EACC is still in extensive use for specialized bibliographic purposes.[1] It was also an important precursor to Unicode.[1] Unicode hanzi characters are referenced to their corresponding CCCII and EACC codes in the Unihan database, in the keys kCCCII and kEACC.[4] Mapping tables for hanzi, hangul, kana and punctuation between EACC and Unicode are available from the Library of Congress.[13]

Punctuation, symbol, kana and jamo charts

Following are charts for punctuation, symbols, kana and Hangul jamo, showing the characters and giving possible Unicode mappings. Where possible, these are referenced against published mapping data.

Unicode mappings for Hangul syllables are omitted for brevity, but may also be obtained from the Library of Congress.[14] CCCII hanzi number in the tens of thousands[1][3] and are not shown below (except where they are also included in the non-hanzi range, as radicals or numerals), but mappings to Unicode are available from the Unihan database[4] and from elsewhere.[9][8]

Character set 0x2120 (plane 1, row 0: Hong Kong punctuation)

Although CCCII is usually a 94ⁿ set,[1] and therefore does not usually use codes starting with 0x2120,[9] the following layout is used by a variant used by libraries in Hong Kong:[8]

CCCII (Hong Kong)[8] (prefixed with 0x2120)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		､ FF64 1-0-1	｡ FF61 1-0-2	1-0-3	･ FF65 1-0-4	ﾞ FF9E 1-0-5	ﾟ FF9F 1-0-6	´ 00B4 1-0-7	｀ FF40 1-0-8	¨ 00A8 1-0-9	1-0-10	￣ FFE3 1-0-11	ヽ 30FD 1-0-12	ヾ 30FE 1-0-13	ゝ 309D 1-0-14	ゞ 309E 1-0-15
3_	〃 3003 1-0-16	1-0-17	1-0-18	〆 3006 1-0-19	1-0-20	1-0-21	1-0-22	1-0-23	1-0-24	‖ 2016 1-0-25	1-0-26	1-0-27	1-0-28	… 2026 1-0-29	‥ 2025 1-0-30	1-0-31
4_	“ 201C 1-0-32	〔 3014 1-0-33	〕 3015 1-0-34	1-0-35	1-0-36	1-0-37	1-0-38	｢ FF62 1-0-39	｣ FF63 1-0-40	『 300E 1-0-41	』 300F 1-0-42	【 3010 1-0-43	】 3011 1-0-44	± 00B1 1-0-45	× 00D7 1-0-46	÷ 00F7 1-0-47
5_	≠ 2260 1-0-48	≦ 2266 1-0-49	≧ 2267 1-0-50	∞ 221E 1-0-51	∴ 2234 1-0-52	♂ 2642 1-0-53	♀ 2640 1-0-54	° 00B0 1-0-55	℃ 2103 1-0-56	¢ 00A2 1-0-57	£ 00A3 1-0-58	§ 00A7 1-0-59	☆ 2606 1-0-60	★ 2605 1-0-61	○ 25CB 1-0-62	● 25CF 1-0-63
6_	′ 2032 1-0-64	″ 2033 1-0-65	1-0-66	1-0-67	1-0-68	1-0-69	1-0-70	1-0-71	1-0-72	1-0-73	1-0-74	1-0-75	1-0-76	1-0-77	1-0-78	◎ 25CE 1-0-79
7_	◇ 25C7 1-0-80	◆ 25C6 1-0-81	□ 25A1 1-0-82	■ 25A0 1-0-83	△ 25B3 1-0-84	▲ 25B2 1-0-85	▽ 25BD 1-0-86	▼ 25BC 1-0-87	※ 203B 1-0-88	〒 3012 1-0-89	→ 2192 1-0-90	← 2190 1-0-91	↑ 2191 1-0-92	↓ 2193 1-0-93	1-0-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x2121 (plane 1, row 1: reserved for controls)

No characters are assigned in plane 1 row 1, which is reserved for control codes.[1]

Character set 0x2122 (plane 1, row 2: mathematical operators)

This row contains mathematical operators. EACC leaves this row empty.[13] The following table is referenced against sources from Taiwan.[2][9]

CCCII (Taiwan)[9][15] (prefixed with 0x2122)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		∞ 221E 1-2-1	＋ FF0B 1-2-2	− 2212 1-2-3	± 00B1 1-2-4	× 00D7 1-2-5	⋅ 22C5 1-2-6	÷ 00F7 1-2-7	∕ 2215 1-2-8	＝ FF1D 1-2-9	≠ 2260 1-2-10	≡ 2261 1-2-11	≈ 2248 1-2-12	∼ 223C 1-2-13	∝ 221D 1-2-14	＜ FF1C 1-2-15
3_	＞ FF1E 1-2-16	≮ 226E 1-2-17	≯ 226F 1-2-18	≤ 2264 1-2-19	≥ 2265 1-2-20	≪ 226A 1-2-21	≫ 226B 1-2-22	∂ 2202 1-2-23	∫ 222B 1-2-24	Δ 0394 1-2-25	∆ 2206 1-2-26	∇ 2207 1-2-27	▫ 25AB 1-2-28	∠ 2220 1-2-29	⊤ 22A4 1-2-30	∥ 2225 1-2-31
4_	≅ 2245 1-2-32	≞ 225E 1-2-33	∴ 2234 1-2-34	∃ 2203 1-2-35	∀ 2200 1-2-36	∪ 222A 1-2-37	∩ 2229 1-2-38	⊂ 2282 1-2-39	⊃ 2283 1-2-40	⇒ 21D2 1-2-41	⇔ 21D4 1-2-42	∋ 220B 1-2-43	∈ 2208 1-2-44	∉ 2209 1-2-45	∑ 2211 1-2-46	㏒ 33D2 1-2-47
5_	㏑ 33D1 1-2-48	ℯ 212F 1-2-49	π 03C0 1-2-50	√ 221A 1-2-51	︕ FE15 1-2-52	⎸ 23B8 1-2-53	⎹ 23B9 1-2-54	〈 3008 1-2-55	〉 3009 1-2-56	1-2-57	1-2-58	1-2-59	1-2-60	1-2-61	1-2-62	1-2-63
6_	1-2-64	1-2-65	1-2-66	1-2-67	1-2-68	1-2-69	1-2-70	1-2-71	1-2-72	1-2-73	1-2-74	1-2-75	1-2-76	1-2-77	1-2-78	1-2-79
7_	1-2-80	1-2-81	1-2-82	1-2-83	1-2-84	1-2-85	1-2-86	1-2-87	1-2-88	1-2-89	1-2-90	1-2-91	1-2-92	1-2-93	1-2-94

Letter Number Punctuation Symbol Other Undefined

The following table is referenced against CCCII data provided by the Hong Kong Innovative Users Group, a group of libraries in Hong Kong, and hosted by the University of Hong Kong.[16][8] It uses an entirely different layout in this row:

CCCII (Hong Kong)[8] (prefixed with 0x2122)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_	∈ 2208	∋ 220B 1-2-1	⊆ 2286 1-2-2	⊇ 2287 1-2-3	⊂ 2282 1-2-4	⊃ 2283 1-2-5	∪ 222A 1-2-6	∩ 2229 1-2-7	∧ 2227 1-2-8	∨ 2228 1-2-9	¬ 00AC 1-2-10	⇒ 21D2 1-2-11	⇔ 21D4 1-2-12	∀ 2200 1-2-13	∃ 2203 1-2-14	1-2-15
3_	∠ 2220 1-2-16	⊥ 22A5 1-2-17	⌒ 2312 1-2-18	∂ 2202 1-2-19	∇ 2207 1-2-20	≡ 2261 1-2-21	≒ 2252 1-2-22	≪ 226A 1-2-23	≫ 226B 1-2-24	√ 221A 1-2-25	∽ 223D 1-2-26	∝ 221D 1-2-27	∵ 2235 1-2-28	∫ 222B 1-2-29	∬ 222C 1-2-30	1-2-31
4_	Å 212B 1-2-32	‰ 2030 1-2-33	♯ 266F 1-2-34	♭ 266D 1-2-35	♪ 266A 1-2-36	† 2020 1-2-37	‡ 2021 1-2-38	¶ 00B6 1-2-39	◯ 25EF 1-2-40	1-2-41	1-2-42	1-2-43	1-2-44	1-2-45	1-2-46	1-2-47
5_	─ 2500 1-2-48	│ 2502 1-2-49	┌ 250C 1-2-50	┐ 2510 1-2-51	┘ 2518 1-2-52	└ 2514 1-2-53	├ 251C 1-2-54	┬ 252C 1-2-55	┤ 2524 1-2-56	┴ 2534 1-2-57	┼ 253C 1-2-58	━ 2501 1-2-59	┃ 2503 1-2-60	┏ 250F 1-2-61	┓ 2513 1-2-62	┛ 251B 1-2-63
6_	┗ 2517 1-2-64	┣ 2523 1-2-65	┳ 2533 1-2-66	┫ 252B 1-2-67	┻ 253B 1-2-68	╋ 254B 1-2-69	┠ 2520 1-2-70	┯ 252F 1-2-71	┨ 2528 1-2-72	┷ 2537 1-2-73	┿ 253F 1-2-74	┝ 251D 1-2-75	┰ 2530 1-2-76	┥ 2525 1-2-77	┸ 2538 1-2-78	╂ 2542 1-2-79
7_	1-2-80	1-2-81	1-2-82	1-2-83	1-2-84	1-2-85	1-2-86	1-2-87	1-2-88	1-2-89	1-2-90	1-2-91	1-2-92	1-2-93	1-2-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x2123 (plane 1, row 3: Roman and punctuation)

This row includes punctuation, western Arabic numerals and Roman letters.[9] Compare row 3 of Wansung code and of GB 2312.

Different variants variously encode the ideographic space (U+3000) at 0x212320 (which the MARC specification acknowledges),[7][8] 0x212321 (which is listed in the ANSI standard, and is also acknowledged by MARC),[7][8] or 0x21635F.[9] EACC includes only the hyphen-minus, parentheses and ideographic space in this set.[7]

CCCII/EACC[13][9][15] (prefixed with 0x2123)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_	IDSP 3000 [lower-alpha 1]	!/IDSP 0021/3000 1-3-1[lower-alpha 2]	" 0022 1-3-2	# 0023 1-3-3	$ 0024 1-3-4	% 0025 1-3-5	& 0026 1-3-6	' 0027 1-3-7	(/（ 0028/FF08 1-3-8	)/） 0029/FF09 1-3-9	* 002A 1-3-10	+ 002B 1-3-11	, 002C 1-3-12	-/－ 002D/FF0D 1-3-13	. 002E 1-3-14	/ 002F 1-3-15
3_/B_	0 0030 1-3-16	1 0031 1-3-17	2 0032 1-3-18	3 0033 1-3-19	4 0034 1-3-20	5 0035 1-3-21	6 0036 1-3-22	7 0037 1-3-23	8 0038 1-3-24	9 0039 1-3-25	: 003A 1-3-26	; 003B 1-3-27	< 003C 1-3-28	= 003D 1-3-29	> 003E 1-3-30	? 003F 1-3-31
4_/C_	@ 0040 1-3-32	A 0041 1-3-33	B 0042 1-3-34	C 0043 1-3-35	D 0044 1-3-36	E 0045 1-3-37	F 0046 1-3-38	G 0047 1-3-39	H 0048 1-3-40	I 0049 1-3-41	J 004A 1-3-42	K 004B 1-3-43	L 004C 1-3-44	M 004D 1-3-45	N 004E 1-3-46	O 004F 1-3-47
5_/D_	P 0050 1-3-48	Q 0051 1-3-49	R 0052 1-3-50	S 0053 1-3-51	T 0054 1-3-52	U 0055 1-3-53	V 0056 1-3-54	W 0057 1-3-55	X 0058 1-3-56	Y 0059 1-3-57	Z 005A 1-3-58	[ 005B 1-3-59	\ 005C 1-3-60	] 005D 1-3-61	↑ 2191 1-3-62	_ 005F 1-3-63
6_/E_	`/' 0060/0027 1-3-64	a 0061 1-3-65	b 0062 1-3-66	c 0063 1-3-67	d 0064 1-3-68	e 0065 1-3-69	f 0066 1-3-70	g 0067 1-3-71	h 0068 1-3-72	i 0069 1-3-73	j 006A 1-3-74	k 006B 1-3-75	l 006C 1-3-76	m 006D 1-3-77	n 006E 1-3-78	o 006F 1-3-79
7_/F_	p 0070 1-3-80	q 0071 1-3-81	r 0072 1-3-82	s 0073 1-3-83	t 0074 1-3-84	u 0075 1-3-85	v 0076 1-3-86	w 0077 1-3-87	x 0078 1-3-88	y 0079 1-3-89	z 007A 1-3-90	{ 007B 1-3-91	\| 007C 1-3-92	} 007D 1-3-93	~ 007E 1-3-94

Letter Number Punctuation Symbol Other Undefined Included in EACC.

Character set 0x212A (plane 1, row 10: internal IME characters and geta mark)

In EACC, this row includes several Private Use Area mapped characters used internally to represent character components by the RLIN input method,[17] which is used by the Library of Congress for non-Roman cataloging.[18] These component characters should only be used internally by an IME and, if encountered elsewhere, may be replaced with the geta mark (U+3013),[17] which this row also includes at 0x212A46. This row is unassigned in CCCII,[1] but the geta mark is also listed at that location in some mappings for CCCII.[9]

EACC[13] (prefixed with 0x212A)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		� E8D0 1-10-1	� E8D1 1-10-2	� E8D2 1-10-3	� E8D3 1-10-4	� E8D4 1-10-5	� E8D5 1-10-6	1-10-7	� E8D6 1-10-8	� E8D7 1-10-9	� E8D8 1-10-10	� E8D9 1-10-11	� E8DA 1-10-12	� E8DB 1-10-13	� E8DC 1-10-14	� E8DD 1-10-15
3_	� E8DE 1-10-16	1-10-17	� E8DF 1-10-18	� E8E0 1-10-19	� E8E1 1-10-20	� E8E2 1-10-21	� E8E3 1-10-22	� E8E4 1-10-23	� E8E5 1-10-24	� E8E6 1-10-25	� E8E7 1-10-26	� E8E8 1-10-27	� E8E9 1-10-28	� E8EA 1-10-29	� E8EB 1-10-30	� E8EC 1-10-31
4_	� E8ED 1-10-32	� E8EE 1-10-33	� E8EF 1-10-34	� E8F0 1-10-35	� E8F1 1-10-36	� E8F2 1-10-37	〓 3013 1-10-38	1-10-39	1-10-40	1-10-41	1-10-42	1-10-43	1-10-44	1-10-45	1-10-46	1-10-47
5_	1-10-48	1-10-49	1-10-50	1-10-51	1-10-52	1-10-53	1-10-54	1-10-55	1-10-56	1-10-57	1-10-58	1-10-59	1-10-60	1-10-61	1-10-62	1-10-63
6_	1-10-64	1-10-65	1-10-66	1-10-67	1-10-68	1-10-69	1-10-70	1-10-71	1-10-72	1-10-73	1-10-74	1-10-75	1-10-76	1-10-77	1-10-78	1-10-79
7_	1-10-80	1-10-81	1-10-82	1-10-83	1-10-84	1-10-85	1-10-86	1-10-87	1-10-88	1-10-89	1-10-90	1-10-91	1-10-92	1-10-93	1-10-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x212B (plane 1, row 11: punctuation)

This row contains various punctuation marks used in Chinese,[1][7] in addition to other symbols. CCCII includes a set of 35 punctuation marks in this row.[1] EACC includes only 13 characters in this row (shown boxed below).[7]

CCCII/EACC[13][9][15] (prefixed with 0x212B)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		︵ FE35 1-11-1	︶ FE36 1-11-2	﹁ FE41 1-11-3	﹂ FE42 1-11-4	「 300C 1-11-5	」 300D 1-11-6	︳ FE33 1-11-7	＿ FF3F 1-11-8	﹃ FE43 1-11-9	﹄ FE44 1-11-10	『 300E 1-11-11	』 300F 1-11-12	︴ FE34 1-11-13	﹏ FE4F 1-11-14	︹ FE39 1-11-15
3_	︺ FE3A 1-11-16	〔/［ 3014/FF3B 1-11-17	〕/］ 3015/FF3D 1-11-18	。 3002 1-11-19	・/． 30FB/FF0E 1-11-20	、 3001 1-11-21	⋮ 22EE 1-11-22	⋯ 22EF 1-11-23	， FF0C 1-11-24	； FF1B 1-11-25	： FF1A 1-11-26	？ FF1F 1-11-27	︱ FE31 1-11-28	！ FF01 1-11-29	︲ FE32 1-11-30	︱ FE31 1-11-31
4_	‘ 2018 1-11-32	’ 2019 1-11-33	“ 201C 1-11-34	” 201D 1-11-35	1-11-36	1-11-37	1-11-38	1-11-39	1-11-40	1-11-41	《 300A 1-11-42	》 300B 1-11-43	【 3010 1-11-44	】 3011 1-11-45	〖 3016 1-11-46	〗 3017 1-11-47
5_	1-11-48	＄ FF04 1-11-49	¢ 00A2 1-11-50	₡ 20A1 1-11-51	£ 00A3 1-11-52	¥ 00A5 1-11-53	₨ 20A8 1-11-54	d. 1-11-55	s. 1-11-56	／ FF0F 1-11-57	＃ FF03 1-11-58	％ FF05 1-11-59	⅌ 214C 1-11-60	＠ FF20 1-11-61	¶ 00B6 1-11-62	® 00AE 1-11-63
6_	© 00A9 1-11-64	℅ 2105 1-11-65	＆ FF06 1-11-66	§ 00A7 1-11-67	† 2020 1-11-68	‡ 2021 1-11-69	＊ FF0A 1-11-70	1-11-71	1-11-72	1-11-73	1-11-74	1-11-75	1-11-76	1-11-77	1-11-78	1-11-79
7_	ヽ 30FD 1-11-80	ヾ 30FE 1-11-81	ゝ 309D 1-11-82	ゞ 309E 1-11-83	α 03B1 1-11-84	〒 3012 1-11-85	1-11-86	1-11-87	1-11-88	1-11-89	1-11-90	1-11-91	1-11-92	1-11-93	1-11-94

Letter Number Punctuation Symbol Other Undefined Included in EACC.

Character sets 0x212C–0x212E (plane 1, rows 12–14: radicals and ordinals)

These rows contain Chinese radicals,[1] Roman numerals,[9] celestial stems and terrestrial branches.[15]

CCCII[15] (prefixed with 0x212C)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		1-12-1	⼀ 2F00 1-12-2	⼁ 2F01 1-12-3	⼂ 2F02 1-12-4	⼃ 2F03 1-12-5	⼄ 2F04 1-12-6	⼅ 2F05 1-12-7	1-12-8	⼆ 2F06 1-12-9	⼇ 2F07 1-12-10	⼈ 2F08 1-12-11	⼉ 2F09 1-12-12	⼊ 2F0A 1-12-13	⼋ 2F0B 1-12-14	⼌ 2F0C 1-12-15
3_	⼍ 2F0D 1-12-16	⼎ 2F0E 1-12-17	⼏ 2F0F 1-12-18	⼐ 2F10 1-12-19	⼑ 2F11 1-12-20	⼒ 2F12 1-12-21	⼓ 2F13 1-12-22	⼔ 2F14 1-12-23	⼕ 2F15 1-12-24	⼖ 2F16 1-12-25	⼗ 2F17 1-12-26	⼘ 2F18 1-12-27	⼙ 2F19 1-12-28	⼚ 2F1A 1-12-29	⼛ 2F1B 1-12-30	⼜ 2F1C 1-12-31
4_	1-12-32	⼝ 2F1D 1-12-33	⼞ 2F1E 1-12-34	⼟ 2F1F 1-12-35	⼠ 2F20 1-12-36	⼡ 2F21 1-12-37	⼢ 2F22 1-12-38	⼣ 2F23 1-12-39	⼤ 2F24 1-12-40	⼥ 2F25 1-12-41	⼦ 2F26 1-12-42	⼧ 2F27 1-12-43	⼨ 2F28 1-12-44	⼩ 2F29 1-12-45	⼪ 2F2A 1-12-46	⼫ 2F2B 1-12-47
5_	⼬ 2F2C 1-12-48	⼭ 2F2D 1-12-49	⼮ 2F2E 1-12-50	⼯ 2F2F 1-12-51	⼰ 2F30 1-12-52	⼱ 2F31 1-12-53	⼲ 2F32 1-12-54	⼳ 2F33 1-12-55	⼴ 2F34 1-12-56	⼵ 2F35 1-12-57	⼶ 2F36 1-12-58	⼷ 2F37 1-12-59	⼸ 2F38 1-12-60	⼹ 2F39 1-12-61	⼺ 2F3A 1-12-62	⼻ 2F3B 1-12-63
6_	1-12-64	⼼ 2F3C 1-12-65	⼽ 2F3D 1-12-66	⼾ 2F3E 1-12-67	⼿ 2F3F 1-12-68	⽀ 2F40 1-12-69	⽁ 2F41 1-12-70	⽂ 2F42 1-12-71	⽃ 2F43 1-12-72	⽄ 2F44 1-12-73	⽅ 2F45 1-12-74	⽆ 2F46 1-12-75	⽇ 2F47 1-12-76	⽈ 2F48 1-12-77	⽉ 2F49 1-12-78	⽊ 2F4A 1-12-79
7_	⽋ 2F4B 1-12-80	⽌ 2F4C 1-12-81	⽍ 2F4D 1-12-82	⽎ 2F4E 1-12-83	⽏ 2F4F 1-12-84	⽐ 2F50 1-12-85	⽑ 2F51 1-12-86	⽒ 2F52 1-12-87	⽓ 2F53 1-12-88	⽔ 2F54 1-12-89	⽕ 2F55 1-12-90	⽖ 2F56 1-12-91	⽗ 2F57 1-12-92	⽘ 2F58 1-12-93	⽙ 2F59 1-12-94

CCCII[8][15] (prefixed with 0x212D)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		⽚ 2F5A 1-13-1	⽛ 2F5B 1-13-2	⽜ 2F5C 1-13-3	⽝ 2F5D 1-13-4	1-13-5	⽞ 2F5E 1-13-6	⽟ 2F5F 1-13-7	⽠ 2F60 1-13-8	⽡ 2F61 1-13-9	⽢ 2F62 1-13-10	⽣ 2F63 1-13-11	⽤ 2F64 1-13-12	⽥ 2F65 1-13-13	⽦ 2F66 1-13-14	⽧ 2F67 1-13-15
3_	⽨ 2F68 1-13-16	⽩ 2F69 1-13-17	⽪ 2F6A 1-13-18	⽫ 2F6B 1-13-19	⽬ 2F6C 1-13-20	⽭ 2F6D 1-13-21	⽮ 2F6E 1-13-22	⽯ 2F6F 1-13-23	⽰ 2F70 1-13-24	⽱ 2F71 1-13-25	⽲ 2F72 1-13-26	⽳ 2F73 1-13-27	⽴ 2F74 1-13-28	1-13-29	⽵ 2F75 1-13-30	⽶ 2F76 1-13-31
4_	⽷ 2F77 1-13-32	⽸ 2F78 1-13-33	⽹ 2F79 1-13-34	⽺ 2F7A 1-13-35	⽻ 2F7B 1-13-36	⽼ 2F7C 1-13-37	⽽ 2F7D 1-13-38	⽾ 2F7E 1-13-39	⽿ 2F7F 1-13-40	⾀ 2F80 1-13-41	⾁ 2F81 1-13-42	⾂ 2F82 1-13-43	⾃ 2F83 1-13-44	⾄ 2F84 1-13-45	⾅ 2F85 1-13-46	⾆ 2F86 1-13-47
5_	⾇ 2F87 1-13-48	⾈ 2F88 1-13-49	⾉ 2F89 1-13-50	⾊ 2F8A 1-13-51	⾋ 2F8B 1-13-52	⾌ 2F8C 1-13-53	⾍ 2F8D 1-13-54	⾎ 2F8E 1-13-55	⾏ 2F8F 1-13-56	⾐ 2F90 1-13-57	⾑ 2F91 1-13-58	1-13-59	⾒ 2F92 1-13-60	⾓ 2F93 1-13-61	⾔/訁 2F94/8A01 1-13-62	⾕ 2F95 1-13-63
6_	⾖ 2F96 1-13-64	⾗ 2F97 1-13-65	⾘ 2F98 1-13-66	⾙ 2F99 1-13-67	⾚ 2F9A 1-13-68	⾛ 2F9B 1-13-69	⾜ 2F9C 1-13-70	⾝ 2F9D 1-13-71	⾞ 2F9E 1-13-72	⾟ 2F9F 1-13-73	⾠ 2FA0 1-13-74	⾡ 2FA1 1-13-75	⾢ 2FA2 1-13-76	⾣ 2FA3 1-13-77	⾤ 2FA4 1-13-78	⾥ 2FA5 1-13-79
7_	1-13-80	⾦/釒 2FA6/91D2 1-13-81	⾧ 2FA7 1-13-82	⾨ 2FA8 1-13-83	⾩ 2FA9 1-13-84	⾪ 2FAA 1-13-85	⾫ 2FAB 1-13-86	⾬ 2FAC 1-13-87	⾭ 2FAD 1-13-88	⾮ 2FAE 1-13-89	1-13-90	⾯ 2FAF 1-13-91	⾰ 2FB0 1-13-92	⾱ 2FB1 1-13-93	⾲ 2FB2 1-13-94

CCCII[9][8][15] (prefixed with 0x212E)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		⾳ 2FB3 1-14-1	⾴ 2FB4 1-14-2	⾵ 2FB5 1-14-3	⾶ 2FB6 1-14-4	⾷/飠 2FB7/98E0 1-14-5	⾸ 2FB8 1-14-6	⾹ 2FB9 1-14-7	1-14-8	⾺ 2FBA 1-14-9	⾻ 2FBB 1-14-10	⾼ 2FBC 1-14-11	⾽ 2FBD 1-14-12	⾾ 2FBE 1-14-13	⾿ 2FBF 1-14-14	⿀ 2FC0 1-14-15
3_	⿁ 2FC1 1-14-16	1-14-17	⿂ 2FC2 1-14-18	⿃ 2FC3 1-14-19	⿄ 2FC4 1-14-20	⿅ 2FC5 1-14-21	⿆ 2FC6 1-14-22	⿇ 2FC7 1-14-23	1-14-24	⿈ 2FC8 1-14-25	⿉ 2FC9 1-14-26	⿊ 2FCA 1-14-27	⿋ 2FCB 1-14-28	1-14-29	⿌ 2FCC 1-14-30	⿍ 2FCD 1-14-31
4_	⿎ 2FCE 1-14-32	⿏ 2FCF 1-14-33	1-14-34	⿐ 2FD0 1-14-35	⿑ 2FD1 1-14-36	1-14-37	⿒ 2FD2 1-14-38	1-14-39	⿓ 2FD3 1-14-40	⿔ 2FD4 1-14-41	1-14-42	⿕ 2FD5 1-14-43	1-14-44	1-14-45	1-14-46	1-14-47
5_	甲 7532 1-14-48	乙 4E59 1-14-49	丙 4E19 1-14-50	丁 4E01 1-14-51	戊 620A 1-14-52	己 5DF1 1-14-53	庚 5E9A 1-14-54	辛 8F9B 1-14-55	壬 58EC 1-14-56	癸 7678 1-14-57	1-14-58	1-14-59	1-14-60	1-14-61	1-14-62	1-14-63
6_	子 5B50 1-14-64	丑 4E11 1-14-65	寅 5BC5 1-14-66	卯 536F 1-14-67	辰 8FB0 1-14-68	巳 5DF3 1-14-69	午 5348 1-14-70	未 672A 1-14-71	申 7533 1-14-72	酉 9149 1-14-73	戌 620C 1-14-74	亥 4EA5 1-14-75	1-14-76	1-14-77	1-14-78	1-14-79
7_	1-14-80	Ⅰ 2160 1-14-81	Ⅱ 2161 1-14-82	Ⅲ 2162 1-14-83	Ⅳ 2163 1-14-84	Ⅴ 2164 1-14-85	Ⅵ 2165 1-14-86	Ⅶ 2166 1-14-87	Ⅷ 2167 1-14-88	Ⅸ 2168 1-14-89	Ⅹ 2169 1-14-90	Ⅺ 216A 1-14-91	Ⅻ 216B 1-14-92	1-14-93	1-14-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x212F (plane 1, row 15: Chinese numerals and bopomofo)

This row includes Chinese numerals and bopomofo characters.[1] EACC includes only the ideographic zero (〇).[7]

CCCII/EACC[13][15][9] (prefixed with 0x212F)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		〡 3021 1-15-1	〢 3022 1-15-2	〣 3023 1-15-3	〤 3024 1-15-4	〥 3025 1-15-5	〦 3026 1-15-6	〧 3027 1-15-7	〨 3028 1-15-8	〩 3029 1-15-9	〸 3038 1-15-10	〹 3039 1-15-11	〺 303A 1-15-12	1-15-13	1-15-14	1-15-15
3_	〇 3007 1-15-16	一 4E00 1-15-17	二 4E8C 1-15-18	三 4E09 1-15-19	四 56DB 1-15-20	五 4E94 1-15-21	六 516D 1-15-22	七 4E03 1-15-23	八 516B 1-15-24	九 4E5D 1-15-25	十 5341 1-15-26	百 767E 1-15-27	千 5343 1-15-28	万 4E07 1-15-29	1-15-30	1-15-31
4_	零 96F6 1-15-32	壹 58F9 1-15-33	貳 8CB3 1-15-34	參 53C3 1-15-35	肆 8086 1-15-36	伍 4F0D 1-15-37	陸 9678 1-15-38	柒 67D2 1-15-39	捌 634C 1-15-40	玖 7396 1-15-41	拾 62FE 1-15-42	佰 4F70 1-15-43	仟 4EDF 1-15-44	萬 842C 1-15-45	億 5104 1-15-46	1-15-47
5_	ˊ 02CA 1-15-48	ˇ 02C7 1-15-49	ˋ 02CB 1-15-50	˙/﹒[lower-alpha 3] 02D9/FE52 1-15-51	ㄅ 3105 1-15-52	ㄆ 3106 1-15-53	ㄇ 3107 1-15-54	ㄈ 3108 1-15-55	ㄉ 3109 1-15-56	ㄊ 310A 1-15-57	ㄋ 310B 1-15-58	ㄌ 310C 1-15-59	ㄍ 310D 1-15-60	ㄎ 310E 1-15-61	ㄏ 310F 1-15-62	ㄐ 3110 1-15-63
6_	ㄑ 3111 1-15-64	ㄒ 3112 1-15-65	ㄓ 3113 1-15-66	ㄔ 3114 1-15-67	ㄕ 3115 1-15-68	ㄖ 3116 1-15-69	ㄗ 3117 1-15-70	ㄘ 3118 1-15-71	ㄙ 3119 1-15-72	ㄚ 311A 1-15-73	ㄛ 311B 1-15-74	ㄜ 311C 1-15-75	ㄝ 311D 1-15-76	ㄞ 311E 1-15-77	ㄟ 311F 1-15-78	ㄠ 3120 1-15-79
7_	ㄡ 3121 1-15-80	ㄢ 3122 1-15-81	ㄣ 3123 1-15-82	ㄤ 3124 1-15-83	ㄥ 3125 1-15-84	ㄦ 3126 1-15-85	ㄧ 3127 1-15-86	ㄨ 3128 1-15-87	ㄩ 3129 1-15-88	1-15-89	ü 00FC 1-15-90	1-15-91	1-15-92	1-15-93	1-15-94

Letter Number Punctuation Symbol Other Undefined Included in EACC.

Character set 0x272B (plane 7, row 11: reference mark)

This row contains the reference mark (kome jirushi).[9]

CCCII[9] (prefixed with 0x272B)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
6_	7-11-64	7-11-65	7-11-66	7-11-67	7-11-68	7-11-69	7-11-70	7-11-71	7-11-72	7-11-73	7-11-74	7-11-75	7-11-76	7-11-77	7-11-78	※ 203B 7-11-79

Character set 0x272E–0x272F (plane 7, rows 14–15: alternative bopomofo)

A variant used by libraries in Hong Kong does not include bopomofo characters in plane 1 row 15, but includes them in a different layout in plane 7.[8]

CCCII (Hong Kong)[8] (prefixed with 0x272E)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
7_	7-14-80	7-14-81	7-14-82	7-14-83	7-14-84	ㄅ 3105 7-14-85	ㄆ 3106 7-14-86	ㄇ 3107 7-14-87	ㄈ 3108 7-14-88	ㄉ 3109 7-14-89	ㄊ 310A 7-14-90	ㄋ 310B 7-14-91	ㄌ 310C 7-14-92	ㄍ 310D 7-14-93	ㄎ 310E 7-14-94

CCCII (Hong Kong) (prefixed with 0x272F)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		ㄏ 310F 7-15-1	ㄐ 3110 7-15-2	ㄑ 3111 7-15-3	ㄒ 3112 7-15-4	ㄓ 3113 7-15-5	ㄔ 3114 7-15-6	ㄕ 3115 7-15-7	ㄖ 3116 7-15-8	ㄗ 3117 7-15-9	ㄘ 3118 7-15-10	ㄙ 3119 7-15-11	ㄚ 311A 7-15-12	ㄛ 311B 7-15-13	ㄜ 311C 7-15-14	ㄝ 311D 7-15-15
3_	ㄞ 311E 7-15-16	ㄟ 311F 7-15-17	ㄠ 3120 7-15-18	ㄡ 3121 7-15-19	ㄢ 3122 7-15-20	ㄣ 3123 7-15-21	ㄤ 3124 7-15-22	ㄥ 3125 7-15-23	ㄦ 3126 7-15-24	ㄧ 3127 7-15-25	ㄨ 3128 7-15-26	ㄩ 3129 7-15-27	7-15-28	7-15-29	7-15-30	7-15-31

Letter Number Punctuation Symbol Other Undefined

Character set 0x6921 (plane 73, row 1: Japanese punctuation)

This row is in plane 73, the first plane of layer 13, which contains characters included for Japanese language support.[12] It contains punctuation.[7] Compare row 1 of JIS X 0208, which this row tends to follow the layout of for the characters it includes.

CCCII/EACC[13][9][8] (prefixed with 0x6921)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		73-1-1	73-1-2	73-1-3	73-1-4	73-1-5	・ 30FB 73-1-6	73-1-7	73-1-8	73-1-9	73-1-10	73-1-11	73-1-12	73-1-13	73-1-14	73-1-15
3_	73-1-16	73-1-17	73-1-18	73-1-19	73-1-20	73-1-21	73-1-22	73-1-23	73-1-24	々 3005 73-1-25	〆 3006 73-1-26	73-1-27	ー 30FC 73-1-28	73-1-29	73-1-30	73-1-31
4_	73-1-32	73-1-33	73-1-34	73-1-35	73-1-36	73-1-37	73-1-38	73-1-39	73-1-40	73-1-41	73-1-42	73-1-43	73-1-44	73-1-45	73-1-46	73-1-47
5_	73-1-48	73-1-49	〈 3008 73-1-50	〉 3009 73-1-51	《 300A 73-1-52	》 300B 73-1-53	73-1-54	73-1-55	73-1-56	73-1-57	73-1-58	73-1-59	73-1-60	73-1-61	73-1-62	73-1-63
6_	73-1-64	73-1-65	73-1-66	73-1-67	73-1-68	73-1-69	73-1-70	73-1-71	73-1-72	73-1-73	73-1-74	73-1-75	73-1-76	73-1-77	73-1-78	73-1-79
7_	73-1-80	73-1-81	73-1-82	73-1-83	73-1-84	73-1-85	73-1-86	73-1-87	73-1-88	73-1-89	73-1-90	73-1-91	73-1-92	73-1-93	73-1-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x6924 (plane 73, row 4: hiragana)

This row contains hiragana. Compare row 4 of JIS X 0208.

CCCII/EACC[13][9][8] (prefixed with 0x6924)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		ぁ 3041 73-4-1	あ 3042 73-4-2	ぃ 3043 73-4-3	い 3044 73-4-4	ぅ 3045 73-4-5	う 3046 73-4-6	ぇ 3047 73-4-7	え 3048 73-4-8	ぉ 3049 73-4-9	お 304A 73-4-10	か 304B 73-4-11	が 304C 73-4-12	き 304D 73-4-13	ぎ 304E 73-4-14	く 304F 73-4-15
3_	ぐ 3050 73-4-16	け 3051 73-4-17	げ 3052 73-4-18	こ 3053 73-4-19	ご 3054 73-4-20	さ 3055 73-4-21	ざ 3056 73-4-22	し 3057 73-4-23	じ 3058 73-4-24	す 3059 73-4-25	ず 305A 73-4-26	せ 305B 73-4-27	ぜ 305C 73-4-28	そ 305D 73-4-29	ぞ 305E 73-4-30	た 305F 73-4-31
4_	だ 3060 73-4-32	ち 3061 73-4-33	ぢ 3062 73-4-34	っ 3063 73-4-35	つ 3064 73-4-36	づ 3065 73-4-37	て 3066 73-4-38	で 3067 73-4-39	と 3068 73-4-40	ど 3069 73-4-41	な 306A 73-4-42	に 306B 73-4-43	ぬ 306C 73-4-44	ね 306D 73-4-45	の 306E 73-4-46	は 306F 73-4-47
5_	ば 3070 73-4-48	ぱ 3071 73-4-49	ひ 3072 73-4-50	び 3073 73-4-51	ぴ 3074 73-4-52	ふ 3075 73-4-53	ぶ 3076 73-4-54	ぷ 3077 73-4-55	へ 3078 73-4-56	べ 3079 73-4-57	ぺ 307A 73-4-58	ほ 307B 73-4-59	ぼ 307C 73-4-60	ぽ 307D 73-4-61	ま 307E 73-4-62	み 307F 73-4-63
6_	む 3080 73-4-64	め 3081 73-4-65	も 3082 73-4-66	ゃ 3083 73-4-67	や 3084 73-4-68	ゅ 3085 73-4-69	ゆ 3086 73-4-70	ょ 3087 73-4-71	よ 3088 73-4-72	ら 3089 73-4-73	り 308A 73-4-74	る 308B 73-4-75	れ 308C 73-4-76	ろ 308D 73-4-77	ゎ 308E 73-4-78	わ 308F 73-4-79
7_	ゐ 3090 73-4-80	ゑ 3091 73-4-81	を 3092 73-4-82	ん 3093 73-4-83	73-4-84	73-4-85	73-4-86	73-4-87	73-4-88	73-4-89	73-4-90	73-4-91	73-4-92	73-4-93	73-4-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x6925 (plane 73, row 5: katakana)

This row contains katakana. Compare row 5 of JIS X 0208, which this row corresponds to, besides the addition of the separate dakuten and handakuten.

CCCII/EACC[13][9] (prefixed with 0x6925)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		ァ 30A1 73-5-1	ア 30A2 73-5-2	ィ 30A3 73-5-3	イ 30A4 73-5-4	ゥ 30A5 73-5-5	ウ 30A6 73-5-6	ェ 30A7 73-5-7	エ 30A8 73-5-8	ォ 30A9 73-5-9	オ 30AA 73-5-10	カ 30AB 73-5-11	ガ 30AC 73-5-12	キ 30AD 73-5-13	ギ 30AE 73-5-14	ク 30AF 73-5-15
3_	グ 30B0 73-5-16	ケ 30B1 73-5-17	ゲ 30B2 73-5-18	コ 30B3 73-5-19	ゴ 30B4 73-5-20	サ 30B5 73-5-21	ザ 30B6 73-5-22	シ 30B7 73-5-23	ジ 30B8 73-5-24	ス 30B9 73-5-25	ズ 30BA 73-5-26	セ 30BB 73-5-27	ゼ 30BC 73-5-28	ソ 30BD 73-5-29	ゾ 30BE 73-5-30	タ 30BF 73-5-31
4_	ダ 30C0 73-5-32	チ 30C1 73-5-33	ヂ 30C2 73-5-34	ッ 30C3 73-5-35	ツ 30C4 73-5-36	ヅ 30C5 73-5-37	テ 30C6 73-5-38	デ 30C7 73-5-39	ト 30C8 73-5-40	ド 30C9 73-5-41	ナ 30CA 73-5-42	ニ 30CB 73-5-43	ヌ 30CC 73-5-44	ネ 30CD 73-5-45	ノ 30CE 73-5-46	ハ 30CF 73-5-47
5_	バ 30D0 73-5-48	パ 30D1 73-5-49	ヒ 30D2 73-5-50	ビ 30D3 73-5-51	ピ 30D4 73-5-52	フ 30D5 73-5-53	ブ 30D6 73-5-54	プ 30D7 73-5-55	ヘ 30D8 73-5-56	ベ 30D9 73-5-57	ペ 30DA 73-5-58	ホ 30DB 73-5-59	ボ 30DC 73-5-60	ポ 30DD 73-5-61	マ 30DE 73-5-62	ミ 30DF 73-5-63
6_	ム 30E0 73-5-64	メ 30E1 73-5-65	モ 30E2 73-5-66	ャ 30E3 73-5-67	ヤ 30E4 73-5-68	ュ 30E5 73-5-69	ユ 30E6 73-5-70	ョ 30E7 73-5-71	ヨ 30E8 73-5-72	ラ 30E9 73-5-73	リ 30EA 73-5-74	ル 30EB 73-5-75	レ 30EC 73-5-76	ロ 30ED 73-5-77	ヮ 30EE 73-5-78	ワ 30EF 73-5-79
7_	ヰ 30F0 73-5-80	ヱ 30F1 73-5-81	ヲ 30F2 73-5-82	ン 30F3 73-5-83	ヴ 30F4 73-5-84	ヵ 30F5 73-5-85	ヶ 30F6 73-5-86	◌゙/゛ 3099/309B 73-5-87	◌゚/゜ 309A/309C 73-5-88	73-5-89	73-5-90	73-5-91	73-5-92	73-5-93	73-5-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x6F24–0x6F25 (plane 79, rows 4–5: jamo)

These rows contains Korean jamo.

EACC[13] (prefixed with 0x6F24)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
5_	79-4-48	79-4-49	79-4-50	79-4-51	79-4-52	79-4-53	79-4-54	ㄱ 3131 79-4-55	ㄴ 3134 79-4-56	ㄷ 3137 79-4-57	ㄹ 3139 79-4-58	ㅁ 3141 79-4-59	ㅂ 3142 79-4-60	ㅅ 3145 79-4-61	ㅇ 3147 79-4-62	ㅈ 3148 79-4-63
6_	ㅊ 314A 79-4-64	ㅋ 314B 79-4-65	ㅌ 314C 79-4-66	ㅍ 314D 79-4-67	ㅎ 314E 79-4-68	ㄲ 3132 79-4-69	79-4-70	79-4-71	79-4-72	ㄸ 3138 79-4-73	79-4-74	79-4-75	79-4-76	79-4-77	ㅃ 3143 79-4-78	79-4-79
7_	ㅆ 3146 79-4-80	ㅉ 3149 79-4-81	ㅏ 314F 79-4-82	ㅐ 3150 79-4-83	ㅑ 3151 79-4-84	79-4-85	ㅓ 3153 79-4-86	ㅔ 3154 79-4-87	ㅕ 3155 79-4-88	79-4-89	ㅗ 3157 79-4-90	ㅘ 3158 79-4-91	79-4-92	79-4-93	ㅛ 315B 79-4-94

EACC[13] (prefixed with 0x6F25)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		ㅜ 315C 79-5-1	79-5-2	79-5-3	79-5-4	ㅠ 3160 79-5-5	ㅡ 3161 79-5-6	ㅢ 3162 79-5-7	ㅣ 3163 79-5-8	79-5-9	79-5-10	79-5-11	79-5-12	79-5-13	79-5-14	79-5-15

Letter Number Punctuation Symbol Other Undefined

Character set 0x6F76 (plane 79, row 86: archaic Hangul)

This row contains several historic Hangul characters no longer in regular use. Several of these are mapped to the Private Use Area.[17]

EACC[13] (prefixed with 0x6F76)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
2_		ㆁ 3181 79-86-1	ㆆ 3186 79-86-2	ㅿ 317F 79-86-3	� E8B0 79-86-4	ㆍ 318D 79-86-5	79-86-6	79-86-7	79-86-8	79-86-9	79-86-10	79-86-11	79-86-12	79-86-13	79-86-14	79-86-15
3_	79-86-16	79-86-17	79-86-18	79-86-19	79-86-20	79-86-21	79-86-22	79-86-23	79-86-24	79-86-25	79-86-26	79-86-27	79-86-28	79-86-29	79-86-30	79-86-31
4_	� E8B2 79-86-32	� E8B3 79-86-33	� E8B4 79-86-34	� E8B5 79-86-35	� E8B6 79-86-36	� E8B7 79-86-37	� E8B8 79-86-38	� E8B9 79-86-39	� E8BA 79-86-40	� E8BB 79-86-41	� E8BC 79-86-42	� E8BD 79-86-43	� E8BE 79-86-44	� E8BF 79-86-45	� E8C0 79-86-46	� E8C1 79-86-47
5_	� E8C2 79-86-48	� E8C3 79-86-49	� E8C4 79-86-50	� E8C5 79-86-51	� E8C6 79-86-52	� E8C7 79-86-53	� E8C8 79-86-54	� E8C9 79-86-55	79-86-56	79-86-57	79-86-58	79-86-59	79-86-60	79-86-61	79-86-62	79-86-63
6_	79-86-64	79-86-65	79-86-66	79-86-67	79-86-68	79-86-69	79-86-70	79-86-71	79-86-72	79-86-73	79-86-74	79-86-75	79-86-76	79-86-77	79-86-78	79-86-79
7_	79-86-80	79-86-81	79-86-82	79-86-83	79-86-84	79-86-85	79-86-86	79-86-87	79-86-88	79-86-89	79-86-90	79-86-91	79-86-92	79-86-93	79-86-94

Letter Number Punctuation Symbol Other Undefined

Character set 0x7B25 (plane 91, row 5: supplementary Katakana)

This row contains additional katakana used to write foreign phonemes.[9]

CCCII[9] (prefixed with 0x7B25)
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
7_	91-5-80	91-5-81	91-5-82	91-5-83	91-5-84	91-5-85	91-5-86	91-5-87	91-5-88	ヷ 30F7 91-5-89	ヸ 30F8 91-5-90	ヹ 30F9 91-5-91	91-5-92	91-5-93	91-5-94

Footnotes

Outside of the trail byte range of an ISO 2022 94ⁿ set, but noted as being in use by some implementations.[7]
Coding of the ideographic space specified in the ANSI standard for EACC.[7] This is used as an exclamation mark in CCCII,[9] in addition to the exclamation mark at 0x212B3D.[15] The Hong Kong HKIUG variant of CCCII follows EACC here.[8]
The Encode::HanExtra mappings use U+FE52 for this character.[9] However, it appears here following ˊ, ˇ and ˋ,[15] which the other three tone marks for bopomofo. The mapping U+02D9 is more commonly used for this tone mark in bopomofo ranges of encodings, for example Big5.[19]

References

Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 122–124. ISBN 978-0-596-51447-1.
Tang, Audrey (2007-11-10). "Encode::HanExtra - Extra sets of Chinese encodings". CCCII: The earliest (and most sophisticated) Traditional Chinese encoding... used mostly in library systems.... Map for "CCCII" is supplied by the Koha Taiwan project.
Wittern, Christian (1995-05-01). "Chinese character codes: an update". International Research Institute for Zen Buddhism / Hanazono University. Archived from the original on 2004-10-12.
Jenkins, John H.; Cook, Richard; Lunde, Ken (2020-03-05). "Unicode Han Database (Unihan)". Unicode Standard Annex #38.
"Archived copy". Archived from the original on 2016-06-15. Retrieved 2016-06-15.CS1 maint: archived copy as title (link)
Kangxi Dictionary, p. 1296, char. 1
Library of Congress (2007-12-05). "Code Table East Asian Punctuation Marks". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.
Hong Kong Innovative Users Group Unicode Task Force. "HKIUG Code Table for CJK Characters: Mapping to Unicode". University of Hong Kong Libraries.
Tang, Audrey; Koha Taiwan. "Map for CCCII". Encode::HanExtra. CPAN.
"2.4: Multiple byte graphic character sets". International Register of Coded Character Sets to be Used With Escape Sequences (ISO-IR) (PDF). ITSCJ/IPSJ. p. 14.
Library of Congress (2007-12-05). "Technique 2: Using standard alternate graphic character sets". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.
Lunde, Ken (1995-12-18). "2.5.2: CCCII". CJK.INF Version 1.9.
Library of Congress (2007-12-05). "East Asian Code Tables". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.
Library of Congress (2007-12-05). "Code Table Korean Hangul". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.
Characters shown are, in part, cross-referenced against a representative BDF font for CCCII, distributed by Koichi Yasuoka of Kyoto University.
Hong Kong Innovative Users Group (2013-01-07). "Introduction to Hong Kong Innovative Users Group". University of Hong Kong Libraries.
Library of Congress (2004-09-02). "Summary List of MARC 21 Characters Assigned to the Private Use Area (PUA)". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.
Morris, Susan (2007). "Finding JACKPHY: Online Cataloging to Include Arabic, Hebrew, Other Scripts". Library of Congress Information Bulletin. 66 (12).
van Kesteren, Anne. "big5". Encoding Standard. WHATWG.

Some information on this page is based on the information on the CNS official website.

External links

CNS 11643 official web site (English version of pages available) has information about the CCCII character set in the "Chinese Information Code" section
Full mapping of EACC to Unicode, from Library of Congress

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[17] Outside of the trail byte range of an ISO 2022 94ⁿ set, but noted as being in use by some implementations.[7]

[18] Coding of the ideographic space specified in the ANSI standard for EACC.[7] This is used as an exclamation mark in CCCII,[9] in addition to the exclamation mark at 0x212B3D.[15] The Hong Kong HKIUG variant of CCCII follows EACC here.[8]

[22] The Encode::HanExtra mappings use U+FE52 for this character.[9] However, it appears here following ˊ, ˇ and ˋ,[15] which the other three tone marks for bopomofo. The mapping U+02D9 is more commonly used for this tone mark in bopomofo ranges of encodings, for example Big5.[19]

[lunde2009-1] Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 122–124. ISBN 978-0-596-51447-1.

[cpan-readme-2] Tang, Audrey (2007-11-10). "Encode::HanExtra - Extra sets of Chinese encodings". CCCII: The earliest (and most sophisticated) Traditional Chinese encoding... used mostly in library systems.... Map for "CCCII" is supplied by the Koha Taiwan project.

[hanazono-3] Wittern, Christian (1995-05-01). "Chinese character codes: an update". International Research Institute for Zen Buddhism / Hanazono University. Archived from the original on 2004-10-12.

[unihan-4] Jenkins, John H.; Cook, Richard; Lunde, Ken (2020-03-05). "Unicode Han Database (Unihan)". Unicode Standard Annex #38.

[5] "Archived copy". Archived from the original on 2016-06-15. Retrieved 2016-06-15.CS1 maint: archived copy as title (link)

[6] Kangxi Dictionary, p. 1296, char. 1

[eapm-7] Library of Congress (2007-12-05). "Code Table East Asian Punctuation Marks". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.

[hkiug-8] Hong Kong Innovative Users Group Unicode Task Force. "HKIUG Code Table for CJK Characters: Mapping to Unicode". University of Hong Kong Libraries.

[cpan-9] Tang, Audrey; Koha Taiwan. "Map for CCCII". Encode::HanExtra. CPAN.

[irreg-10] "2.4: Multiple byte graphic character sets". International Register of Coded Character Sets to be Used With Escape Sequences (ISO-IR) (PDF). ITSCJ/IPSJ. p. 14.

[marc-escs-11] Library of Congress (2007-12-05). "Technique 2: Using standard alternate graphic character sets". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.

[cjkinf246-12] Lunde, Ken (1995-12-18). "2.5.2: CCCII". CJK.INF Version 1.9.

[loc-13] Library of Congress (2007-12-05). "East Asian Code Tables". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.

[14] Library of Congress (2007-12-05). "Code Table Korean Hangul". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.

[bdf-15] Characters shown are, in part, cross-referenced against a representative BDF font for CCCII, distributed by Koichi Yasuoka of Kyoto University.

[hkiug-about-16] Hong Kong Innovative Users Group (2013-01-07). "Introduction to Hong Kong Innovative Users Group". University of Hong Kong Libraries.

[locpua-19] Library of Congress (2004-09-02). "Summary List of MARC 21 Characters Assigned to the Private Use Area (PUA)". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.

[20] Morris, Susan (2007). "Finding JACKPHY: Online Cataloging to Include Arabic, Hebrew, Other Scripts". Library of Congress Information Bulletin. 66 (12).

[21] van Kesteren, Anne. "big5". Encoding Standard. WHATWG.

Character encodings
Early telecommunications	Telegraph code Needle Morse Non-Latin Wabun/Kana Chinese Cyrillic Korean Baudot and Murray FIELDATA ASCII ISO/IEC 646 BCDIC 353 355 357 358 359 360 EBCDIC Teletex and Videotex/Teletext ISO/IEC 6937 / ITU T.51 ITU T.61 ITU T.101 World System Teletext background sets
ISO/IEC 8859	Approved -1 -2 -3 -4 -5 -6 -7 -8 -9 -10 -11 -13 -14 -15 -16 Abandoned -12 Adaptations ISO-IR-182 ISO-IR-200 ISO-IR-201 Proposed but not approved ISO-IR-111 ISO-IR-197 French/Dutch/Turkish draft
Bibliographic use	MARC-8 ANSEL CCCII/EACC ISO 5426 / 5426-2 / 5427 / 5428 / 6438 / 6861 / 6862 / 10585 / 10586 / 10754 / 11822
National standards	ArmSCII BraSCII CNS 11643 ELOT 927 GOST 10859 GB 2312 GB 12052 GB 18030 HKSCS I.S. 434 ISCII JIS X 0201 JIS X 0208 JIS X 0212 JIS X 0213 KOI-7 KPS 9566 KS X 1001 KS X 1002 LST 1284 LST 1564 LST 1590-1 LST 1590-2 LST 1590-3 LST 1590-4 PASCII RUSCII SI 960 TIS-620 TSCII VISCII VSCII YUSCII
ISO/IEC 2022	7-bit CN CN-EXT JP JP-EXT JP-1 JP-2 JP-3 KR ISO/IEC 4873 ISO/IEC 8859 ISO/IEC 10367 Extended Unix Code / EUC CN KR JP TW
Mac OS code pages ("scripts")	Armenian Arabic Barents Cyrillic Celtic CentEuro ChineseSimp / EUC-CN ChineseTrad / Big5 Croatian Cyrillic Devanagari / ISCII Dingbats Farsi (Persian) Gaelic Georgian Greek Gujarati / ISCII Gurmukhi / ISCII Hebrew Iceland Inuit Japanese / Shift JIS Keyboard Korean / EUC-KR Latin (Kermit) Maltese/Esperanto Ogham / I.S. 434 Roman Romanian Sámi Symbol Thai / TIS-620 Turkish Turkic Cyrillic Ukrainian VT100
DOS code pages	100 111 112 113 151 152 161 162 163 164 165 166 210 220 301 437 449 489 620 667 668 707 708 709 710 711 714 715 720 721 737 768 770 771 772 773 774 775 776 777 778 790 850 851 852 853 854 855/872 856 857 858 859 860 861 862 863 864 865 866/808 867 868 869 874/1161/1162 876 877 878 881 882 883 884 885 891 895 896 897 898 899 900 903 904 906 907 909 910 911 926 927 928 929 932 934 936 938 941 942 943 944 946 947 948 949 950/1370 951 966 991 1034 1039 1040 1041 1042 1043 1044 1046 1086 1088 1092 1093 1098 1108 1109 1114 1115 1116 1117 1118 1119 1125/848 1126 1127 1131/849 1139 1167 1168 1300 1351 1361 1362 1363 1372 1373 1374 1375 1380 1381 1385 1386 1391 1392 1393 1394 3012 3021 3843 3844 3845 3846 3847 3848 30000 30001 30002 30003 30004 30005 30006 30007 30008 30009 30010 30011 30012 30013 30014 30015 30016 30017 30018 30019 30020 30021 30022 30023 30024 30025 30026 30027 30028 30029 30030 30031 30032 30033 30034 30039 30040 58152 58210 58335 59234 59829 60258 60853 61282 62306 CS Indic CSX Indic CSX+ Indic CWI-2 Iran System Kamenický KOI8 Mazovia MIK
IBM AIX code pages	367 371 806 813 819 895 896 912 913 914 915 916 919 920 921/901 922/902 923 952 953 954 955 956 957 958 959 960 961 963 964 965 970 971 1004 1006 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1029 1036 1089 1111 1124 1129/1163 1133 1350 1382 1383
IBM code pages for other vendors' encodings	Apple Macintosh 1275 1280 1281 1282 1283 1284 1285 1286 Adobe 1038 1276 1277 DEC 1020 1021 1023 1090 1100 1101 1102 1103 1104 1105 1106 1107 1287 1288 HP 1050 1051 1052 1053 1054 1055 1056 1057 1058
Windows code pages	CER-GS 874/1162 (TIS-620) 932/943 (Shift JIS) 936/1386 (GBK) 950/1370 (Big5) 949/1363 (EUC-KR) 1169 1174 Extended Latin-8 1200 (UTF-16LE) 1201 (UTF-16BE) 1250 1251 1252 1253 1254 1255 1256 1257 1258 1261 1270 54936 (GB18030) Armenian Cyrillic + Finnish Cyrillic + French Cyrillic + German Polytonic Greek 65001 (UTF-8)
Microsoft code pages for other vendors' encodings	Apple Macintosh 10000 10001 10002 10003 10004 10005 10006 10007 10008 10010 10017 10021 10029 10079 10081 10082
EBCDIC code pages	37 390 391 392 393 394 395 435 829 834 835 837 839 881 882 883 884 885 886 887 888 889 890 931 933/1364 935/1388 937/1371 939/1399 1001 1003 1005 1007 1024 1027 1028 1030 1031 1032 1033 1037 1068 1071 1073 1074 1075 1076 1077 1078 1080 1082 1083 1085 1087 1091 1136 1150 1151 1152 1278 1279 1303 1364 1376 1377
DEC terminals (VTx)	Multinational (MCS) National Replacement (NRCS) French Canadian Swiss Spanish United Kingdom Dutch Finnish French Norwegian and Danish Swedish Norwegian and Danish (alternative) 8-bit Greek 8-bit Turkish 7-bit Hebrew 8-bit Hebrew Special Graphics Technical (TCS)
Platform specific	Acorn Adobe Standard Adobe Latin 1 Amstrad CPC Apple I Apple II Apple III ATASCII Atari ST BICS Casio calculators CDC Compucolor II CP/M+ DEC RADIX 50 DEC MCS/NRCS DG International ELWRO-Junior FIELDATA GEM GEOS GSM 03.38 HP Roman Extension HP Roman-8 HP Roman-9 HP FOCAL HP RPL IBM SQUOZE LICS LMBCS Mattel Aquarius Minitel MSX NEC APC NeXT OricSCII PCW PETSCII Sega SC-3000 Sharp calculators Sharp MZ Sinclair QL Teletext TI calculators TRS-80 Ventura International Ventura Symbol WISCII XCCS ZX80 ZX81 ZX Spectrum
Unicode / ISO/IEC 10646	UTF-1 UTF-7 UTF-8 UTF-16 (UTF-16LE/UTF-16BE) / UCS-2 UTF-32 (UTF-32LE/UTF-32BE) / UCS-4 UTF-EBCDIC GB 18030 BOCU-1 CESU-8 SCSU
TeX typesetting system	Cork IL1 IL2 IL3 L7X LGR LY1 OML OMS OMX OT1 OT2 OT3 OT4 PL0 QX T2A T2B T2C T2D T3 T4 T5 TS1 TS3 U X2
Miscellaneous code pages	ABICOMP APL 293 310 (Graphic Escape) 351 (GDDM) 907 (OEM) ISO-IR-68 ARIB STD-B24 HZ IEC-P27-1 INIS 7-bit 8-bit Cyrillic ISO-IR-169 ISO 2033 Johab Mojikyō SEASCII Stanford/ITS TACE16 TRON UTF-5 UTF-6 WTF-8
Control and nonprinting character sets	Morse prosigns C0 and C1 control codes ISO/IEC 6429 / ANSI X3.64 / ECMA-48 / JIS X 0211 ISO 6630 DIN 31626 JIS X 0207 ITU T.101 C0 C1 EBCDIC control codes Unicode control, format and separator characters Whitespace characters
Related topics	Code page Windows code page CCSID Character encodings in HTML Charset detection Han unification Hardware Mojibake
Character sets