CJKV character 次 in traditional and simplified Chinese, Korean, Vietnamese and Japanese forms
The Chinese, Japanese and Korean (CJK) scripts share a common background and a large repertoire of characters, collectively known as CJK characters. During the process called Han unification, the characters shared among these scripts were identified and named CJK Unified Ideographs. As of Unicode 16.0, Unicode defines a total of 97,680 CJK unified ideographs.1
The term ideographs is a misnomer, as the Chinese script is not ideographic but rather logographic.
Until the early 20th century, Vietnam also used Chinese characters (chữ Nôm), so the abbreviation CJKV is sometimes used.
Sources
The Ideographic Research Group (IRG) is responsible for developing extensions to the encoded repertoires of CJK unified ideographs. The IRG processes proposals for new CJK unified ideographs submitted by its member bodies and, after several rounds of expert review, submits a consolidated set of characters to ISO/IEC JTC 1/SC 2 Working Group 2 (WG2) and the Unicode Technical Committee (UTC) for consideration for inclusion in the ISO/IEC 10646 and Unicode standards. The following IRG member bodies have been involved in the standardization of CJK unified ideographs:
- China
- Hong Kong
- Japan
- South Korea
- North Korea
- Macau
- Taiwan, liaison member represented by the Taipei Computer Association (TCA)
- Vietnam
- Unicode Technical Committee (liaison member, also representing the United States) 2 3
- United Kingdom
- SAT (liaison member)
The ideographs submitted by the UTC and the United Kingdom are not specific to any particular region, but are characters which have been suggested for encoding by individual experts. The ideographs submitted by SAT are required for the SAT Daizōkyō text database.
The table below gives the numbers of encoded CJK unified ideographs for each IRG source for Unicode 16.0.4 The total number of characters (260,840) far exceeds the number of encoded CJK unified ideographs (97,680) as many characters have more than one source.
Country or region | Character count |
---|---|
China | 66,564 |
Hong Kong | 17,654 |
Macau | 344 |
Taiwan (TCA) | 58,601 |
Japan | 52,560 |
South Korea | 20,874 |
North Korea | 23,975 |
Vietnam | 13,284 |
United Kingdom | 2,503 |
SAT | 3,455 |
UTC | 1,026 |
Total | 260,840 |
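The relationship between the two totals can be checked with a few lines of Python. This is only an arithmetic sketch: the figures are copied from the table above, and the name `source_counts` is illustrative.

```python
# Per-source counts of encoded CJK unified ideographs (Unicode 16.0),
# copied from the table above.
source_counts = {
    "China": 66_564,
    "Hong Kong": 17_654,
    "Macau": 344,
    "Taiwan (TCA)": 58_601,
    "Japan": 52_560,
    "South Korea": 20_874,
    "North Korea": 23_975,
    "Vietnam": 13_284,
    "United Kingdom": 2_503,
    "SAT": 3_455,
    "UTC": 1_026,
}

ENCODED_IDEOGRAPHS = 97_680  # number of encoded CJK unified ideographs

# Each character is counted once per source that references it.
total_references = sum(source_counts.values())
average_sources = total_references / ENCODED_IDEOGRAPHS

print(total_references)           # 260840
print(round(average_sources, 2))  # 2.67
```

On average, then, each encoded ideograph is referenced by between two and three sources.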
UTC sources
The majority of characters submitted by the UTC to the IRG are derived from Unicode Technical Committee (UTC) documents.5 Other sources include:
- ABC Chinese-English Dictionary by John DeFrancis
- The Adobe-CNS1 glyph collection
- The Adobe-Japan1 glyph collection
- A Complete Checklist of Species and Subspecies of Chinese Birds (中国鸟类系统检索)
- The Great Nom Dictionary (Đại Tự Điển Chữ Nôm)
- Annotations to Shuowen Jiezi (annotated by Duan Yucai)
- GB18030-2000
- Required Character List Supplied by the Church of Jesus Christ of Latter-day Saints (Hong Kong)
- New Commercial Dictionary (商务新词典), Hong Kong
- Modern Chinese Dictionary (现代汉语词典), by Chinese Academy of Social Sciences, Linguistics Research Institute, Dictionary Editorial Office
- Working Group (WG2) documents
Ordering
The ordering of CJK Unified Ideographs within Unicode blocks (not counting those added to the block later) was initially determined by consulting the following four dictionaries. Primarily, they were arranged in Kangxi Dictionary order, with the other dictionaries consulted, in order, for characters not found in the Kangxi Dictionary, to determine which Kangxi Dictionary character they should follow in the ordering.6
- Kangxi Dictionary
- Dai Kan-Wa Jiten
- Hanyu Da Zidian
- Dae Jaweon
This system is not used for more recently added Unicode blocks. The Ideographic Research Group no longer uses the Dae Jaweon,7 nor the Dai Kan-Wa Jiten,8 in its work. The Kangxi Dictionary and Hanyu Da Zidian are still used 7 both in existing character source references,9 and as potential replacements for existing source references discovered to be erroneous.10 Similarly, although a (real or virtual) Kangxi Dictionary index was previously provided as part of the submission data for UTC-source characters, this is no longer the case.11 Instead, the stroke type of the first residual stroke (the first stroke that does not form part of the radical) is supplied with all submitted characters, and is used to order characters with the same radical and stroke count within the new Unicode block.12
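The ordering rule for newer blocks can be illustrated with a small sort key. The sketch below is an assumption-laden illustration, not the IRG's actual tooling: the `Submission` record, its field names, the numeric coding of stroke types and the sample data are all hypothetical.

```python
from typing import NamedTuple


class Submission(NamedTuple):
    """Hypothetical record for one submitted character."""
    label: str             # placeholder identifier, not a real character
    radical: int           # Kangxi radical number (1-214)
    residual_strokes: int  # strokes that are not part of the radical
    first_stroke: int      # type of the first residual stroke (assumed coded 1-5)


def ordering_key(s: Submission) -> tuple[int, int, int]:
    # Radical first, then residual stroke count; the first residual
    # stroke type only breaks ties between otherwise equal entries.
    return (s.radical, s.residual_strokes, s.first_stroke)


submissions = [
    Submission("c", 85, 4, 3),
    Submission("a", 9, 7, 1),
    Submission("d", 85, 4, 1),
    Submission("b", 85, 3, 5),
]

ordered = [s.label for s in sorted(submissions, key=ordering_key)]
print(ordered)  # ['a', 'b', 'd', 'c']
```

Entries "d" and "c" share a radical and a stroke count, so only the first residual stroke type decides their relative order.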
The basic block named CJK Unified Ideographs (4E00–9FFF) contains 20,992 basic Chinese characters in the range U+4E00 through U+9FFF. The block not only includes characters used in the Chinese writing system but also kanji used in the Japanese writing system, hanja in Korea, and chữ Nôm characters in Vietnamese. Many characters in this block are shared by all of these writing systems, while others are used in only some of them.
This block is also known as the Unified Repertoire and Ordering (URO), especially when it needs to be differentiated from the other CJK Unified Ideographs blocks.13
The first 20,902 characters in the block are arranged according to the Kangxi Dictionary ordering of radicals. Within each radical, the characters written with the fewest strokes are listed first. The remaining characters were added later, and so are not in radical order.
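The figures for this block follow directly from its code point range. The Python sketch below derives them; the constant and function names are illustrative, and the calculation of the last originally ordered character assumes the first 20,902 characters occupy consecutive code points from U+4E00.

```python
URO_FIRST = 0x4E00
URO_LAST = 0x9FFF


def in_uro(ch: str) -> bool:
    """Return True if a single character lies in the basic CJK block."""
    return URO_FIRST <= ord(ch) <= URO_LAST


# Size of the block: both end points are included.
block_size = URO_LAST - URO_FIRST + 1
print(block_size)  # 20992

# Last character of the originally ordered set of 20,902 characters,
# assuming they are contiguous from the start of the block.
last_original = URO_FIRST + 20_902 - 1
print(f"U+{last_original:04X}")  # U+9FA5

print(in_uro("\u4e2d"), in_uro("\u3400"), in_uro("A"))  # True False False
```

U+3400 is rejected because it belongs to Extension A, which lies below the basic block.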
The block is the result of Han unification,14 which was somewhat controversial within East Asia.15 Since single characters used in more than one of Chinese, Japanese and Korean were coded in the same location, and the modern typographical conventions and handwriting curricula differ slightly between regions (not necessarily along language boundaries; for example, Hong Kong and Taiwan, which both use Traditional Chinese, have slightly different local conventions),16 the appearance of a selected glyph could depend on the particular font being used. However, the URO applies the source separation rule, meaning that pairs of characters treated as distinct in a character set used as a source for the URO (for example JIS X 0208, as used in Shift JIS) would remain pairs of separate characters in the new Unicode encoding.17
Using variation selectors, it is possible to specify certain variant CJK ideograms within Unicode.18 The Adobe-Japan1 character set, which has 14,684 ideographic variation sequences,19 is an extreme example of the use of variation selectors.20
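In encoded text, an ideographic variation sequence is simply a base ideograph followed by one of the variation selectors U+E0100 through U+E01EF. The Python sketch below builds such a sequence. The choice of U+845B as the base character is illustrative, and the code does not check whether the sequence is actually registered in a collection such as Adobe-Japan1.

```python
import unicodedata

BASE = "\u845b"      # an example base ideograph (illustrative choice)
VS17 = "\U000E0100"  # first selector of the Variation Selectors Supplement


def is_ideographic_vs(ch: str) -> bool:
    """True for the selectors U+E0100..U+E01EF used with ideographs."""
    return 0xE0100 <= ord(ch) <= 0xE01EF


# The sequence is two code points, although it displays as one glyph
# in a font that supports it.
ivs = BASE + VS17

print(len(ivs))                          # 2
print([f"U+{ord(c):04X}" for c in ivs])  # ['U+845B', 'U+E0100']
print(unicodedata.name(VS17))            # VARIATION SELECTOR-17
print(is_ideographic_vs(ivs[1]))         # True
```

Software that does not support the sequence is expected to ignore the selector and show the default glyph of the base character.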
Charts
4E00–62FF, 6300–77FF, 7800–8CFF, 8D00–9FFF.
Sources
Note: Most characters appear in multiple sources, so the sum of individual character counts (108,480) is far greater than the number of encoded characters (20,992).21
Country or region | Code | Source 23 | Character count | Total |
---|---|---|---|---|
China | G0 | GB 2312-80 | 6,763 | 20,933 |
China | G1 | GB 12345-90 (Traditional Chinese analogue to GB 2312-80) | 2,202 | |
China | G3 | GB 13131 (unpublished Traditional Chinese analogue to GB 7589-87) | 4,834 | |
China | G5 | GB 13132 (unpublished Traditional Chinese analogue to GB 7590-87) | 2,841 | |
China | G7 | Modern Chinese general character chart (Simplified Chinese: 现代汉语通用字表) | 42 | |
China | G8 | GB 8565-88 | 203 | |
China | GCE | National Academy for Educational Research | 4 | |
China | GDM | Place name characters from the Public Order Administration, Ministry of Public Security of the People's Republic of China | 2 | |
China | GE | GB 16500-95 | 3,770 | |
China | GFC | Modern Chinese Standard Dictionary (现代汉语规范词典第二版) | 2 | |
China | GGFZ | Tongyong Guifan Hanzi Zidian (通用规范汉字字典) | 1 | |
China | GH | GB/T 15564-1995 | 59 | |
China | GHZ | Hanyu Da Zidian (漢語大字典) | 1 | |
China | GHZR | Hanyu Da Zidian 2nd ed. (汉语大字典, 第二版) | 1 | |
China | GK | GB 12052-89 | 89 | |
China | GKJ | Terms in Sciences and Technologies (科技用字) approved by the China National Committee for Terms in Sciences and Technologies (CNCTST) | 16 | |
China | GKX | Kangxi Dictionary (康熙字典) | 5 | |
China | GLK | Longkan Shoujian (龍龕手鑑) | 1 | |
China | GT | Standard Telegraph Codebook (revised), 1983 | 8 | |
China | GU | No source (the original source reference may have been moved) | 88 | |
China | GZFY | Hanyu Fangyan Dacidian (汉语方言大词典) | 1 | |
Hong Kong | H | Hong Kong Supplementary Character Set, 2008 | 2,292 | 15,376 |
Hong Kong | HB0 | Computer Chinese Glyph and Character Code Mapping Table, Technical Report C-26 (電腦用中文字型與字碼對照表, 技術通報C-26) | 9 | |
Hong Kong | HB1 | Big-5, Level 1 | 5,401 | |
Hong Kong | HB2 | Big-5, Level 2 | 7,650 | |
Hong Kong | HD | Hong Kong Supplementary Character Set, 2016 | 24 | |
Japan | J0 | JIS X 0208-1990 | 6,356 | 18,249 |
Japan | J1 | JIS X 0212-1990 | 3,058 | |
Japan | J13 | JIS X 0213:2004 level-3 characters replacing J1 characters | 1,037 | |
Japan | J13A | JIS X 0213:2004 level-3 character addendum from JIS X 0213:2000 level-3 replacing J1 character | 2 | |
Japan | J14 | JIS X 0213:2004 level-4 characters replacing J1 characters | 1,704 | |
Japan | J3 | JIS X 0213:2004 Level 3 | 95 | |
Japan | J3A | JIS X 0213:2004 Level 3 addendum | 7 | |
Japan | J4 | JIS X 0213:2004 Level 4 | 301 | |
Japan | JARIB | ARIB STD-B24 | 3 | |
Japan | JMJ | Character Information Development and Maintenance Project for e-Government "MojiJoho-Kiban Project" (文字情報基盤整備事業) | 5,686 | |
South Korea | K0 | KS C 5601-87 (now KS X 1001:2004) | 4,620 | 15,442 |
South Korea | K1 | KS C 5657-91 (now KS X 1002:2001) | 2,855 | |
South Korea | K2 | PKS C 5700-1:1994 (now KS X 1027-1:2011) | 7,911 | |
South Korea | K3 | PKS C 5700-2:1994 (now KS X 1027-2:2011) | 1 | |
South Korea | K4 | PKS C 5700-3:1998 (now KS X 1027-3:2011) | 4 | |
South Korea | K6 | KS X 1027-5:2014 | 49 | |
South Korea | KC | Korean History On-Line (한국 역사 정보 통합 시스템) | 1 | |
South Korea | KU | No source (the original source reference may have been moved) | 1 | |
North Korea | KP0 | KPS 9566-97 | 4,652 | 15,010 |
North Korea | KP1 | KPS 10721-2000 | 10,358 | |
Macau | MA | HKSCS-2008 | 29 | 200 |
Macau | MB1 | Big Five | 10 | |
Macau | MB2 | Big Five | 7 | |
Macau | MC | MCSCS Reference | 3 | |
Macau | MD | MCSCS horizontal extensions | 127 | |
Macau | MDH | MCSCS horizontal extensions | 24 | |
Taiwan | T1 | CNS 11643-1992 plane 1 | 5,413 | 18,384 |
Taiwan | T2 | CNS 11643-1992 plane 2 | 7,651 | |
Taiwan | T3 | CNS 11643-1992 plane 3 | 4,144 | |
Taiwan | T4 | CNS 11643-1992 plane 4 | 894 | |
Taiwan | T5 | CNS 11643-1992 plane 5 | 64 | |
Taiwan | T6 | CNS 11643-1992 plane 6 | 31 | |
Taiwan | T7 | CNS 11643-1992 plane 7 | 16 | |
Taiwan | TB | CNS 11643-2007 plane 11 | 2 | |
Taiwan | TC | CNS 11643-2007 plane 12 | 2 | |
Taiwan | TE | CNS 11643-2007 plane 14 | 9 | |
Taiwan | TF | CNS 11643-2007 plane 15 | 158 | |
Vietnam | V0 | TCVN 5773:1993 | 599 | 4,808 |
Vietnam | V1 | TCVN 6056:1995 | 3,305 | |
Vietnam | V2 | VHN 01-1998 | 759 | |
Vietnam | V3 | VHN 02-1998 | 91 | |
Vietnam | V4 | Kho Chữ Hán Nôm Mã Hoá (Hán Nôm Coded Character Repertoire) | 19 | |
Vietnam | VN | Vietnamese horizontal extensions | 35 | |
n/a | UTC | UTC sources | 78 | 78 |
In Unicode 4.1, 14 HKSCS-2004 characters and 8 GB 18030 characters were assigned to code points between U+9FA6 and U+9FBB. Since then, further characters have been added to this block for various reasons, all summarized in the version history section below.
The block named CJK Unified Ideographs Extension A (3400–4DBF) contains 6,592 additional characters in the range U+3400 through U+4DBF.
Sources
Note: Most characters appear in more than one source, so the sum of individual character counts (23,954) is far greater than the number of encoded characters (6,592).21
Country or region | Code | Source 23 | Character count | Total |
---|---|---|---|---|
China | G3 | GB 13131 (unpublished Traditional Chinese analogue to GB 7589-87) | 2,391 | 6,197 |
China | G5 | GB 13132 (unpublished Traditional Chinese analogue to GB 7590-87) | 1,226 | |
China | G7 | Modern Chinese general character chart | 120 | |
China | GGFZ | Tongyong Guifan Hanzi Zidian (通用规范汉字字典) | 2 | |
China | GHZ | Hanyu Da Zidian (漢語大字典) | 340 | |
China | GKJ | Terms in Sciences and Technologies (科技用字) approved by the China National Committee for Terms in Sciences and Technologies (CNCTST) | 3 | |
China | GKX | Kangxi Dictionary (康熙字典) | 1,889 | |
China | GS | Singapore Chinese characters 1 | 226 | |
Hong Kong | H | Hong Kong Supplementary Character Set, 2008 | 572 | 572 |
Japan | J3 | JIS X 0213:2004 Level 3 | 2 | 5,856 |
Japan | J4 | JIS X 0213:2004 Level 4 | 78 | |
Japan | JA | Japanese IT Vendors Contemporary Ideographs, 1993 | 574 | |
Japan | JA3 | JIS X 0213:2004 level-3 characters replacing JA characters | 17 | |
Japan | JA4 | JIS X 0213:2004 level-4 characters replacing JA characters | 67 | |
Japan | JMJ | Character Information Development and Maintenance Project for e-Government "MojiJoho-Kiban Project" (文字情報基盤整備事業) | 5,118 | |
South Korea | K3 | PKS C 5700-2:1994 (now KS X 1027-2:2011) | 1,833 | 1,867 |
South Korea | K4 | PKS C 5700-3:1998 (now KS X 1027-3:2011) | 2 | |
South Korea | K6 | KS X 1027-5:2014 | 28 | |
South Korea | KC | Korean History On-Line (한국 역사 정보 통합 시스템) | 3 | |
South Korea | KU | No source (the original source reference may have been moved) | 1 | |
North Korea | KP0 | KPS 9566-97 | 1 | 3,191 |
North Korea | KP1 | KPS 10721-2000 | 3,190 | |
Macau | MA | HKSCS-2008 | 4 | 12 |
Macau | MD | MCSCS horizontal extensions | 8 | |
Taiwan | T3 | CNS 11643-1992 plane 3 | 2,179 | 5,916 |
Taiwan | T4 | CNS 11643-1992 plane 4 | 2,919 | |
Taiwan | T5 | CNS 11643-1992 plane 5 | 399 | |
Taiwan | T6 | CNS 11643-1992 plane 6 | 200 | |
Taiwan | T7 | CNS 11643-1992 plane 7 | 133 | |
Taiwan | TE | CNS 11643-2007 plane 14 | 1 | |
Taiwan | TF | CNS 11643-2007 plane 15 | 85 | |
United Kingdom | UK | IRG N2107R2 | 3 | 3 |
Vietnam | V0 | TCVN 5773:1993 | 140 | 319 |
Vietnam | V2 | VHN 01-1998 | 149 | |
Vietnam | V3 | VHN 02-1998 | 19 | |
Vietnam | V4 | Kho Chữ Hán Nôm Mã Hoá (Hán Nôm Coded Character Repertoire) | 5 | |
Vietnam | VN | Vietnamese horizontal extensions | 6 | |
n/a | UTC | UTC sources | 21 | 21 |
The block named CJK Unified Ideographs Extension B (20000–2A6DF) contains 42,720 characters in the range U+20000 through U+2A6DF. These include most of the characters used in the Kangxi Dictionary that are not in the basic CJK Unified Ideographs block, as well as many Hán-Nôm characters that were formerly used to write Vietnamese.
Charts
20000–215FF, 21600–230FF, 23100–245FF, 24600–260FF, 26100–275FF, 27600–290FF, 29100–2A6DF.
Sources
Note: Many characters appear in more than one source, so the sum of individual character counts (99,784) is far greater than the number of encoded characters (42,720).21
Country or region | Code | Source 23 | Character count | Total |
---|---|---|---|---|
China | G3 | GB 13131 (unpublished Traditional Chinese analogue to GB 7589-87) | 1 | 30,550 |
China | G4K | Siku Quanshu (四庫全書) | 477 | |
China | GBK | Encyclopedia of China (中國大百科全書) | 86 | |
China | GCH | Cihai (辭海) | 247 | |
China | GCY | Ciyuan (辭源) | 66 | |
China | GFZ | Founder Press System | 65 | |
China | GGFZ | Tongyong Guifan Hanzi Zidian (通用规范汉字字典) | 5 | |
China | GHC | Hanyu Da Cidian (漢語大詞典) | 553 | |
China | GHF | Hanwen fodian yinan suzi huishi yu yanjiu (漢文佛典疑難俗字彙釋與研究) | 1 | |
China | GHZ | Hanyu Da Zidian (漢語大字典) | 10,507 | |
China | GHZR | Hanyu Da Zidian 2nd ed. (汉语大字典, 第二版) | 1 | |
China | GKJ | Terms in Sciences and Technologies (科技用字) approved by the China National Committee for Terms in Sciences and Technologies (CNCTST) | 17 | |
China | GKX | Kangxi Dictionary (康熙字典) | 18,469 | |
China | GU | No source (the original source reference may have been moved) | 55 | |
Hong Kong | H | Hong Kong Supplementary Character Set, 2008 | 1,703 | 1,703 |
Japan | J3 | JIS X 0213:2004 Level 3 | 25 | 25,745 |
Japan | J3A | JIS X 0213:2004 Level 3 addendum | 1 | |
Japan | J4 | JIS X 0213:2004 Level 4 | 277 | |
Japan | JMJ | Character Information Development and Maintenance Project for e-Government "MojiJoho-Kiban Project" (文字情報基盤整備事業) | 25,442 | |
South Korea | K1 | KS C 5657-91 (now KS X 1002:2001) | 1 | 395 |
South Korea | K4 | PKS C 5700-3:1998 (now KS X 1027-3:2011) | 166 | |
South Korea | K6 | KS X 1027-5:2014 | 214 | |
South Korea | KC | Korean History On-Line (한국 역사 정보 통합 시스템) | 14 | |
North Korea | KP1 | KPS 10721-2000 | 5,765 | 5,765 |
Macau | MA | HKSCS-2008 | 9 | 38 |
Macau | MC | MCSCS Reference | 2 | |
Macau | MD | MCSCS horizontal extensions | 27 | |
Taiwan | T3 | CNS 11643-1992 plane 3 | 25 | 30,193 |
Taiwan | T4 | CNS 11643-1992 plane 4 | 3,408 | |
Taiwan | T5 | CNS 11643-1992 plane 5 | 8,111 | |
Taiwan | T6 | CNS 11643-1992 plane 6 | 5,934 | |
Taiwan | T7 | CNS 11643-1992 plane 7 | 6,299 | |
Taiwan | TA | CNS 11643-2007 plane 10 | 8 | |
Taiwan | TB | CNS 11643-2007 plane 11 | 6 | |
Taiwan | TC | CNS 11643-2007 plane 12 | 1 | |
Taiwan | TF | CNS 11643-2007 plane 15 | 6,401 | |
United Kingdom | UK | IRG N2107R2 | 12 | 12 |
Vietnam | V0 | TCVN 5773:1993 | 1,570 | 5,299 |
Vietnam | V1 | TCVN 6056:1995 | 1 | |
Vietnam | V2 | VHN 01-1998 | 2,286 | |
Vietnam | V3 | VHN 02-1998 | 422 | |
Vietnam | V4 | Kho Chữ Hán Nôm Mã Hoá (Hán Nôm Coded Character Repertoire) | 33 | |
Vietnam | VN | Vietnamese horizontal extensions | 987 | |
Buddhist canon | SAT | SAT Daizōkyō Text Database | 1 | 1 |
n/a | UTC | UTC sources | 83 | 83 |
The block named CJK Unified Ideographs Extension C (2A700–2B73F) contains 4,154 characters in the range U+2A700 through U+2B739. It was initially added in Unicode 5.2 (2009).
Sources
Note: Some characters appear in more than one source, so the sum of individual character counts (4,634) is greater than the number of encoded characters (4,154).21
Country or region | Code | Source 23 | Character count | Total |
---|---|---|---|---|
China | GBK | Encyclopedia of China (中國大百科全書) | 74 | 1,130 |
China | GCH | Cihai (辭海) | 264 | |
China | GCY | Ciyuan (辭源) | 1 | |
China | GCYY | Chinese Academy of Surveying and Mapping ideographs | 55 | |
China | GDM | Place name characters from the Public Order Administration, Ministry of Public Security of the People's Republic of China | 1 | |
China | GFZ | Founder Press System | 1 | |
China | GGFZ | Tongyong Guifan Hanzi Zidian (通用规范汉字字典) | 2 | |
China | GGH | Gudai Hanyu Cidian (古代汉语词典) | 51 | |
China | GHC | Hanyu Da Cidian (漢語大詞典) | 14 | |
China | GHZ | Hanyu Da Zidian (漢語大字典) | 1 | |
China | GHZR | Hanyu Da Zidian 2nd ed. (汉语大字典, 第二版) | 1 | |
China | GJZ | Commercial Press ideographs | 61 | |
China | GKJ | Terms in Sciences and Technologies (科技用字) approved by the China National Committee for Terms in Sciences and Technologies (CNCTST) | 6 | |
China | GKX | Kangxi Dictionary (康熙字典) | 6 | |
China | GXC | Xiandai Hanyu Cidian (现代汉语词典) | 25 | |
China | GZFY | Hanyu Fangyan Dacidian (汉语方言大词典) | 202 | |
China | GZJW | Yin Zhou Jinwen Jicheng Yinde (殷周金文集成引得) | 365 | |
Hong Kong | H | Hong Kong Supplementary Character Set, 2008 | 1 | 1 |
Japan | JK | Japanese Kokuji Collection | 367 | 431 |
Japan | JMJ | Character Information Development and Maintenance Project for e-Government "MojiJoho-Kiban Project" (文字情報基盤整備事業) | 64 | |
South Korea | K5 | Korean IRG Hanja Character Set (later became KS X 1027-4:2011) | 404 | 406 |
South Korea | K6 | KS X 1027-5:2014 | 1 | |
South Korea | KC | Korean History On-Line (한국 역사 정보 통합 시스템) | 1 | |
North Korea | KP1 | KPS 10721-2000 | 8 | 8 |
Macau | MC | MCSCS Reference | 17 | 21 |
Macau | MD | MCSCS horizontal extensions | 4 | |
Taiwan | T5 | CNS 11643-1992 plane 5 | 1 | 1,752 |
Taiwan | TC | CNS 11643-2007 plane 12 | 634 | |
Taiwan | TD | CNS 11643-2007 plane 13 | 766 | |
Taiwan | TE | CNS 11643-2007 plane 14 | 350 | |
Taiwan | TU | No source (the original source reference may have been moved) | 1 | |
United Kingdom | UK | IRG N2107R2 | 1 | 1 |
Vietnam | V0 | TCVN 5773:1993 | 4 | 795 |
Vietnam | V1 | TCVN 6056:1995 | 2 | |
Vietnam | V2 | VHN 01-1998 | 1 | |
Vietnam | V4 | Kho Chữ Hán Nôm Mã Hoá (Hán Nôm Coded Character Repertoire) | 782 | |
Vietnam | VN | Vietnamese horizontal extensions | 6 | |
n/a | UTC | UTC sources | 89 | 89 |
The block named CJK Unified Ideographs Extension D (2B740–2B81F) contains 222 characters in the range U+2B740 through U+2B81D that were added in Unicode 6.0 (2010).
Sources
Note: Some characters appear in more than one source, so the sum of individual character counts (239) is greater than the number of encoded characters (222).21
Country or region | Code | Source 23 | Character count | Total |
---|---|---|---|---|
China | GCH | Cihai (辭海) | 1 | 78 |
China | GDM | Place name characters from the Public Order Administration, Ministry of Public Security of the People's Republic of China | 1 | |
China | GIDC | ID System of the Ministry of Public Security of China | 9 | |
China | GKJ | Terms in Sciences and Technologies (科技用字) approved by the China National Committee for Terms in Sciences and Technologies (CNCTST) | 2 | |
China | GXC | Xiandai Hanyu Cidian (现代汉语词典) | 4 | |
China | GXM | Characters for use in personal names in China from Public Order Administration, Ministry of Public Security of the People's Republic of China | 22 | |
China | GZH | Zhonghua Zihai (中华字海) | 39 | |
Japan | JH | Hanyo-Denshi Program (汎用電子情報交換環境整備プログラム) | 107 | 117 |
Japan | JMJ | Character Information Development and Maintenance Project for e-Government "MojiJoho-Kiban Project" (文字情報基盤整備事業) | 10 | |
Taiwan | TB | CNS 11643-2007 plane 11 | 24 | 24 |
n/a | UTC | UTC sources | 20 | 20 |
The block named CJK Unified Ideographs Extension E (2B820–2CEAF) contains 5,762 characters in the range U+2B820 through U+2CEA1 that were added in Unicode 8.0 (2015).
Sources
Note: Some characters appear in more than one source, so the sum of individual character counts (5,919) is greater than the number of encoded characters (5,762).21
Country or region | Code | Source 23 | Character count | Total |
---|---|---|---|---|
China | GBK | Encyclopedia of China (中國大百科全書) | 15 | 2,822 |
China | GCH | Cihai (辭海) | 112 | |
China | GCY | Ciyuan (辭源) | 3 | |
China | GCYY | Chinese Academy of Surveying and Mapping ideographs | 98 | |
China | GDZ | Geology Press ideographs | 1 | |
China | GGFZ | Tongyong Guifan Hanzi Zidian (通用规范汉字字典) | 4 | |
China | GGH | Gudai Hanyu Cidian (古代汉语词典) | 175 | |
China | GHC | Hanyu Da Cidian (漢語大詞典) | 7 | |
China | GIDC | ID System of the Ministry of Public Security of China | 37 | |
China | GJZ | Commercial Press ideographs | 147 | |
China | GKJ | Terms in Sciences and Technologies (科技用字) approved by the China National Committee for Terms in Sciences and Technologies (CNCTST) | 2 | |
China | GKX | Kangxi Dictionary (康熙字典) | 22 | |
China | GRM | People's Daily ideographs | 3 | |
China | GU | No source (the original source reference may have been moved) | 1 | |
China | GWZ | Hanyu Da Cidian Press ideographs | 12 | |
China | GXC | Xiandai Hanyu Cidian (现代汉语词典) | 57 | |
China | GXH | Xinhua Zidian (新华字典) | 4 | |
China | GZFY | Hanyu Fangyan Dacidian (汉语方言大词典) | 712 | |
China | GZJW | Yin Zhou Jinwen Jicheng Yinde (殷周金文集成引得) | 1,410 | |
Hong Kong | HD | Hong Kong Supplementary Character Set, 2016 | 1 | 1 |
Japan | JK | Japanese Kokuji Collection | 415 | 503 |
Japan | JMJ | Character Information Development and Maintenance Project for e-Government "MojiJoho-Kiban Project" (文字情報基盤整備事業) | 88 | |
South Korea | KC | Korean History On-Line (한국 역사 정보 통합 시스템) | 7 | 7 |
Macau | MC | MCSCS Reference | 48 | 51 |
Macau | MD | MCSCS horizontal extensions | 3 | |
Taiwan | T3 | CNS 11643-1992 plane 3 | 2 | 1,261 |
Taiwan | TB | CNS 11643-2007 plane 11 | 2 | |
Taiwan | TC | CNS 11643-2007 plane 12 | 323 | |
Taiwan | TD | CNS 11643-2007 plane 13 | 595 | |
Taiwan | TE | CNS 11643-2007 plane 14 | 339 | |
United Kingdom | UK | IRG N2107R2 | 2 | 2 |
Vietnam | V0 | TCVN 5773:1993 | 6 | 1,036 |
Vietnam | V2 | VHN 01-1998 | 1 | |
Vietnam | V4 | Kho Chữ Hán Nôm Mã Hoá (Hán Nôm Coded Character Repertoire) | 1,023 | |
Vietnam | VN | Vietnamese horizontal extensions | 6 | |
n/a | UTC | UTC sources | 236 | 236 |
The block named CJK Unified Ideographs Extension F (2CEB0–2EBEF) contains 7,473 characters in the range U+2CEB0 through U+2EBE0 that were added in Unicode 10.0 (2017). It includes more than 1,000 Sawndip characters for Zhuang.
Sources
Note: Some characters appear in more than one source, so the sum of individual character counts (7,775) is greater than the number of encoded characters (7,473).21
Country or region | Code | Source 23 | Character count | Total |
---|---|---|---|---|
China | GCY | Ciyuan (辭源) | 122 | 1,309 |
China | GFC | Modern Chinese Standard Dictionary (现代汉语规范词典第二版) | 27 | |
China | GIDC | ID System of the Ministry of Public Security of China | 1 | |
China | GKJ | Terms in Sciences and Technologies (科技用字) approved by the China National Committee for Terms in Sciences and Technologies (CNCTST) | 5 | |
China | GLGYJ | Zhuang Liao Songs Research (壮族嘹歌研究) | 1 | |
China | GOCD | Oxford English-Chinese Chinese-English Dictionary (牛津英汉汉英词典) | 2 | |
China | GPGLG | Zhuang Folk Song Culture Series - Pingguo County Liao Songs (壮族民歌文化丛书•平果嘹歌) | 70 | |
China | GXHZ | Xinhua Da Zidian (新华大字典) | 51 | |
China | GZ | Ancient Zhuang Character Dictionary (古壮字字典) | 995 | |
China | GZJW | Yin Zhou Jinwen Jicheng Yinde (殷周金文集成引得) | 33 | |
China | GZYS | Chinese Ancient Ethnic Characters Research (中国民族古文字研究) | 2 | |
Hong Kong | HD | Hong Kong Supplementary Character Set, 2016 | 1 | 1 |
Japan | JMJ | Character Information Development and Maintenance Project for e-Government "MojiJoho-Kiban Project" (文字情報基盤整備事業) | 1,646 | 1,646 |
South Korea | KC | Korean History On-Line (한국 역사 정보 통합 시스템) | 1,810 | 1,810 |
Macau | MC | MCSCS Reference | 22 | 22 |
Taiwan | T3 | CNS 11643-1992 plane 3 | 1 | 3 |
Taiwan | T6 | CNS 11643-1992 plane 6 | 1 | |
Taiwan | TC | CNS 11643-2007 plane 12 | 1 | |
United Kingdom | UK | IRG N2107R2 | 2 | 2 |
Vietnam | V0 | TCVN 5773:1993 | 1 | 17 |
Vietnam | V4 | Kho Chữ Hán Nôm Mã Hoá (Hán Nôm Coded Character Repertoire) | 8 | |
Vietnam | VN | Vietnamese horizontal extensions | 8 | |
Buddhist canon | SAT | SAT Daizōkyō Text Database | 2,884 | 2,884 |
n/a | UTC | UTC sources | 81 | 81 |
A block named CJK Unified Ideographs Extension G was added as part of Unicode 13.0 to the Tertiary Ideographic Plane in the range U+30000 through U+3134F, containing 4,939 characters.22
Sources
Note: Some characters appear in more than one source, so the sum of individual character counts (5,081) is greater than the number of encoded characters (4,939).21
Country or region | Code | Source 23 | Character count | Total |
---|---|---|---|---|
China | GHZR | Hanyu Da Zidian 2nd ed. (汉语大字典, 第二版) | 878 | 2,082 |
China | GPGLG | Zhuang Folk Song Culture Series - Pingguo County Liao Songs (壮族民歌文化丛书•平果嘹歌) | 13 | |
China | GZ | Ancient Zhuang Character Dictionary (古壮字字典) | 1,191 | |
South Korea | KC | Korean History On-Line (한국 역사 정보 통합 시스템) | 435 | 435 |
Taiwan | T13 | CNS 11643 (pending new version) plane 19 | 347 | 353 |
Taiwan | TB | CNS 11643-2007 plane 11 | 3 | |
Taiwan | TC | CNS 11643-2007 plane 12 | 2 | |
Taiwan | TD | CNS 11643-2007 plane 13 | 1 | |
United Kingdom | UK | IRG N2107R2 | 1,566 | 1,566 |
Vietnam | V4 | Kho Chữ Hán Nôm Mã Hoá (Hán Nôm Coded Character Repertoire) | 6 | 76 |
Vietnam | VN | Vietnamese horizontal extensions | 70 | |
Buddhist canon | SAT | SAT Daizōkyō Text Database | 329 | 329 |
n/a | UTC | UTC sources | 240 | 240 |
A block named CJK Unified Ideographs Extension H was added as part of Unicode 15.0 to the Tertiary Ideographic Plane in the range U+31350 through U+323AF, containing 4,192 characters.23
Sources
Note: Some characters appear in more than one source, so the sum of individual character counts (4,309) is greater than the number of encoded characters (4,192).21
Country or region | Code | Source 23 | Character count | Total |
---|---|---|---|---|
China | GDM | Place name characters from the Public Order Administration, Ministry of Public Security of the People's Republic of China | 128 | 829 |
China | GHC | Hanyu Da Cidian (漢語大詞典) | 27 | |
China | GKJ | Terms in Sciences and Technologies (科技用字) approved by the China National Committee for Terms in Sciences and Technologies (CNCTST) | 30 | |
China | GLGYJ | Zhuang Liao Songs Research (壮族嘹歌研究) | 11 | |
China | GPGLG | Zhuang Folk Song Culture Series - Pingguo County Liao Songs (壮族民歌文化丛书•平果嘹歌) | 14 | |
China | GU | No source (the original source reference may have been moved) | 1 | |
China | GXM | Characters for use in personal names in China from Public Order Administration, Ministry of Public Security of the People's Republic of China | 216 | |
China | GZ | Ancient Zhuang Character Dictionary (古壮字字典) | 285 | |
China | GZA-1 | A Vibrant and Unbroken Transmission: Filial Piety and Zhuang Funeral Songs (生生不息的传承•孝与壮族行孝歌之研究) | 6 | |
China | GZA-2 | Annotated Long Zhuang Morality Songs (壮族伦理道德长诗传扬歌译注) | 38 | |
China | GZA-3 | Compendium of Old Zhuang Folksong Texts, Wooing Songs vol. 1, Liao Songs (壮族民歌古籍集成•情歌(一)嘹歌) | 2 | |
China | GZA-4 | Compendium of Old Zhuang Folksong Texts, Wooing Songs vol. 2, Fwen Nganx (壮族民歌古籍集成•情歌(二)欢𭪤) | 11 | |
China | GZA-6 | Zhuang Proverbs from China (中国壮族谚语) | 59 | |
China | GZA-7 | Ancient Remembrance: Zhuang Creation Myth Songs (远古的追忆•壮族创世神话古歌研究) | 1 | |
South Korea | KC | Korean History On-Line (한국 역사 정보 통합 시스템) | 512 | 512 |
North Korea | KP1 | KPS 10721-2000 | 1 | 1 |
Taiwan | T12 | CNS 11643 (pending new version) plane 18 | 7 | 714 |
Taiwan | T13 | CNS 11643 (pending new version) plane 19 | 696 | |
Taiwan | T4 | CNS 11643-1992 plane 4 | 1 | |
Taiwan | T6 | CNS 11643-1992 plane 6 | 1 | |
Taiwan | TB | CNS 11643-2007 plane 11 | 5 | |
Taiwan | TC | CNS 11643-2007 plane 12 | 3 | |
Taiwan | TE | CNS 11643-2007 plane 14 | 1 | |
United Kingdom | UK | IRG N2232R | 917 | 917 |
Vietnam | V0 | TCVN 5773:1993 | 6 | 931 |
Vietnam | V4 | Kho Chữ Hán Nôm Mã Hoá (Hán Nôm Coded Character Repertoire) | 74 | |
Vietnam | VN | Vietnamese horizontal extensions | 851 | |
Buddhist canon | SAT | SAT Daizōkyō Text Database | 241 | 241 |
n/a | UTC | UTC sources | 164 | 164 |
A block named CJK Unified Ideographs Extension I was added as part of Unicode 15.1 to the Supplementary Ideographic Plane in the range U+2EBF0 through U+2EE5F, containing 622 characters.24
Sources
Note: Some characters appear in more than one source, making the sum of individual character counts (625) more than the number of encoded characters (622).21
Country or region | Code | Source 25 | Character count | Total |
---|---|---|---|---|
China | GIDC23 | ID system of the Ministry of Public Security of China, 2023 | 622 | 622 |
Japan | JMJ | Character Information Development and Maintenance Project for e-Government "MojiJoho-Kiban Project" (文字情報基盤整備事業) | 1 | 1 |
n/a | UTC | UTC sources | 2 | 2 |
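The block ranges quoted for the basic block and Extensions A through I can be collected into a small lookup table. The Python sketch below is illustrative only: the names `CJK_UNIFIED_BLOCKS` and `block_of` are assumptions, and the function tests block membership, not whether a code point is actually assigned (several blocks end with reserved positions).

```python
from typing import Optional

# (first, last, block name), using the block ranges quoted above.
CJK_UNIFIED_BLOCKS = [
    (0x3400, 0x4DBF, "CJK Unified Ideographs Extension A"),
    (0x4E00, 0x9FFF, "CJK Unified Ideographs"),
    (0x20000, 0x2A6DF, "CJK Unified Ideographs Extension B"),
    (0x2A700, 0x2B73F, "CJK Unified Ideographs Extension C"),
    (0x2B740, 0x2B81F, "CJK Unified Ideographs Extension D"),
    (0x2B820, 0x2CEAF, "CJK Unified Ideographs Extension E"),
    (0x2CEB0, 0x2EBEF, "CJK Unified Ideographs Extension F"),
    (0x2EBF0, 0x2EE5F, "CJK Unified Ideographs Extension I"),
    (0x30000, 0x3134F, "CJK Unified Ideographs Extension G"),
    (0x31350, 0x323AF, "CJK Unified Ideographs Extension H"),
]


def block_of(ch: str) -> Optional[str]:
    """Return the unified-ideograph block containing ch, or None."""
    cp = ord(ch)
    for first, last, name in CJK_UNIFIED_BLOCKS:
        if first <= cp <= last:
            return name
    return None


print(block_of("\u4e2d"))      # CJK Unified Ideographs
print(block_of("\U00020000"))  # CJK Unified Ideographs Extension B
print(block_of("\U0002EBF0"))  # CJK Unified Ideographs Extension I
print(block_of("A"))           # None
```

A linear scan is used because the table has only ten entries; the twelve unified ideographs located in the CJK Compatibility Ideographs block are deliberately not covered by this table.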
The block named CJK Compatibility Ideographs (F900–FAFF) was created to retain round-trip compatibility with other standards.
However, twelve characters in this block actually have the "Unified Ideograph" property: U+FA0E 﨎, U+FA0F 﨏, U+FA11 﨑, U+FA13 﨓, U+FA14 﨔, U+FA1F 﨟, U+FA21 﨡, U+FA23 﨣, U+FA24 﨤, U+FA27 﨧, U+FA28 﨨, and U+FA29 﨩.1 None of the other characters in this and other "Compatibility" blocks relate to CJK unification.
While 龜 and 亀 are not considered unifiable, U+FA20 蘒 CJK COMPATIBILITY IDEOGRAPH-FA20 is considered a duplicate of U+8612 蘒 CJK UNIFIED IDEOGRAPH-8612.
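The difference between ordinary compatibility ideographs and the twelve unified ideographs in this block is visible in the Unicode Character Database. The sketch below uses Python's standard `unicodedata` module; the two sample code points are chosen for illustration. An ordinary compatibility ideograph such as U+F900 carries a canonical decomposition to a unified ideograph, whereas U+FA0E has none.

```python
import unicodedata

compat = "\uf900"                   # CJK COMPATIBILITY IDEOGRAPH-F900
unified_in_compat_block = "\ufa0e"  # one of the twelve unified ideographs

# A true compatibility ideograph decomposes canonically, so Unicode
# normalization replaces it with the corresponding unified ideograph.
print(unicodedata.decomposition(compat))                 # 8C48
print(unicodedata.normalize("NFC", compat) == "\u8c48")  # True

# The unified ideographs in this block have no decomposition and
# therefore survive normalization unchanged.
print(unicodedata.decomposition(unified_in_compat_block) == "")  # True
print(unicodedata.normalize("NFC", unified_in_compat_block)
      == unified_in_compat_block)                                # True
```

One practical consequence is that text containing compatibility ideographs does not round-trip through normalization.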
Sources
Note: All characters appear in more than one source, so the sum of individual character counts (40) is greater than the number of encoded characters (12).21
Country or region | Code | Source 23 | Character count | Total |
---|---|---|---|---|
China | GU | No source (the original source reference may have been moved) | 12 | 12 |
Japan | J3 | JIS X 0213:2004 Level 3 | 3 | 12 |
Japan | J4 | JIS X 0213:2004 Level 4 | 3 | |
Japan | JA | Japanese IT Vendors Contemporary Ideographs, 1993 | 1 | |
Japan | JA3 | JIS X 0213:2004 level-3 characters replacing JA characters | 1 | |
Japan | JMJ | Character Information Development and Maintenance Project for e-Government "MojiJoho-Kiban Project" (文字情報基盤整備事業) | 4 | |
Taiwan | TF | CNS 11643-2007 plane 15 | 1 | 1 |
Vietnam | V0 | TCVN 5773:1993 | 3 | 3 |
n/a | UTC | UTC sources | 12 | 12 |
Known issues
Disunification
U+4039
The character U+4039 (䀹) was a unification of two different characters (one with the jiā 夾 phonetic and one with the shǎn 㚒 phonetic) until Unicode 5.0. However, they were lexically different characters that should not have been unified; they have different pronunciations and different meanings.
The proposal to disunify U+4039 26 was accepted for Unicode 5.1, encoding a new character at U+9FC3 (鿃) to represent the shǎn form.
In CJK Unified Ideographs Extension B, some characters are incorrectly unified with others. These characters include U+2017B (𠅻), U+204AF (𠒯) and U+24CB2 (𤲲). The first two wrongly unify glyphs from mainland Chinese and Vietnamese sources, while the last wrongly unifies glyphs from mainland Chinese and Taiwanese sources.27
Also in CJK Unified Ideographs Extension B, hundreds of glyph variants were encoded by mistake.28 Additionally, an ISO/IEC JTC 1/SC 2 report has found that six exact duplicates (where the same character has inadvertently been encoded twice) and two semi-duplicates (where the CJK-B character represents a de facto disunification of two glyph forms unified in the corresponding BMP character) were encoded by mistake:29
- U+34A8 㒨 = U+20457 𠑗: U+20457 is the same as the China-source glyph for U+34A8, but it is significantly different from the Taiwan-source glyph for U+34A8
- U+3DB7 㶷 = U+2420E 𤈎: same glyph shapes
- U+8641 虁 = U+27144 𧅄: U+27144 is the same as the Korean-source glyph for U+8641, but it is significantly different from the Chinese Mainland-, Taiwan- and Japan-source glyphs for U+8641
- U+204F2 𠓲 = U+23515 𣔕: same glyph shapes, but ordered under different radicals
- U+249BC 𤦼 = U+249E9 𤧩: same glyph shapes
- U+24BD2 𤯒 = U+2A415 𪐕: same glyph shapes, but ordered under different radicals
- U+26842 𦡂 = U+26866 𦡦: same glyph shapes
- U+FA23 﨣 = U+27EAF 𧺯: same glyph shapes (U+FA23 﨣 is a unified CJK ideograph, despite its name “CJK COMPATIBILITY IDEOGRAPH-FA23.”)
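The remark that U+FA23 is a unified ideograph despite its name can be illustrated with Python's standard `unicodedata` module. This is a minimal sketch: the helper `has_decomposition` is a hypothetical name, U+F900 is chosen here only as a contrasting example, and the results depend on the Unicode database bundled with the Python build.

```python
import unicodedata


def has_decomposition(ch: str) -> bool:
    """Return True if the character decomposes to another character.

    Ordinary CJK compatibility ideographs carry a decomposition mapping
    to a unified ideograph; characters treated as unified ideographs do not.
    """
    return unicodedata.decomposition(ch) != ""


# U+F900 is an ordinary compatibility ideograph; U+FA23 is the case from the list.
for ch in ("\uF900", "\uFA23"):
    print(f"U+{ord(ch):04X}", unicodedata.name(ch), has_decomposition(ch))
```

Because U+FA23 has no decomposition mapping, normalization leaves it unchanged, whereas U+F900 is replaced by its unified counterpart.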
Apart from the ten blocks of “Unified Ideographs,” Unicode has about a dozen more blocks of CJK characters that are not unified ideographs. These are mainly CJK radicals, strokes, punctuation, marks, symbols and compatibility characters. Although some of these characters have (decomposable) counterparts in other blocks, their usage can differ. One example is U+3007 〇 IDEOGRAPHIC NUMBER ZERO in the CJK Symbols and Punctuation block. Although it is not part of “CJK Unified Ideographs”, it is treated as a CJK character for all other intents and purposes.30
Four blocks of compatibility characters are included for compatibility with legacy text handling systems and older character sets:
- CJK Compatibility (3300–33FF)
- CJK Compatibility Forms (FE30–FE4F)
- CJK Compatibility Ideographs (F900–FAFF)
- CJK Compatibility Ideographs Supplement (2F800–2FA1F)
They include forms of characters for vertical text layout and rich text characters that Unicode recommends handling through other means. Therefore, their use is discouraged.
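A text-processing tool that wants to flag these discouraged characters could test code points against the four ranges listed above. The following is a minimal sketch; the table `COMPATIBILITY_BLOCKS` and the function `compatibility_block` are illustrative names, not part of any standard library.

```python
from typing import Optional

# Block ranges copied from the list above (inclusive, hexadecimal).
COMPATIBILITY_BLOCKS = [
    (0x3300, 0x33FF, "CJK Compatibility"),
    (0xFE30, 0xFE4F, "CJK Compatibility Forms"),
    (0xF900, 0xFAFF, "CJK Compatibility Ideographs"),
    (0x2F800, 0x2FA1F, "CJK Compatibility Ideographs Supplement"),
]


def compatibility_block(ch: str) -> Optional[str]:
    """Return the name of the compatibility block containing ch, or None."""
    code_point = ord(ch)
    for start, end, name in COMPATIBILITY_BLOCKS:
        if start <= code_point <= end:
            return name
    return None


print(compatibility_block("\u3300"))      # inside the first range
print(compatibility_block("\u4E00"))      # outside all four ranges
```

Block membership is only a coarse filter: a range check says where a character is encoded, not how it behaves, so individual characters may still need to be examined case by case.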
Font support
The blocks CJK Unified Ideographs and CJK Unified Ideographs Extension A, being parts of the Basic Multilingual Plane, are supported by the majority of CJK fonts. However, Japanese and Korean fonts usually have fewer characters (about 13,000 and 8,000, respectively) than Chinese fonts. Extensions B, C and D are supported by the additional fonts MingLiU-ExtB, MingLiU_HKSCS-ExtB, PMingLiU-ExtB and SimSun-ExtB, which have been included in Microsoft Windows since Vista.31
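Whether a character lies in the Basic Multilingual Plane or in a supplementary plane follows directly from its code point, since each plane spans 0x10000 code points. The sketch below assumes the usual plane numbering (plane 0 for the BMP, plane 2 for the Supplementary Ideographic Plane, plane 3 for the Tertiary Ideographic Plane); `plane_of` and `plane_name` are hypothetical helper names.

```python
# Abbreviations for the planes that hold CJK unified ideographs.
PLANE_NAMES = {0: "BMP", 2: "SIP", 3: "TIP"}


def plane_of(ch: str) -> int:
    """Return the plane number of a character (code point divided by 0x10000)."""
    return ord(ch) >> 16


def plane_name(ch: str) -> str:
    """Return the plane abbreviation, or a generic label for other planes."""
    plane = plane_of(ch)
    return PLANE_NAMES.get(plane, f"Plane {plane}")


for ch in ("\u4E00", "\U0002017B", "\U00030000"):
    print(f"U+{ord(ch):04X}", plane_name(ch))
```

A font that covers only the BMP cannot display any character for which `plane_of` returns a non-zero value.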
Unicode version | Addition | Plane | Characters added | Total characters |
---|---|---|---|---|
1.0 (1991) | CJK Unified Ideographs | Basic Multilingual Plane (BMP) | 20,902 | 20,914 |
1.0 (1991) | CJK Compatibility Ideographs | BMP | 12 | |
3.0 (1999) | CJK Unified Ideographs Extension A | BMP | 6,582 | 27,496 |
3.1 (2001) | CJK Unified Ideographs Extension B | Supplementary Ideographic Plane (SIP) | 42,711 | 70,207 |
4.1 (2005) | CJK Unified Ideographs: Ideographs from HKSCS-2004 and GB 18030-2000 not in ISO 10646 | BMP | 22 | 70,229 |
5.1 (2008) | CJK Unified Ideographs: Ideographs from Adobe Japan and disunification of U+4039 | BMP | 8 | 70,237 |
5.2 (2009) | CJK Unified Ideographs Extension C | SIP | 4,149 | 74,394 |
5.2 (2009) | 8 other characters from ARIB #47, #95, #93 and HKSCS | BMP | 8 | |
6.0 (2010) | CJK Unified Ideographs Extension D | SIP | 222 | 74,616 |
6.1 (2012) | 1 character corresponding to Adobe-Japan1-6 CID+20156 | BMP | 1 | 74,617 |
8.0 (2015) | CJK Unified Ideographs Extension E | SIP | 5,762 | 80,388 |
8.0 (2015) | 9 other characters | BMP | 9 | |
10.0 (2017) | CJK Unified Ideographs Extension F | SIP | 7,473 | 87,882 |
10.0 (2017) | 21 other characters | BMP | 21 | |
11.0 (2018) | CJK Unified Ideographs | BMP | 5 | 87,887 |
13.0 (2020) | CJK Unified Ideographs | BMP | 13 | 92,856 |
13.0 (2020) | CJK Unified Ideographs Extension A | BMP | 10 | |
13.0 (2020) | CJK Unified Ideographs Extension B | SIP | 7 | |
13.0 (2020) | CJK Unified Ideographs Extension G | Tertiary Ideographic Plane (TIP) | 4,939 | |
14.0 (2021) | CJK Unified Ideographs | BMP | 3 | 92,865 |
14.0 (2021) | CJK Unified Ideographs Extension B | SIP | 2 | |
14.0 (2021) | CJK Unified Ideographs Extension C | SIP | 4 | |
15.0 (2022) | CJK Unified Ideographs Extension C | SIP | 1 | 97,058 |
15.0 (2022) | CJK Unified Ideographs Extension H | TIP | 4,192 | |
15.1 (2023) | CJK Unified Ideographs Extension I | SIP | 622 | 97,680 |
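The running totals in the table can be cross-checked by accumulating the per-version additions. In this sketch the figures are copied from the table, while `ADDITIONS` and `find_total_mismatches` are illustrative names.

```python
# (version, characters added per table row, stated running total), copied from the table.
ADDITIONS = [
    ("1.0", [20_902, 12], 20_914),
    ("3.0", [6_582], 27_496),
    ("3.1", [42_711], 70_207),
    ("4.1", [22], 70_229),
    ("5.1", [8], 70_237),
    ("5.2", [4_149, 8], 74_394),
    ("6.0", [222], 74_616),
    ("6.1", [1], 74_617),
    ("8.0", [5_762, 9], 80_388),
    ("10.0", [7_473, 21], 87_882),
    ("11.0", [5], 87_887),
    ("13.0", [13, 10, 7, 4_939], 92_856),
    ("14.0", [3, 2, 4], 92_865),
    ("15.0", [1, 4_192], 97_058),
    ("15.1", [622], 97_680),
]


def find_total_mismatches(rows):
    """Return (version, computed, stated) for each row whose running total disagrees."""
    mismatches = []
    running = 0
    for version, added, stated in rows:
        running += sum(added)
        if running != stated:
            mismatches.append((version, running, stated))
    return mismatches


print(find_total_mismatches(ADDITIONS))  # an empty list means every total is consistent
```

With the figures as given, every stated total matches the accumulated sum, ending at 97,680 for Unicode 15.1.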
See also
- Han unification
- List of Unicode characters
- List of CJK fonts
- Ideographic Research Group
- Chinese cultural sphere
External links
- UK-Source Ideographs (Documents IRG N2107R2 and IRG N2232R)
Footnotes
1. “Unicode 16.0 UCD: PropList.txt”. 2024-05-31. Retrieved 2024-09-14.
2. IRG Convenor (2024-12-10). “IRG Experts List”. ISO/IEC JTC1 / SC2 /WG2/ IRG N2769.
3. Lunde, Ken (2024-09-13). “US/Unicode Activity Report for IRG #63 Meeting” (PDF). ISO/IEC JTC1 / SC2 /WG2/ IRG N2700.
4. “Unicode 16.0 UCD: Unihan: Unihan_IRGSources.txt”. 2024-07-31. Retrieved 2024-09-10.
5. Lunde, Ken (2024-07-31). “UAX #45: U-source Ideographs”. Unicode Consortium.
6. “18.1.7. Han Ideograph Arrangement”. The Unicode Standard: Core Specification. Version 16.0.0. Unicode Consortium.
7. “3.3. Dictionary Indices”. Unicode Han Database (Unihan). UAX #38. Three of the dictionary properties represent official IRG indices for the dictionaries used in the four dictionary sorting algorithm. Two (kIRGHanyuDaZidian and kIRGKangXi) are still being used by the IRG, but the other one (kIRGDaeJaweon) is not.
8. Lunde, Ken (2022-09-01). “Proposal to remove/improve provisional Unihan database properties” (PDF). p. 6. UTC L2/22-188. In addition, the IRG no longer uses this dictionary for its ongoing work.
9. “kIRG_GSource”. Unicode Han Database (Unihan). UAX #38. GKX: Kangxi Dictionary ideographs (康熙字典) 9th edition (1958) including the addendum (康熙字典)補遺. GHZ: Hanyu Dazidian ideographs (漢語大字典).
10. Lunde, Ken (2018-02-22). “Proposed kIRG_GSource Changes & Corrections” (PDF). UTC L2/18-065; ISO/IEC JTC1 / SC2 /WG2/ IRG N2297.
11. “2. Text File Data”. U-Source Ideographs. Unicode Consortium. UAX #45. A KangXi dictionary index for the ideograph, as described in Unicode Standard Annex #38, “Unicode Han Database (Unihan)” [UAX38]. This field is no longer used and contains no data.
12. Lunde, Ken (2024-09-30). “Proposal to remove FS (first residual stroke) value from submissions” (PDF). ISO/IEC JTC1 / SC2 /WG2/ IRG N2713. This document proposes that the inclusion of first residual stroke (aka FS) values be removed from the submission requirements for new CJK Unified Ideographs […] The ISO/IEC 10646 Project Editor, when compiling an IRG working set into a new CJK Unified Ideographs extension block, uses the FS values to sort ideographs that share the same Radical-Stroke (Radical + SC) value.
13. Lunde, Ken (2012-09-16). “URO”. CJK Type Blog. Adobe Inc.
14. The Unicode Standard 4.0, Appendix A - Han Unification History
15. Suzanne Topping, “The secret life of Unicode”. Archived from the original on 2007-11-14. Retrieved 2010-05-12.
16. Lu, Qin (2015-06-08). “The Proposed Hong Kong Character Set” (PDF). ISO/IEC JTC1 / SC2 /WG2/ IRG N2074.
17. “Chapter 11 - East Asian scripts”, The Unicode standard, 4.0.
18. “Ideographic Variation Database”. 2022-09-13. Retrieved 2022-09-20.
19. “IVD Stats”. 2022-09-13. Retrieved 2022-09-20.
20. PRI 108: Combined registration of the Adobe Japan1 collection and of sequences in that collection
21. “Unihan_IRGSources.txt (from Unihan.zip)”. 2023-07-15. Retrieved 2024-09-10.
22. “Unicode 13.0.0”. 10 March 2020. Retrieved 10 March 2020.
23. “Unicode 15.0.0”. 13 September 2022. Retrieved 14 September 2022.
24. “Unicode 15.1.0”. 2023-09-12. Retrieved 2023-09-12.
25. “UAX #38: Unicode Han Database (Unihan)”. Unicode Consortium. 2024-07-31.
26. Andrew West and John Jenkins, proposal of disunification of U+4039
27. Eiso Chan (陈永聪), Comments on four error glyphs on CJK Unified Ideographs Ext B & E.
28. Taichi Kawabata. “IRGN1155 Possible Duplicates” (.zip). Retrieved 2019-06-22.
29. Cook, Richard (6 October 2003). “Defect Report on Duplicate Encoded CJK Forms” (PDF). ISO/IEC JTC1/SC2/WG2. Retrieved 2012-03-28.
30. GB/T 15835-2011《出版物上数字用法》. China Guojia Biaozhun. https://journals.usst.edu.cn/uploadfile/file/GBT%2015835-2011%E3%80%8A%E5%87%BA%E7%89%88%E7%89%A9%E4%B8%8A%E6%95%B0%E5%AD%97%E7%94%A8%E6%B3%95%E3%80%8B.pdf
31. Lunde, Ken (2009). CJKV Information Processing. O’Reilly. pp. 633–634. ISBN 978-0-596-51447-1.