CJK Compatibility Ideographs is a Unicode block created to contain mostly Han characters that were encoded in multiple locations in other established character encodings, in addition to their CJK Unified Ideographs assignments, in order to retain round-trip compatibility between Unicode and those encodings. However, it also contains 12 unified ideographs sourced from Japanese character sets from IBM.

The block has dozens of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD).1 2 These sequences specify the desired glyph variant for a given Unicode character.

Character sources

Sources for the original collection of CJK Compatibility Ideographs include:

  • South Korean KS X 1001 (U+F900–U+FA0B, 268 characters)
  • Taiwanese Big5 (U+FA0C–U+FA0D, 2 characters)
  • “IBM 32”: 32 Japanese characters from IBM (U+FA0E–U+FA2D; see below)

In ensuing versions of the standard, more characters have been added to the block from:

  • South Korean KS X 1001 (U+FA2E–U+FA2F, 2 characters)
  • Japanese JIS X 0213 (U+FA30–U+FA6A, 59 characters)
  • Japanese ARIB STD-B24 (U+FA6B–U+FA6D, 3 characters)
  • North Korean KPS 10721-2000 (U+FA70–U+FAD9, 106 characters)

IBM Japanese double-byte EBCDIC includes several kanji which do not exist in, or do not round-trip from, JIS X 0208. These were included as gaiji in extensions to Shift JIS and EUC-JP from IBM (e.g. code page 942), NEC, the Open Software Foundation, and Microsoft (e.g. Windows code page 932). However, they were not used as a source for the original Unified Repertoire and Ordering (URO). Instead, 32 of the IBM extension kanji, those which had not been included in the URO from other sources, were included in the CJK Compatibility Ideographs block in the range U+FA0E–U+FA2D.

Of these 32 characters:

  • 19 are unifiable with characters in the URO, and are therefore compatibility ideographs in the strict sense.

  • One (U+FA20 ïš  CJK COMPATIBILITY IDEOGRAPH-FA20) is a kyĆ«jitai form of a kokuji whose extended shinjitai form exists in the URO (U+8612 蘒 CJK UNIFIED IDEOGRAPH-8612). Both are hyƍgai kanji, and are variants of the jinmeiyƍ kanji U+8429 萩 CJK UNIFIED IDEOGRAPH-8429 (i.e. Kummerowia). U+FA20 was assigned a normalisation to U+8612, even though the 韜 and äș€ components, while both forms of radical 213, are not usually considered unifiable.3

  • The remaining 12 are kokuji characters which are actually unified ideographs (with the Unified_Ideograph property, and which do not change upon normalisation). In spite of their inclusion in the CJK Compatibility Ideographs block and their algorithmically generated character names beginning with ” CJK COMPATIBILITY IDEOGRAPH ”, they are not duplicates of characters in the original CJK Unified Ideographs block in any respect;4 5 11 of these 12 are completely non-duplicate, while U+FA23 ïšŁ CJK COMPATIBILITY IDEOGRAPH-FA23 was later unintentionally duplicated in CJK Unified Ideographs Extension B as U+27EAF đ§șŻ CJK UNIFIED IDEOGRAPH-27EAF. They are as follows:

  • U+FA0E  CJK COMPATIBILITY IDEOGRAPH-FA0E

  • U+FA0F  CJK COMPATIBILITY IDEOGRAPH-FA0F

  • U+FA11 ïš‘ CJK COMPATIBILITY IDEOGRAPH-FA11

  • U+FA13 ïš“ CJK COMPATIBILITY IDEOGRAPH-FA13

  • U+FA14 ïš” CJK COMPATIBILITY IDEOGRAPH-FA14

  • U+FA1F  CJK COMPATIBILITY IDEOGRAPH-FA1F

  • U+FA21 ïšĄ CJK COMPATIBILITY IDEOGRAPH-FA21

  • U+FA23 ïšŁ CJK COMPATIBILITY IDEOGRAPH-FA23

  • U+FA24  CJK COMPATIBILITY IDEOGRAPH-FA24

  • U+FA27 ïš§ CJK COMPATIBILITY IDEOGRAPH-FA27

  • U+FA28 ïšš CJK COMPATIBILITY IDEOGRAPH-FA28

  • U+FA29 ïš© CJK COMPATIBILITY IDEOGRAPH-FA29

Block

CJK Compatibility Ideographs [1] [2] [3]
Official Unicode Consortium code chart (PDF)
0123456789ABCDEF
U+F90x
U+F91x
U+F92xï€ ï€Ąï€ąï€Łï€€ï€„ï€Šï€§ï€šï€©ï€Șï€«ï€Źï€­ï€źï€Ż
U+F93xï€Čï€łï€Žï€”ï€¶ï€·ï€žï€čï€șï€»ï€Œï€œï€Ÿï€ż
U+F94x
U+F95x
U+F96xï„ ï„Ąï„ąï„Łï„€ï„„ï„Šï„§ï„šï„©ï„Șï„«ï„Źï„­ï„źï„Ż
U+F97xï„Čï„łï„Žï„”ï„¶ï„·ï„žï„čï„șï„»ï„Œï„œï„Ÿï„ż
U+F98x
U+F99x
U+F9AxïŠ ïŠĄïŠąïŠŁïŠ€ïŠ„ïŠŠïŠ§ïŠšïŠ©ïŠȘïŠ«ïŠŹïŠ­ïŠźïŠŻ
U+F9BxïŠČïŠłïŠŽïŠ”ïŠ¶ïŠ·ïŠžïŠčïŠșïŠ»ïŠŒïŠœïŠŸïŠż
U+F9Cx燎療蓼遼龍暈阮劉杻柳流溜琉留硫紐
U+F9Dx類六戮陸倫崙淪輪律慄栗率隆利吏履
U+F9Exï§ ï§Ąï§ąï§Łï§€ï§„ï§Šï§§ï§šï§©ï§Șï§«ï§Źï§­ï§źï§Ż
U+F9Fxï§°ï§±ï§Čï§łï§Žï§”ï§¶ï§·ï§žï§čï§șï§»ï§Œï§œï§Ÿï§ż
U+FA0x
U+FA1x
U+FA2xïš ïšĄïšąïšŁïš€ïš„ïšŠïš§ïššïš©ïšȘïš«ïšŹïš­ïšźïšŻ
U+FA3xïš°ïš±ïšČïšłïšŽïš”ïš¶ïš·ïšžïščïšșïš»ïšŒïšœïšŸïšż
U+FA4x懲敏既暑梅海渚漢煮爫琢碑社祉祈祐
U+FA5x祖祝禍禎穀突節練縉繁署者臭艹艹著
U+FA6xï© ï©Ąï©ąï©Łï©€ï©„ï©Šï©§ï©šï©©ï©Șï©«ï©Źï©­
U+FA7x並况ï©Čï©łï©Žï©”ï©¶ï©·ï©žï©čï©șï©»ï©Œï©œï©Ÿï©ż
U+FA8xïȘ€ïȘïȘ‚ïȘƒïȘ„ïȘ…ïȘ†ïȘ‡ïȘˆïȘ‰ïȘŠïȘ‹ïȘŒïȘïȘŽïȘ
U+FA9xïȘïȘ‘ïȘ’ïȘ“ïȘ”ïȘ•ïȘ–ïȘ—ïȘ˜ïȘ™ïȘšïȘ›ïȘœïȘïȘžïȘŸ
U+FAAxïȘ ïȘĄïȘąïȘŁïȘ€ïȘ„ïȘŠïȘ§ïȘšïȘ©ïȘȘïȘ«ïȘŹïȘ­ïȘźïȘŻ
U+FABxïȘ°ïȘ±ïȘČïȘłïȘŽïȘ”ïȘ¶ïȘ·ïȘžïȘčïȘșïȘ»ïȘŒïȘœïȘŸïȘż
U+FACx變贈輸遲醙鉶陼難靖韛響頋頻鬒龜𢡊
U+FADx𢡄𣏕㮝䀘䀹𥉉𥳐𧻓齃龎
U+FAEx
U+FAFx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
3.^ Yellow areas indicate the 12 unified CJK characters encoded in this block.

History

The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Compatibility Ideographs block:

Version1CountL2 IDWG2 IDIRG IDDocument
1.0.1U+F900..FA2D302N782Ksar, Mike (1991-10-12), Attachment to N 767 WG2-Paris meeting copies of working papers
L2/03-399Fok, Anthony (2003-10-13), Unihan reported errors / changes re kHKSCS entries
L2/03-367N2667Suignard, Michel; Muller, Eric; Jenkins, John (2003-10-22), CJK Ideograph source references corrections
L2/03-398Nguyen, D. (2003-10-29), Unihan reported errors / changes re kCowles
L2/03-417Muller, Eric (2003-10-31), Variation sequences for CJK Compatibility characters
L2/06-309RKarlsson, Kent (2006-11-07), Bug in DerivedNumericValues.txt
L2/06-324R2Moore, Lisa (2006-11-29), "Consensus 109-C18", UTC #109 Minutes, Add numeric values to 8 compatibility ideographs to match their canonical characters.
L2/08-238Cook, Richard; Lunde, Ken (2008-06-09), Recommendation For IRG To Use IVD Collections
L2/08-373N3525Lunde, Ken; Muller, Eric (2008-10-06), Handling CJK compatibility characters with variation sequences
L2/08-425Cook, Richard; Lunde, Ken (2008-11-18), IRG Use of IVD Collections
L2/09-003RMoore, Lisa (2009-02-12), "WG2 — Compatibility Ideographs", UTC #118 / L2 #215 Minutes
L2/09-080N3590Muller, Eric (2009-03-11), Difficulties with compatibility ideographs
L2/09-290Muller, Eric (2009-08-07), Draft IVD registration for Compatibility Characters
L2/11-243N4111Sources for Orphaned CJK Ideographs, 2011-06-14
L2/11-254Constable, Peter (2011-06-20), "Update to UTR #45 U-Source Ideographs requested", UTC Liaison Report from WG2
N4103"Resolution 58.05", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03
L2/17-090Chung, Jaemin (2017-04-07), Proposal to add informative notes and cross-reference to U+F92C and U+F9B8
L2/17-103Moore, Lisa (2017-05-18), "B.4.1 Proposal to add informative notes and cross-reference to U+F92C and U+F9B8", UTC #151 Minutes
3.2U+FA30..FA6A59L2/99-016N1935Paterson, Bruce (1998-11-30), Editorial corrigenda on CJK compatibility ideographs, and other items
L2/99-240Addition of fifty six KANJIs for compatibility, 1999-07-15
L2/99-232N2003Umamaheswaran, V. S. (1999-08-03), "7.2.2.1 Editorial corrigenda on CJK compatibility", Minutes of WG 2 meeting 36, Fukuoka, Japan, 1999-03-09--15
L2/99-311Addition of fifty six KANJIs for compatibility, 1999-08-23
L2/99-313N2095Sato, T. K. (1999-09-08), Addition of CJK ideographs which are already "unified"
L2/99-316Whistler, Ken (1999-09-13), Comments on JCS proposal
L2/99-322Collins, Lee (1999-10-11), Comments on JCS compatibility characters in L2/99-310 through L2/99-313
L2/99-365Moore, Lisa (1999-11-23), Comments on JCS Proposals
L2/99-383N2142N710The response to WG2 resolution M37.16: CJK compatibility ideographs from JIS (WG2 N2104), 1999-12-09
L2/00-010N2103Umamaheswaran, V. S. (2000-01-05), "8.8", Minutes of WG 2 meeting 37, Copenhagen, Denmark: 1999-09-13—16
L2/99-260RMoore, Lisa (2000-02-07), "JCS Proposals", Minutes of the UTC/L2 meeting in Mission Viejo, October 26-28, 1999
L2/00-101N2197Sato, T. K. (2000-03-15), Update: CJK COMPATIBILITY IDEOGRAPH request
L2/00-172N2221Sato, T. K. (2000-04-20), JIS COMPATIBILITY IDEOGRAPHS (draft for ammendment-1) [sic]
N2221RJIS COMPATIBILITY IDEOGRAPHS (draft for ammendment-1) [sic] revised, 2000-06-01
L2/00-190Moore, Lisa (2000-06-22), UTC Rescinds Acceptance of Four Duplicate Radicals from JIS X 213
L2/00-234N2203 (rtf,txt)Umamaheswaran, V. S. (2000-07-21), "7.3", Minutes from the SC2/WG2 meeting in Beijing, 2000-03-21 -- 24
L2/00-337N2273JIS compatibility ideographs, 2000-09-19
L2/00-378N2295Sato, T. K. (2000-10-26), Feedback from Japan on N2281 -- working draft on pDAM 1 -- CJK Compatibility
L2/01-420Whistler, Ken (2001-10-30), "1. SC2 M11-04", WG2 (Singapore) Resolution Consent Docket for UTC
L2/01-405RMoore, Lisa (2001-12-12), "Consensus 89-C20", Minutes from the UTC/L2 meeting in Mountain View, November 6-9, 2001
L2/06-321Whistler, Ken (2006-10-03), UCD Bug re JIS 0213
L2/06-324R2Moore, Lisa (2006-11-29), "Consensus 109-C16", UTC #109 Minutes, Give U+FA30..U+FA6A the ideographic property, and fix the wordbreak property.
4.1U+FA70..FAD9106L2/01-050N2253Umamaheswaran, V. S. (2001-01-21), "7.2.4 Proposal to add the Hanja column to 10646-1", Minutes of the SC2/WG2 meeting in Athens, September 2000
L2/01-350N2375Proposal to add 160 Compatibility Hanja code table of D P R of Korea into CJK Compatibility Ideographs, 2001-09-03
L2/02-154N2403Umamaheswaran, V. S. (2002-04-22), "TC 2", Draft minutes of WG 2 meeting 41, Hotel Phoenix, Singapore, 2001-10-15/19
N2478"Korea (DPRK):T2, USA T5", Proposed Disposition of comments on SC2 N 3584 (PDAM text for Amendment 2 to ISO/IEC 10646-1:2000), 2002-05-08
L2/02-232N2493Sato, T. K.; Kobayashi, Tatsuo; Pak, Tong Gi (2002-05-22), Proposal to add 122 compatibility Hanja code table of the D P R of Korea into the CJK Compatibility Ideographs of ISO/IEC 10646-1:2000
N2541"USA T.8", Proposed disposition of comments on SC2 N 3624 (FPDAM text for Amendment 2 to ISO/IEC 10646-1:2000), 2002-12-02
N2540Freytag, Asmus (2002-12-05), Corrections to CJK Compatibility Ideographs Table in FPDAM
L2/02-465N2566Collins, Lee; Freytag, Asmus (2002-12-09), Review of DPRK Compatibility Ideographs
L2/02-471N2572CJK Compatibility Ideographs (Unicode 3.2, page 399), 2002-12-18
L2/02-472N2573Report of DPRK compatibility characters ad hoc meeting, 2002-12-11
L2/02-468N2569Suignard, Michel (2002-12-12), "USA T.5 e, USA T.8", Proposed disposition of comments on SC2 N 3624 (FPDAM text for Amendment 2 to ISO/IEC 10646-1:2000)
L2/03-023N2569RSuignard, Michel (2003-01-27), "USA T.5 e, USA T.8", Disposition of Comments Report on 10646-1/FPDAM 2
L2/03-346Chang, Cora (2003-10-20), Analysis of characters in WG2 documents N2572, N2573
L2/03-346.1Chang, Cora (2003-10-20), Analysis of characters in WG2 documents N2572, N2573 [spreadsheet without glyphs]
L2/04-207N2776N1062Proposal to add 106 Compatibility Hanjas of D P R of Korea to CJK Compatibility Ideographs, 2004-05-25
L2/04-330Whistler, Ken (2004-08-03), "E", WG2 Consent Docket
L2/04-316Moore, Lisa (2004-08-19), "100-C12", UTC #100 Minutes
L2/05-050RN2924RFreytag, Asmus (2005-01-28), Charts - Amendments 1 and 2 to ISO/IEC 10646:2003
L2/10-367N3899KP1-0000, 2010-09-30
L2/11-243N4111Sources for Orphaned CJK Ideographs, 2011-06-14
L2/11-254Constable, Peter (2011-06-20), "Update to UTR #45 U-Source Ideographs requested", UTC Liaison Report from WG2
N4103"Resolution 58.05", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03
5.2U+FA6B..FA6D3N3353 (pdf,doc)Umamaheswaran, V. S. (2007-10-10), "M51.10", Unconfirmed minutes of WG 2 meeting 51 Hanzhou, China; 2007-04-24/27
L2/07-387Proposal to encode six CJK Ideographs in UCS, 2007-10-17
L2/08-184N3318R (pdf,appendix)Revised proposal to encode six CJK Ideographs in UCS, 2008-03-25
L2/08-318N3453 (pdf,doc)Umamaheswaran, V. S. (2008-08-13), "M52.2k", Unconfirmed minutes of WG 2 meeting 52
L2/08-161R2Moore, Lisa (2008-11-05), "Consensus 115-C14", UTC #115 Minutes
6.1U+FA2E..FA2F2L2/10-087N3747A solution proposed by R.O.Korea for incorrectly mapped compatibility chars, 2010-03-19
L2/10-108Moore, Lisa (2010-05-19), "Consensus 123-C8", UTC #123 / L2 #220 Minutes
N3803 (pdf,doc)"M56.08l", Unconfirmed minutes of WG 2 meeting no. 56, 2010-09-24

See also

References

Footnotes

  1. “Ideographic Variation Database”. Unicode Consortium. ↩

  2. “UTS #37, Unicode Ideographic Variation Database”. Unicode Consortium. ↩

  3. Ideographic Research Group (2024-11-19). “UCS Ideograph Non-Unifiable Component Variations Summary List (NUCV)“. UCV & NUCV Lists (PDF). ISO/IEC JTC1 / SC2 /WG2/ IRG N2746. ↩

  4. “PropList.txt”. Unicode Consortium. ↩

  5. Freytag, Asmus; McGowan, Rick; Whistler, Ken (2021-06-14). “Known Anomalies in Unicode Character Names”. Unicode Consortium. Unicode Technical Note #27. These 12 characters are unified CJK ideographs, not compatibility ideographs, despite their names. ↩