CNS character set
The CNS 11643 character set (Chinese National Standard 11643), officially named the "Chinese Standard Interchange Code" (中文標準交換碼), is the official standard character set of the Republic of China.
(In practice, variants of Big5 are the de facto standard.)
CNS 11643 is a superset of ASCII designed to conform to ISO 2022.
It contains 16 planes, so the maximum possible number of encodable characters is 16×94×94 = 141376.
Planes 12 to 15 (35344 code points) are specifically designated for user-defined characters.
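The figures above follow directly from the 94×94 plane layout. As a quick check, the arithmetic can be sketched in Python; the constant names are illustrative only and are not taken from the standard.

```python
# Illustrative constants (names are assumptions, not from CNS 11643 itself).
ROWS_PER_PLANE = 94
CELLS_PER_ROW = 94
PLANE_COUNT = 16

# Each plane is a 94 x 94 grid of code points.
points_per_plane = ROWS_PER_PLANE * CELLS_PER_ROW

# Maximum number of encodable characters across all 16 planes.
total_points = PLANE_COUNT * points_per_plane

# Planes 12 to 15 inclusive are designated for user-defined characters.
user_defined_planes = range(12, 16)
user_defined_points = len(user_defined_planes) * points_per_plane

print(points_per_plane, total_points, user_defined_points)
```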
Unlike CCCII, CNS 11643 does not assign related code points to variant forms of the same character.
EUC-TW is a representation of CNS 11643 in Extended Unix Code (EUC) form.
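To illustrate how a CNS 11643 position might map to EUC-TW bytes, here is a minimal sketch. It assumes the commonly described EUC-TW layout: plane 1 is encoded as two bytes in the range 0xA1 to 0xFE, and any plane can be written as four bytes beginning with the single-shift byte 0x8E followed by a plane byte (0xA1 for plane 1 through 0xB0 for plane 16). The function name `euc_tw_bytes` and its plane/row/cell parameters are hypothetical and do not belong to any library.

```python
def euc_tw_bytes(plane: int, row: int, cell: int) -> bytes:
    """Encode a CNS 11643 (plane, row, cell) position as EUC-TW bytes.

    Hypothetical helper: assumes plane is 1..16 and row/cell are 1..94.
    """
    if not 1 <= plane <= 16:
        raise ValueError("plane must be in 1..16")
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError("row and cell must be in 1..94")

    # Row and cell are offset into the high (GR) byte range 0xA1..0xFE.
    hi = 0xA0 + row
    lo = 0xA0 + cell

    if plane == 1:
        # Plane 1 uses the short two-byte form.
        return bytes([hi, lo])

    # Other planes use SS2 (0x8E), a plane byte, then the two position bytes.
    return bytes([0x8E, 0xA0 + plane, hi, lo])


print(euc_tw_bytes(1, 1, 1).hex())
print(euc_tw_bytes(2, 1, 1).hex())
```

This sketch always picks the two-byte form for plane 1; the four-byte form for plane 1 is not produced here.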