| Index: > A B C D E F G H I J K L M N O P Q R S T U V W X Y Z |
|
|||||
| First Prev [ 1 2 ] Next Last |
CJK is a collective term for Chinese, Japanese, and Korean, which comprise the largest of East Asian languages. The term is used in the field of software and communications internationalization.
The term CJKV is used to mean CJK plus Vietnamese, which used Chinese characters prior to adopting a written language solely on Romanization.
These languages all share the fact that their writing systems are based partly or entirely on Chinese characters — Hanzi in Chinese, Kanji in Japanese, and Hanja in Korean. Chinese requires between 4000 characters for a basic vocabulary to 40,000 characters for reasonably complete coverage. Whereas Japanese and Korean use fewer characters — complete literacy in Japan can be expected with 2000 characters — idiosyncratic use of Chinese characters in proper names requires many more. This number of characters cannot fit in the 256-character code space of 8-bit encodings, and therefore requires at least a 16-bit fixed width character encoding or multi-byte variable-length encodings. 16-bit fixed width encodings, such as Unicode up to and including version 2.0, are now deprecated due to the requirement that software in China support the GB18030GB18030 is the registered internet name for the official character set of the People's Republic of China. This character set is formally called "Chinese National Standard GB 18030-2000: Information Technology Chinese ideograms coded character set for info character set.
Although CJK encodings have common character sets, the encodings often used to represent them have been developed separately by different East Asian governments, and are mutually incompatible. Unicode has attempted, with some controversy, to unify the character sets in a process known as Han unificationHan unification is the process used by the authors of Unicode and the Universal Character Set to map multiple character sets of the CJK languages into a single set of unified glyphs. The Chinese characters are common to Chinese (where they are called " ha.
CJK character encodings should consist minimally of Han characters plus language-specific phonetic scripts such as pinyinPinyin (, pinyin) literally means "join together sounds" (a less literal translation being "phoneticize", "spell" or "transcription") in Chinese and usually refers to Hany pinyin (, literal meaning: " Han language pinyin"), which is a system of romanizati, bopomofo, hiraganaHiragana (, literally "smooth kana") are a Japanese syllabary, one of four Japanese writing systems (the others are katakana, kanji and romaji). Hiragana are used for: Japanese words for which there are no kanji, for example particles such as kara and suf, katakanaKatakana (, literally: "fragmentary kana") are a Japanese syllabary, one of four Japanese writing systems (the others are hiragana, kanji and romaji). Katakana are characterized by squarish lines and are the simplest of the Japanese scripts. Katakana are, and HangulHangul is the native alphabet used to write the Korean language (as opposed to the Hanja system borrowed from China). For other romanized spellings of "Hangul," please see Names below. While Hangul writing may appear ideographic to the uninitiated, it is.
CJK character encodings include:
The CJK character sets take up the bulk of the Unicode code space. There is much controversy among experts of Chinese characters about the desirability and technical merit of the " Han unification" process used to map multiple Chinese and Japanese characters sets into a single set of unified glyphs.