摘要: |
本研究提出了一种锡伯文、满文、传统蒙古文和托忒文的多语种统一字符编码方案。该方案的编码字符集是对应文种的规范字符,且采用统一的拉丁字母顺序排列,更适合国际化的发展趋势。采用这一方案的计算机处理系统与基于名义字符编码的系统相比,就锡伯文、满文而言,完全不需要自由变体选择符,对传统蒙古文等其他语种也能极大的减少自由变体选择符的使用,更符合一般用户的使用习惯,方便文字信息的查询、搜索和处理。 |
关键词: 规范字符 字符编码 多语种 |
DOI:10.13657/j.cnki.gxkxyxb.20180213.001 |
投稿时间:2017-12-26 |
基金项目:新疆自治区科技型中小企业创新基金项目"锡伯文、满文输入法的研发"(2014531069)资助。 |
|
Multilingual Unified Character Encoding Scheme based on a Normalized Character Set |
FU Yong1,2, GUO Gong1
|
(1.College of Mathematics & System Sciences, Xinjiang University, Urumqi, Xinjiang, 830046, China;2.China Xibo Language and Culture Research Center, Yili Normal College, Yili, Xinjiang, 835000, China) |
Abstract: |
The paper proposes a multilingual unified character coding scheme in Sibe, Manchu, traditional Mongolian and Todo. The coded character set of the scheme is the normalized characters of the corresponding language and is arranged in a unified Latin alphabetic order, which is more suitable for the international development trend. Compared with the system based on the nominal character encoding, the computer processing system adopting this scheme does not need a free variant selector as for Sibe and Manchu. It can also greatly reduce the use of free variation selectors in the traditional Mongolian and other languages, which is more consistent with the general user's habits, facilitating the query, search, and processing of textual information. |
Key words: normalized characters character encoding multilingual |