广西科学院学报

引用本文：

付勇,郭公.基于规范字符集的多语种统一字符编码方案[J].广西科学院学报,2018,34(1):51-56. [点击复制]
FU Yong,GUO Gong.Multilingual Unified Character Encoding Scheme based on a Normalized Character Set[J].Journal of Guangxi Academy of Sciences,2018,34(1):51-56. [点击复制]

【打印本页】【在线阅读全文】【下载PDF全文】【查看/发表评论】【下载PDF阅读器】【关闭】

本文已被：浏览 1150次下载 2753次	码上扫一扫！
基于规范字符集的多语种统一字符编码方案
付勇^1,2, 郭公¹
0 字体:加大+\|默认\|缩小-
(1.新疆大学数学与系统科学学院, 新疆乌鲁木齐 830046;2.伊犁师范学院, 中国锡伯语言文化研究中心, 新疆伊宁 835000)

摘要:

本研究提出了一种锡伯文、满文、传统蒙古文和托忒文的多语种统一字符编码方案。该方案的编码字符集是对应文种的规范字符,且采用统一的拉丁字母顺序排列,更适合国际化的发展趋势。采用这一方案的计算机处理系统与基于名义字符编码的系统相比,就锡伯文、满文而言,完全不需要自由变体选择符,对传统蒙古文等其他语种也能极大的减少自由变体选择符的使用,更符合一般用户的使用习惯,方便文字信息的查询、搜索和处理。

关键词: 规范字符字符编码多语种

DOI：10.13657/j.cnki.gxkxyxb.20180213.001

投稿时间：2017-12-26

基金项目:新疆自治区科技型中小企业创新基金项目"锡伯文、满文输入法的研发"(2014531069)资助。

Multilingual Unified Character Encoding Scheme based on a Normalized Character Set

FU Yong^1,2, GUO Gong¹

(1.College of Mathematics & System Sciences, Xinjiang University, Urumqi, Xinjiang, 830046, China;2.China Xibo Language and Culture Research Center, Yili Normal College, Yili, Xinjiang, 835000, China)

Abstract:

The paper proposes a multilingual unified character coding scheme in Sibe, Manchu, traditional Mongolian and Todo. The coded character set of the scheme is the normalized characters of the corresponding language and is arranged in a unified Latin alphabetic order, which is more suitable for the international development trend. Compared with the system based on the nominal character encoding, the computer processing system adopting this scheme does not need a free variant selector as for Sibe and Manchu. It can also greatly reduce the use of free variation selectors in the traditional Mongolian and other languages, which is more consistent with the general user's habits, facilitating the query, search, and processing of textual information.

Key words: normalized characters character encoding multilingual

用微信扫一扫