Journal of Shanghai Jiaotong University (Science) ›› 2017, Vol. 22 ›› Issue (1): 82-86. doi: 10.1007/s12204-017-1804-x

Chinese to Braille Translation Based on Braille Word Segmentation Using Statistical Model

WANG Xiangdong1,3* (王向东), YANG Yang2 (杨阳), ZHANG Jinchao3 (张金超), JIANG Wenbin3 (姜文斌), LIU Hong1,3 (刘宏), QIAN Yueliang1,3 (钱跃良)

  • (1. Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; 2. Jiangsu Enterprise Information Operation Center, China Telecom Corporation Limited, Nanjing 210037, China; 3. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China)
  • Online: 2017-02-28 Published: 2017-04-04
  • Contact: WANG Xiangdong1,3* (王向东) E-mail: xdwang@ict.ac.cn

Abstract: Automatic translation of Chinese text into Chinese Braille is important for blind people in China who use computers or smartphones to access information. This paper proposes a novel scheme for Chinese-to-Braille translation. Under the scheme, a Braille word segmentation model based on statistical machine learning is trained on a Braille corpus, and Braille word segmentation is carried out directly with the statistical model, without a Chinese word segmentation stage. This method avoids hand-crafting rules that involve syntactic and semantic information; instead, the statistical model learns the rules implicitly and automatically. To further improve performance, an algorithm that fuses the results of Chinese word segmentation and Braille word segmentation is also proposed. Experimental results show that the proposed method achieves an accuracy of 92.81% for Braille word segmentation and considerably outperforms current approaches based on the segmentation-merging scheme.

Key words: perceptron algorithm, Chinese Braille, word segmentation
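
The abstract names a perceptron-based statistical model for Braille word segmentation but gives no details of its features or decoding. As a rough illustration only, the sketch below shows one common way to cast word segmentation as character tagging with a plain perceptron: each character is tagged B (begins a word) or I (inside a word), and weights are updated whenever the predicted tag is wrong. The feature templates, the greedy per-character decoding, the class name `PerceptronSegmenter`, and the toy corpus are all assumptions made for this example; they are not taken from the paper, and a real system would be trained on a corpus segmented according to Braille word rules.

```python
# Illustrative sketch, NOT the authors' implementation.
# Assumptions: B/I character tagging, simple window features, greedy decoding,
# and a tiny hypothetical corpus standing in for a Braille-segmented corpus.

def features(chars, i):
    """Window features for the character at position i (hypothetical templates)."""
    prev_c = chars[i - 1] if i > 0 else "<s>"
    next_c = chars[i + 1] if i + 1 < len(chars) else "</s>"
    cur = chars[i]
    return [
        f"c0={cur}",
        f"c-1={prev_c}",
        f"c+1={next_c}",
        f"c-1c0={prev_c}{cur}",
        f"c0c+1={cur}{next_c}",
    ]


def to_tags(words):
    """Convert a list of words into one B/I tag per character."""
    tags = []
    for word in words:
        tags.extend(["B"] + ["I"] * (len(word) - 1))
    return tags


class PerceptronSegmenter:
    TAGS = ("B", "I")

    def __init__(self):
        self.weights = {}  # maps (feature, tag) -> weight

    def _score(self, feats, tag):
        return sum(self.weights.get((f, tag), 0.0) for f in feats)

    def _predict_tag(self, chars, i):
        feats = features(chars, i)
        # Ties resolve to the first tag in TAGS, i.e. "B".
        return max(self.TAGS, key=lambda t: self._score(feats, t))

    def train(self, corpus, epochs=5):
        """corpus: list of sentences, each a list of already segmented words."""
        for _ in range(epochs):
            for words in corpus:
                chars = list("".join(words))
                gold = to_tags(words)
                for i, gold_tag in enumerate(gold):
                    pred = self._predict_tag(chars, i)
                    if pred != gold_tag:
                        # Standard perceptron update: reward gold, penalize prediction.
                        for f in features(chars, i):
                            self.weights[(f, gold_tag)] = self.weights.get((f, gold_tag), 0.0) + 1.0
                            self.weights[(f, pred)] = self.weights.get((f, pred), 0.0) - 1.0

    def segment(self, text):
        """Split text into words by tagging each character independently."""
        chars = list(text)
        words = []
        for i, c in enumerate(chars):
            if i == 0 or self._predict_tag(chars, i) == "B":
                words.append(c)
            else:
                words[-1] += c
        return words


# Hypothetical toy corpus; real training data would follow Braille word rules.
TOY_CORPUS = [
    ["我们", "喜欢", "读书"],
    ["他们", "喜欢", "音乐"],
]

if __name__ == "__main__":
    segmenter = PerceptronSegmenter()
    segmenter.train(TOY_CORPUS, epochs=5)
    print(segmenter.segment("我们喜欢音乐"))
```

Because the model learns boundaries from whatever segmentation the training corpus uses, the same code would produce Braille-style word boundaries if trained on Braille-segmented text, which is the idea of skipping a separate Chinese word segmentation stage. The fusion with Chinese word segmentation results mentioned in the abstract is not sketched here, since the abstract does not describe how it works.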
