上海交通大学学报(英文版) ›› 2011, Vol. 16 ›› Issue (5): 524-529.doi: 10.1007/s12204-011-1189-1

• 论文 • 上一篇    下一篇

Investigation of Improved Approaches to Bayes Risk Decoding

 XU Hai-hua (徐海华),    ZHU Jie (朱   杰)   

  1. (Department of Electronic Engineering, Shanghai Jiaotong University,
    Shanghai 200240, China)
  • 收稿日期:2010-07-11 出版日期:2011-10-29 发布日期:2011-10-20
  • 通讯作者: XU Hai-hua (徐海华) E-mail:haihua.xu@aispeech.com

Investigation of Improved Approaches to Bayes Risk Decoding

 XU Hai-hua (徐海华),    ZHU Jie (朱   杰)   

  1. (Department of Electronic Engineering, Shanghai Jiaotong University,
    Shanghai 200240, China)
  • Received:2010-07-11 Online:2011-10-29 Published:2011-10-20
  • Contact: XU Hai-hua (徐海华) E-mail:haihua.xu@aispeech.com

摘要: Abstract:  Bayes risk (BR) decoding methods have been widely
investigated in the speech recognition area due to its flexibility and
complexity compared with the maximum a posteriori (MAP) method regarding to
minimum word error (MWE) optimization. This paper investigates two improved
approaches to the BR decoding, aiming at minimizing word error. The novelty
of the proposed methods is shown in the explicit optimization of the
objective function, the value of which is calculated by an improved forward
algorithm on the lattice. However, the result of the first method is
obtained by an expectation maximization (EM) like iteration, while the
result of the second one is achieved by traversing the confusion network
(CN), both of which lead to an optimized objective function value with
distinct approaches. Experimental results indicate that the proposed methods
result in an error reduction for lattice rescoring, compared with the
traditional CN method for lattice rescoring.

关键词: Bayes risk (BR), confusion network (CN), speech
recognition,
lattice rescoring

Abstract: Abstract:  Bayes risk (BR) decoding methods have been widely
investigated in the speech recognition area due to its flexibility and
complexity compared with the maximum a posteriori (MAP) method regarding to
minimum word error (MWE) optimization. This paper investigates two improved
approaches to the BR decoding, aiming at minimizing word error. The novelty
of the proposed methods is shown in the explicit optimization of the
objective function, the value of which is calculated by an improved forward
algorithm on the lattice. However, the result of the first method is
obtained by an expectation maximization (EM) like iteration, while the
result of the second one is achieved by traversing the confusion network
(CN), both of which lead to an optimized objective function value with
distinct approaches. Experimental results indicate that the proposed methods
result in an error reduction for lattice rescoring, compared with the
traditional CN method for lattice rescoring.

Key words: Bayes risk (BR), confusion network (CN), speech
recognition,
lattice rescoring

中图分类号: