Distribution-based feature normalization for robust speech recognition leveraging context and dynamics cues

Yu Chen Kao, Berlin Chen

研究成果: 雜誌貢獻會議論文同行評審

1 引文 斯高帕斯(Scopus)

摘要

Recently, histogram equalization (HEQ) of speech features has received considerable attention in the area of robust speech recognition because of its relative simplicity and good empirical performance. In this paper, we present a novel extension to the conventional HEQ approach in two significant aspects. First, polynomial regression of various orders is employed to efficiently perform feature normalization building up the notion of HEQ. Second, not only the contextual distributional statistics but also the dynamics of feature values are taken as the input to the presented regression functions for better normalization performance. By doing so, we can to some extent relax the dimension-independence and bag-offrames assumptions made by the conventional HEQ approach. All experiments were carried out on the Aurora-2 database and task and further verified on the Aurora-4 database and task. The corresponding results demonstrate that our proposed methods can achieve considerable word error rate reductions over the baseline systems and offer additional performance gains for the AFE-processed features.

原文英語
頁(從 - 到)2958-2962
頁數5
期刊Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
出版狀態已發佈 - 2013
事件14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 - Lyon, 法国
持續時間: 2013 8月 252013 8月 29

ASJC Scopus subject areas

  • 語言與語言學
  • 人機介面
  • 訊號處理
  • 軟體
  • 建模與模擬

指紋

深入研究「Distribution-based feature normalization for robust speech recognition leveraging context and dynamics cues」主題。共同形成了獨特的指紋。

引用此