Exploring joint equalization of spatial-temporal contextual statistics of speech features for robust speech recognition

Hsin Ju Hsieh, Jeih Weih Hung, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

3 引文 斯高帕斯(Scopus)

摘要

Histogram equalization (HEQ) of speech features has recently become an active focus of much research in the field of robust speech recognition due to its inherent neat formulation and remarkable performance. Our work in this paper continues this general line of research in two significant aspects. First, a novel framework for joint equalization of spatial-temporal contextual statistics of speech features is proposed. For this idea to work, we leverage simple differencing and averaging operations to render the contextual relationships of feature vector components, not only between different dimensions but also between consecutive speech frames, for speech feature normalization. Second, we exploit a polynomial-fitting scheme to efficiently approximate the inverse of the cumulative density function of training speech, so as to work in conjunction with the presented normalization framework. As such, it provides the advantages of lower storage and time consumption when compared with the conventional HEQ methods. All experiments were carried out on the Aurora-2 database and task. The performance of the methods deduced from our proposed framework was thoroughly tested and verified by comparisons with other popular robustness methods, which suggests the utility of our methods.

原文英語
主出版物標題13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
頁面2621-2624
頁數4
出版狀態已發佈 - 2012
事件13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 - Portland, OR, 美国
持續時間: 2012 9月 92012 9月 13

出版系列

名字13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
3

其他

其他13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
國家/地區美国
城市Portland, OR
期間2012/09/092012/09/13

ASJC Scopus subject areas

  • 電腦網路與通信
  • 通訊

指紋

深入研究「Exploring joint equalization of spatial-temporal contextual statistics of speech features for robust speech recognition」主題。共同形成了獨特的指紋。

引用此