Improved histogram equalization (HEQ) for robust speech recognition

Shih Hsiang Lin*, Hung Bin Chen, Yao Ming Yeh, Berlin Chen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

With the rapid development of Intelligent Transportation Systems (ITS), providing users with a natural and efficient human-machine interface is becoming a crucial issue for driver safety. There is no doubt that speech will be one of the best mediators of human-machine interaction; however, the performance of automatic speech recognition (ASR) always radically degrades when the input speech is corrupted by varying noises. In this paper, we consider the use of histogram equalization (HEQ) for robust ASR. A novel data fitting scheme is presented to efficiently approximate the inverse of the cumulative density function of training speech for HEQ, which has the merits of lower storage and time consumption compared to the conventional table-lookup or quantile-based HEQ approaches. Moreover, a more elaborate attempt at using multiple inverse functions for different noise conditions is investigated as well. All experiments were carried out on the Aurora-2 standard database and task. Very encouraging results were obtained. The proposed robustness technique has also been properly integrated into our prototype system for in-vehicle traffic information retrieval using spoken queries.
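
The idea in the abstract, replacing a lookup table with a fitted approximation of the inverse cumulative density function, can be illustrated with a minimal sketch. The abstract does not say which data fitting scheme is used, so the polynomial least-squares fit below is an assumption made for illustration only. The function names `fit_inverse_cdf` and `equalize`, the `degree` parameter, and the rank-based CDF estimate are likewise hypothetical and are not taken from the paper.

```python
import numpy as np

def fit_inverse_cdf(train_values, degree=5):
    """Fit a polynomial G so that G(cdf) approximates the training feature value.

    Assumption: a polynomial least-squares fit stands in for the paper's
    unspecified "data fitting scheme".
    """
    x = np.sort(np.asarray(train_values, dtype=float))
    n = x.size
    # Empirical CDF value of each sorted sample, kept strictly inside (0, 1).
    cdf = (np.arange(1, n + 1) - 0.5) / n
    return np.polyfit(cdf, x, degree)

def equalize(test_values, coeffs):
    """Map each test value through its own empirical CDF, then through G."""
    v = np.asarray(test_values, dtype=float)
    n = v.size
    # 0-based rank of each value within the utterance (ties broken arbitrarily).
    ranks = np.argsort(np.argsort(v))
    cdf = (ranks + 0.5) / n
    return np.polyval(coeffs, cdf)

# Illustrative data: "clean" training features and a shifted, scaled test set.
rng = np.random.default_rng(0)
clean = rng.normal(0.0, 1.0, 5000)
noisy = 0.5 * rng.normal(0.0, 1.0, 400) + 2.0

coeffs = fit_inverse_cdf(clean, degree=5)
restored = equalize(noisy, coeffs)
```

In this sketch only `degree + 1` coefficients are stored per feature dimension, which is one way a fitted inverse function could need less storage than a table of quantiles. The polynomial is not guaranteed to be monotonic, so a practical implementation would likely need additional checks.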

Original language: English
Title of host publication: Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007
Publisher: IEEE Computer Society
Pages: 2234-2237
Number of pages: 4
ISBN (Print): 1424410177, 9781424410170
Publication status: Published - 2007
Event: IEEE International Conference on Multimedia and Expo, ICME 2007 - Beijing, China
Duration: 2 Jul 2007 to 5 Jul 2007

Publication series

Name: Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007

Other

Other: IEEE International Conference on Multimedia and Expo, ICME 2007
Country/Territory: China
City: Beijing
Period: 2007/07/02 to 2007/07/05

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Software
