Histogram equalization of real and imaginary modulation spectra for noise-robust speech recognition

Hsin Ju Hsieh, Berlin Chen, Jeih Weih Hung

研究成果: 雜誌貢獻會議論文同行評審

7 引文 斯高帕斯(Scopus)

摘要

Histogram equalization (HEQ) of acoustic features has received considerable attention in the area of robust speech recognition because of its relative simplicity and good empirical performance. This paper presents a novel HEQbased feature extraction approach that performs equalization in both acoustic frequency and modulation frequency domains for obtaining better noise-robust features. In particular, the real and imaginary acoustic spectra are first individually transformed to the modulation domain via discrete Fourier transform (DFT). The HEQ process is then carried on the corresponding magnitude modulation spectra so as to compensate for the noise distortions. Finally, the equalized modulation spectra are converted back to form the real and imaginary acoustic spectra, respectively. By doing so, we can enhance not only the magnitude but also the phase components of the acoustic spectra, and thereby create more noise-robust cepstral features. The experiments conducted on the Aurora-2 clean-condition database and task reveal that the presented approach delivers superior recognition accuracy in comparison with some other HEQ-related methods and the well-known advanced front-end (AFE) extraction scheme, which supports the potential utility of this novel approach.

原文英語
頁(從 - 到)2997-3001
頁數5
期刊Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
出版狀態已發佈 - 2013 一月 1
事件14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 - Lyon, 法国
持續時間: 2013 八月 252013 八月 29

ASJC Scopus subject areas

  • 語言與語言學
  • 人機介面
  • 訊號處理
  • 軟體
  • 建模與模擬

指紋

深入研究「Histogram equalization of real and imaginary modulation spectra for noise-robust speech recognition」主題。共同形成了獨特的指紋。

引用此