Spatial histogram equalization of complex-valued acoustic spectra in modulation domain for noise-robust speech recognition

Hsin Ju Hsieh, Berlin Chen, Jeih Weih Hung

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution

1 Citation (Scopus)

Abstract

This paper proposes to enhance the complex-valued acoustic spectrograms of speech signals via histogram equalization (HEQ) to produce noise-robust features for recognition. The presented method extends our previous work on spectrogram enhancement and has two significant aspects. First, we process the real and imaginary parts of the acoustic spectrograms separately, so that both the corresponding magnitude and phase components are enhanced implicitly. Second, we apply FIR filters to the intra-frame acoustic spectra to acquire the respective local structural statistics, which are subsequently employed to perform various types of HEQ on the acoustic spectrograms, making the resulting speech features more robust. All experiments were carried out on the Aurora-2 database and task. The presented methods were evaluated against other well-known robustness methods, and the comparisons show that they improve the noise robustness of speech features.
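
The two aspects named in the abstract can be illustrated with a minimal sketch in Python with NumPy. It is not the authors' implementation. The function names, the three-tap smoothing filter, the rank-based mapping onto a reference sample, and the choice to equalize each frequency bin over time are all assumptions made for illustration; the abstract does not specify how the FIR-filtered statistics enter the HEQ step.

```python
import numpy as np


def heq_to_reference(values, reference):
    """Rank-based HEQ: give `values` the empirical distribution of `reference`."""
    values = np.asarray(values, dtype=float)
    # Empirical CDF from ranks: the k-th smallest of N values maps to (k + 0.5) / N.
    ranks = np.argsort(np.argsort(values))
    cdf = (ranks + 0.5) / values.size
    # Inverse reference CDF, approximated by the empirical quantile function.
    return np.quantile(reference, cdf)


def fir_along_frequency(part, taps):
    """Filter every frame (column) across its frequency bins with an FIR filter.

    Assumes the number of frequency bins is at least len(taps).
    """
    return np.apply_along_axis(
        lambda frame: np.convolve(frame, taps, mode="same"), 0, part
    )


def equalize_part(part, reference, taps):
    """Smooth within each frame, then equalize each frequency bin over time."""
    smoothed = fir_along_frequency(part, taps)
    return np.vstack([heq_to_reference(row, reference) for row in smoothed])


def equalize_complex_spectrogram(spec, ref_real, ref_imag, taps=(0.25, 0.5, 0.25)):
    """spec: complex array of shape (n_bins, n_frames).

    The real and imaginary parts are processed separately and recombined,
    so magnitude and phase both change as a side effect.
    The default taps are a hypothetical low-pass filter, not taken from the paper.
    """
    real = equalize_part(spec.real, ref_real, taps)
    imag = equalize_part(spec.imag, ref_imag, taps)
    return real + 1j * imag
```

In this sketch, `ref_real` and `ref_imag` would be one-dimensional samples drawn from clean training data (an assumption), and the returned array has the same shape as `spec`. Because the mapping is monotone within each frequency bin, the ordering of the smoothed values over time is preserved while their distribution is matched to the reference.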

Original language: English
Title of host publication: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9786163618238
Publication status: Published - 12 Feb 2014
Event: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 - Chiang Mai, Thailand
Duration: 9 Dec 2014 to 12 Dec 2014

Publication series

Name: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014

Other

Other: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
Country/Territory: Thailand
City: Chiang Mai
Period: 2014/12/09 to 2014/12/12

ASJC Scopus subject areas

  • Signal Processing
  • Information Systems
