Leveraging distributional characteristics of modulation spectra for robust speech recognition

Yu Chen Kao, Berlin Chen

研究成果: 書貢獻/報告類型會議貢獻

3 引文 斯高帕斯(Scopus)

摘要

Modulation spectrum processing of speech features has recently become an active area of intensive research in the speech recognition community. As for normalization of modulation spectra, spectral histogram equalization (SHE) seems to be one of the most effective techniques that have been used to compensate the nonlinear distortion. In this paper, we investigate a novel use of polynomial-fitting techniques for modulation histogram equalization, which has the advantages of lower storage and time consumption when compared with the conventional SHE methods. Further, we also investigated the possibility of combining our approach with other temporal feature normalization methods. The automatic speech recognition (ASR) experiments were carried out on the Aurora-2 standard noise-robust ASR task. The performance of the proposed approach was thoroughly tested and verified by comparisons with the other popular modulation spectrum normalization methods, which suggests the utility of the proposed approach.

原文英語
主出版物標題2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA 2012
頁面120-125
頁數6
DOIs
出版狀態已發佈 - 2012 十一月 12
事件2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA 2012 - Montreal, QC, 加拿大
持續時間: 2012 七月 22012 七月 5

出版系列

名字2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA 2012

其他

其他2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA 2012
國家加拿大
城市Montreal, QC
期間12/7/212/7/5

ASJC Scopus subject areas

  • Computer Science Applications
  • Signal Processing

指紋 深入研究「Leveraging distributional characteristics of modulation spectra for robust speech recognition」主題。共同形成了獨特的指紋。

  • 引用此

    Kao, Y. C., & Chen, B. (2012). Leveraging distributional characteristics of modulation spectra for robust speech recognition. 於 2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA 2012 (頁 120-125). [6310476] (2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA 2012). https://doi.org/10.1109/ISSPA.2012.6310476