Modulation spectrum factorization for robust speech recognition

Wen Yi Chu*, Jeih Weih Hung, Berlin Chen

*此作品的通信作者

研究成果: 會議貢獻類型會議論文同行評審

9 引文 斯高帕斯(Scopus)

摘要

This paper presents a novel approach to improving the noise robustness of speech features built on top of nonnegative matrix factorization (NMF). To do this, we employ NMF to extract a common set of basis spectral vectors that cover the intrinsic temporal structure inherent in the modulation spectra of clean training speech features. The new modulation spectra of the speech features, constructed by mapping the original modulation spectra into the space spanned by these basis vectors, are demonstrated with good noise-robust capabilities. All experiments were conducted using the Aurora-2 database and task. The results show that the proposed NMF-based approach, together with mean and variance normalization (MVN), can provide average error reduction rates of over 65% and 12% relative as compared with the baseline MFCC system and that using the MVN method alone, respectively.

原文英語
頁面1-6
頁數6
出版狀態已發佈 - 2011
事件Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, APSIPA ASC 2011 - Xi'an, 中国
持續時間: 2011 10月 182011 10月 21

其他

其他Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, APSIPA ASC 2011
國家/地區中国
城市Xi'an
期間2011/10/182011/10/21

ASJC Scopus subject areas

  • 資訊系統
  • 訊號處理

指紋

深入研究「Modulation spectrum factorization for robust speech recognition」主題。共同形成了獨特的指紋。

引用此