The performance of current automatic speech recognition (ASR) systems often deteriorates radically when the input speech is corrupted by various kinds of noise sources. Quite a few of techniques have been proposed to improve ASR robustness over the last few decades. Related work reported in the literature can be generally divided into two aspects according to whether the orientation of the methods is either from the feature domain or from the corresponding probability distributions. In this paper, we present a polynomial regression approach which has the merit of directly characterizing the relationship between the speech features and their corresponding probability distributions to compensate the noise effects. Two variants of the proposed approach are also extensively investigated as well. All experiments are conducted on the Aurora-2 database and task. Experimental results show that for clean-condition training, our approaches achieve considerable word error rate reductions over the baseline system, and also significantly outperform other conventional methods.
|出版狀態||已發佈 - 2007 十二月 1|
|事件||2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007 - Kyoto, 日本|
持續時間: 2007 十二月 9 → 2007 十二月 13
|其他||2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007|
|期間||2007/12/09 → 2007/12/13|
ASJC Scopus subject areas