Abstract
The performance of current automatic speech recognition (ASR) systems often deteriorates radically when the input speech is corrupted by various kinds of noise sources. Quite a few of techniques have been proposed to improve ASR robustness over the last few decades. Related work reported in the literature can be generally divided into two aspects according to whether the orientation of the methods is either from the feature domain or from the corresponding probability distributions. In this paper, we present a polynomial regression approach which has the merit of directly characterizing the relationship between the speech features and their corresponding probability distributions to compensate the noise effects. Two variants of the proposed approach are also extensively investigated as well. All experiments are conducted on the Aurora-2 database and task. Experimental results show that for clean-condition training, our approaches achieve considerable word error rate reductions over the baseline system, and also significantly outperform other conventional methods.
Original language | English |
---|---|
Pages | 87-92 |
Number of pages | 6 |
Publication status | Published - 2007 |
Event | 2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007 - Kyoto, Japan Duration: 2007 Dec 9 → 2007 Dec 13 |
Other
Other | 2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007 |
---|---|
Country/Territory | Japan |
City | Kyoto |
Period | 2007/12/09 → 2007/12/13 |
Keywords
- Clustering
- Histogram equalization
- Polynomial regression
- Robustness
- Speech recognition
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition
- Software
- Artificial Intelligence