Abstract
This study presents a maximum a posteriori (MAP) based perceptual modeling approach to deal with the issue of recognition degradation in noisy environment. In this approach, MAP-based noise detection is first applied to identify the noise segment in an utterance. Subtractive-type enhancement algorithm with masking properties of the human auditory system is then used to reduce the noise effect. Finally, MAP-based incremental noise model adaptation is developed to overcome the model inconsistencies between training and testing environments. For performance evaluation of the proposed approach, a Mandarin keyword recognition system was constructed. The experimental results show that the proposed approach achieves a better recognition rate compared to the audible noise suppression (ANS) and parallel model combination (PMC) methods.
Original language | English |
---|---|
Pages (from-to) | 999-1013 |
Number of pages | 15 |
Journal | Journal of Information Science and Engineering |
Volume | 22 |
Issue number | 5 |
Publication status | Published - 2006 Sept |
Externally published | Yes |
Keywords
- Audible noise suppression
- Incremental model adaptation
- MAP-based perceptual modeling
- Noise detection
- Noisy speech recognition
- Speech enhancement
ASJC Scopus subject areas
- Software
- Human-Computer Interaction
- Hardware and Architecture
- Library and Information Sciences
- Computational Theory and Mathematics