Abstract
This study presents a maximum a posteriori (MAP) based perceptual modeling approach to deal with the issue of recognition degradation in noisy environment. In this approach, MAP-based noise detection is first applied to identify the noise segment in an utterance. Subtractive-type enhancement algorithm with masking properties of the human auditory system is then used to reduce the noise effect. Finally, MAP-based incremental noise model adaptation is developed to overcome the model inconsistencies between training and testing environments. For performance evaluation of the proposed approach, a Mandarin keyword recognition system was constructed. The experimental results show that the proposed approach achieves a better recognition rate compared to the audible noise suppression (ANS) and parallel model combination (PMC) methods.
| Original language | English |
|---|---|
| Pages (from-to) | 999-1013 |
| Number of pages | 15 |
| Journal | Journal of Information Science and Engineering |
| Volume | 22 |
| Issue number | 5 |
| Publication status | Published - 2006 Sept |
| Externally published | Yes |
Keywords
- Audible noise suppression
- Incremental model adaptation
- MAP-based perceptual modeling
- Noise detection
- Noisy speech recognition
- Speech enhancement
ASJC Scopus subject areas
- Software
- Human-Computer Interaction
- Hardware and Architecture
- Library and Information Sciences
- Computational Theory and Mathematics