Improved histogram equalization (HEQ) for robust speech recognition

Shih Hsiang Lin, Hung Bin Chen, Yao Ming Yeh, Berlin Chen

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution

Abstract

With the rapid development of Intelligent Transportation Systems (ITS), providing users with a natural and efficient human-machine interface has become a crucial issue for driver safety. Speech is undoubtedly one of the best mediators of human-machine interaction; however, the performance of automatic speech recognition (ASR) degrades severely when the input speech is corrupted by varying noise. In this paper, we consider the use of histogram equalization (HEQ) for robust ASR. We present a novel data fitting scheme that efficiently approximates the inverse of the cumulative distribution function of the training speech for HEQ, and that requires less storage and computation time than the conventional table-lookup or quantile-based HEQ approaches. We also investigate a more elaborate approach that uses multiple inverse functions for different noise conditions. All experiments were carried out on the Aurora-2 standard database and task, and very encouraging results were obtained. The proposed robustness technique has also been integrated into our prototype system for in-vehicle traffic information retrieval using spoken queries.
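
The idea of HEQ with a fitted inverse function can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract only mentions "a data fitting scheme", so the use of a least-squares polynomial, the degree, the function names (`empirical_cdf`, `fit_inverse_cdf`, `equalize`), and the synthetic Gaussian data standing in for one speech feature component are all assumptions made for illustration.

```python
import numpy as np

def empirical_cdf(values):
    """Rank-based CDF estimate, strictly inside (0, 1), for each sample."""
    ranks = np.argsort(np.argsort(values))  # 0-based rank of every sample
    return (ranks + 0.5) / len(values)

def fit_inverse_cdf(train_values, degree=7):
    """Fit G(u) ~ F_train^{-1}(u) by least squares (polynomial is an assumption)."""
    u = empirical_cdf(train_values)
    return np.polynomial.Polynomial.fit(u, train_values, degree)

def equalize(noisy_values, inverse_cdf):
    """Map each noisy value x to G(F_noisy(x))."""
    return inverse_cdf(empirical_cdf(noisy_values))

# Synthetic stand-in for one feature dimension (hypothetical data).
rng = np.random.default_rng(0)
train = rng.normal(0.0, 1.0, 5000)             # "clean" training features
noisy = 0.5 * rng.normal(0.0, 1.0, 300) + 2.0  # shifted and compressed test features

inverse_cdf = fit_inverse_cdf(train)
equalized = equalize(noisy, inverse_cdf)

print("noisy mean/std:    ", noisy.mean(), noisy.std())
print("equalized mean/std:", equalized.mean(), equalized.std())
```

In this sketch only the fitted coefficients need to be stored per feature dimension, rather than a lookup table or a set of quantiles, which is one plausible reading of the storage and time savings the abstract claims.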

Original language: English
Title of host publication: Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007
Pages: 2234-2237
Number of pages: 4
Publication status: Published - 2007 Dec 1
Event: IEEE International Conference on Multimedia and Expo, ICME 2007 - Beijing, China
Duration: 2007 Jul 2 to 2007 Jul 5

Publication series

Name: Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007

Other

Other: IEEE International Conference on Multimedia and Expo, ICME 2007
Country: China
City: Beijing
Period: 07/7/2 to 07/7/5

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Software

Cite this

Lin, S. H., Chen, H. B., Yeh, Y. M., & Chen, B. (2007). Improved histogram equalization (HEQ) for robust speech recognition. In Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007 (pp. 2234-2237). [4285130] (Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007).