Cluster-based polynomial-fit histogram equalization (CPHEQ) for robust speech recognition

Shih Hsiang Lin*, Yao Ming Yeh, Berlin Chen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Noise robustness is one of the primary challenges facing most automatic speech recognition (ASR) systems. A vast amount of research efforts on preventing the degradation of ASR performance under various noisy environments have been made during the past several years. In this paper, we consider the use of histogram equalization (HEQ) for robust ASR. In contrast to conventional methods, a novel data fitting method based on polynomial regression was presented to efficiently approximate the inverse of the cumulative density functions of speech feature vectors for HEQ. Moreover, a more elaborate attempt of using such polynomial regression models to directly characterizing the relationship between the speech feature vectors and their corresponding probability distributions, under various noise conditions, was proposed as well. All experiments were carried out on the Aurora-2 database and task. The performance of the presented methods were extensively tested and verified by comparison with the other methods. Experimental results shown that for cleancondition training, our method achieved a considerable word error rate reduction over the baseline system, and also significantly outperformed the other methods.

Original languageEnglish
Title of host publicationInternational Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007
PublisherUnavailable
Pages197-200
Number of pages4
ISBN (Print)9781605603162
Publication statusPublished - 2007
Event8th Annual Conference of the International Speech Communication Association, Interspeech 2007 - Antwerp, Belgium
Duration: 2007 Aug 272007 Aug 31

Publication series

NameInternational Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007
Volume1
ISSN (Electronic)1990-9772

Other

Other8th Annual Conference of the International Speech Communication Association, Interspeech 2007
Country/TerritoryBelgium
CityAntwerp
Period2007/08/272007/08/31

Keywords

  • Histogram equalization
  • Noise robustness
  • Polynomial regression model
  • Speech recognition

ASJC Scopus subject areas

  • Software
  • General Energy
  • Communication
  • Energy Engineering and Power Technology
  • Management, Monitoring, Policy and Law
  • Fuel Technology
  • Computer Science Applications
  • Renewable Energy, Sustainability and the Environment
  • Modelling and Simulation
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Cluster-based polynomial-fit histogram equalization (CPHEQ) for robust speech recognition'. Together they form a unique fingerprint.

Cite this