Spatial histogram equalization of complex-valued acoustic spectra in modulation domain for noise-robust speech recognition

Hsin Ju Hsieh, Berlin Chen, Jeih Weih Hung

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

This paper proposes enhancing the complex-valued acoustic spectrograms of speech signals with histogram equalization (HEQ) to produce noise-robust features for recognition. The presented method extends our previous work on spectrogram enhancement and has two significant aspects. First, we process the real and imaginary parts of the acoustic spectrograms separately, so that both the corresponding magnitude and phase components are enhanced implicitly. Second, we apply FIR filters to the intra-frame acoustic spectra to acquire the respective local structural statistics, which are subsequently used to perform various types of HEQ on the acoustic spectrograms, making the resulting speech features more robust. All experiments were carried out on the Aurora-2 database and task. The presented methods were compared with other well-known robustness methods, and the comparisons show that our methods improve the noise robustness of speech features.
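To make the first aspect concrete, the following is a minimal sketch of applying HEQ separately to the real and imaginary parts of a complex spectrogram. It is not the authors' implementation: the function names `heq_to_gaussian` and `heq_complex_spectrogram`, the choice of a standard normal reference distribution, and the per-bin equalization along the time axis are all assumptions made for illustration, and the FIR-filter-based local statistics described in the abstract are not modeled here.

```python
import numpy as np
from statistics import NormalDist


def heq_to_gaussian(x):
    """Rank-based HEQ of a 1-D real sequence onto a standard normal reference.

    Assumption: the reference distribution is N(0, 1); the abstract does not
    say which reference the authors use.
    """
    n = x.size
    ranks = np.argsort(np.argsort(x))      # rank 0..n-1 of each sample
    cdf = (ranks + 0.5) / n                # empirical CDF, strictly inside (0, 1)
    inv_cdf = NormalDist().inv_cdf         # inverse CDF of the reference
    return np.array([inv_cdf(p) for p in cdf])


def heq_complex_spectrogram(spec):
    """Equalize real and imaginary parts separately, per frequency bin.

    spec: complex array of shape (num_bins, num_frames).
    Hypothetical helper: equalizing along time within each bin is an
    illustrative choice, not a detail taken from the paper.
    """
    out = np.empty(spec.shape, dtype=np.complex128)
    for k in range(spec.shape[0]):
        real_eq = heq_to_gaussian(spec[k].real)
        imag_eq = heq_to_gaussian(spec[k].imag)
        out[k] = real_eq + 1j * imag_eq
    return out
```

Because the real and imaginary parts are each remapped, the magnitude and phase of every time-frequency point change as a side effect, which is the sense in which both are enhanced implicitly. The input `spec` would typically come from any short-time Fourier transform of the speech signal.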

Original language: English
Title of host publication: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9786163618238
Publication status: Published - 2014 Feb 12
Event: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 - Chiang Mai, Thailand
Duration: 2014 Dec 9 → 2014 Dec 12

Publication series

Name: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014

Other

Other: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
Country/Territory: Thailand
City: Chiang Mai
Period: 2014/12/09 → 2014/12/12

ASJC Scopus subject areas

  • Signal Processing
  • Information Systems
