TY - GEN
T1 - Spatial histogram equalization of complex-valued acoustic spectra in modulation domain for noise-robust speech recognition
AU - Hsieh, Hsin Ju
AU - Chen, Berlin
AU - Hung, Jeih Weih
N1 - Publisher Copyright:
© 2014 Asia-Pacific Signal and Information Processing Ass.
PY - 2014/2/12
Y1 - 2014/2/12
N2 - This paper proposes to enhance the complex-valued acoustic spectrograms of speech signals via the technique of histogram equalization (HEQ) to produce noise-robust features for recognition. The presented method extends our previous work in the task of spectrogram enhancement and has two significant aspects. First, we process the real and imaginary parts of acoustic spectrograms separately, and therefore both of the corresponding magnitude and phase components can be enhanced implicitly. Second, we apply FIR filters to the intra-frame acoustic spectra to acquire the respective local structural statistics, which are subsequently employed to perform various types of HEQ on the acoustic spectrograms for robustifying the resulting speech features. All experiments were carried out on the Aurora-2 database and task. The performance of the presented methods was thoroughly tested and verified by comparisons with other well-known robustness methods, which reveals the capability of our methods in promoting the noise robustness of speech features.
AB - This paper proposes to enhance the complex-valued acoustic spectrograms of speech signals via the technique of histogram equalization (HEQ) to produce noise-robust features for recognition. The presented method extends our previous work in the task of spectrogram enhancement and has two significant aspects. First, we process the real and imaginary parts of acoustic spectrograms separately, and therefore both of the corresponding magnitude and phase components can be enhanced implicitly. Second, we apply FIR filters to the intra-frame acoustic spectra to acquire the respective local structural statistics, which are subsequently employed to perform various types of HEQ on the acoustic spectrograms for robustifying the resulting speech features. All experiments were carried out on the Aurora-2 database and task. The performance of the presented methods was thoroughly tested and verified by comparisons with other well-known robustness methods, which reveals the capability of our methods in promoting the noise robustness of speech features.
UR - http://www.scopus.com/inward/record.url?scp=84949926199&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84949926199&partnerID=8YFLogxK
U2 - 10.1109/APSIPA.2014.7041568
DO - 10.1109/APSIPA.2014.7041568
M3 - Conference contribution
AN - SCOPUS:84949926199
T3 - 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
BT - 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
Y2 - 9 December 2014 through 12 December 2014
ER -