TY - GEN
T1 - Investigating Manifold Learning Technique for Robust Speech Recognition
AU - Yan, Bi Cheng
AU - Shih, Chin Hong
AU - Chen, Berlin
AU - Liu, Shih Hung
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/2
Y1 - 2018/7/2
N2 - Developing robustness methods is imperative to retaining good performance for automatic speech recognition (ASR)systems when being confronted with different environmental noise or channel distortion. Previous studies have pointed out that exploration of low-dimensional structures of speech features is beneficial to generating robust features so as to enhance ASR performance. Along this research direction, we argue that the intrinsic structures of speech features lying on a manifold subspace of low dimensionality residing in their original ambient space of high dimensionality. This way, noise components can be ruled out by projecting noisy speech features into the pre-learned subspace of manifold structures. This paper explores the intrinsic geometric low-dimensional manifold structures inherent speech features' modulation spectra, with the goal to generate speech features that are more robust to environmental noise and channel distortion. The key novelty of our work is two-fold: 1)we put forward an innovative use of the graph-regularization based method to generate robust speech features by preserving the inherent manifold structures of modulation spectra and excluding irrelevant ones, and 2)we also compare our approach with several mainstream methods that also explores low-dimensional structures of data instances with in-depth analysis. A comprehensive set of empirical experiments carried out on an ASR benchmark task seem to reveal the superior performance of our proposed methods.
AB - Developing robustness methods is imperative to retaining good performance for automatic speech recognition (ASR)systems when being confronted with different environmental noise or channel distortion. Previous studies have pointed out that exploration of low-dimensional structures of speech features is beneficial to generating robust features so as to enhance ASR performance. Along this research direction, we argue that the intrinsic structures of speech features lying on a manifold subspace of low dimensionality residing in their original ambient space of high dimensionality. This way, noise components can be ruled out by projecting noisy speech features into the pre-learned subspace of manifold structures. This paper explores the intrinsic geometric low-dimensional manifold structures inherent speech features' modulation spectra, with the goal to generate speech features that are more robust to environmental noise and channel distortion. The key novelty of our work is two-fold: 1)we put forward an innovative use of the graph-regularization based method to generate robust speech features by preserving the inherent manifold structures of modulation spectra and excluding irrelevant ones, and 2)we also compare our approach with several mainstream methods that also explores low-dimensional structures of data instances with in-depth analysis. A comprehensive set of empirical experiments carried out on an ASR benchmark task seem to reveal the superior performance of our proposed methods.
KW - automatic speech recognition
KW - low-dimensional structures
KW - manifold learning
KW - robustness
UR - http://www.scopus.com/inward/record.url?scp=85062784456&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85062784456&partnerID=8YFLogxK
U2 - 10.1109/IALP.2018.8629269
DO - 10.1109/IALP.2018.8629269
M3 - Conference contribution
AN - SCOPUS:85062784456
T3 - Proceedings of the 2018 International Conference on Asian Language Processing, IALP 2018
SP - 68
EP - 73
BT - Proceedings of the 2018 International Conference on Asian Language Processing, IALP 2018
A2 - Dong, Minghui
A2 - Bijaksana, Moch.
A2 - Sujaini, Herry
A2 - Negara, Arif Bijaksana Putra
A2 - Romadhony, Ade
A2 - Ruskanda, Fariska Z.
A2 - Nurfadhilah, Elvira
A2 - Aini, Lyla Ruslana
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 22nd International Conference on Asian Language Processing, IALP 2018
Y2 - 15 November 2018 through 17 November 2018
ER -