融合多種深層類神經網路聲學模型與分類技術於華語錯誤發音檢測之研究

Yao Chi Hsu, Ming Han Yang, Hsiao Tsung Hung, Yuwen Hsiung, Yao Ting Sung, Berlin Chen

Research output: Chapter in book/report/conference proceeding (conference paper)

Abstract

Automatic mispronunciation detection plays a crucial role in a computer-assisted pronunciation training (CAPT) system. Its main purpose is to judge whether a non-native speaker's pronunciations are correct. In general, the process can be divided into two parts: 1) a front-end feature extraction module that generates pronunciation detection features from an input speech segment and its associated reference acoustic models; and 2) a back-end classification module that takes these features as input to a classifier and decides from the classifier's output whether the segment is pronounced correctly. The main contributions of this work are three-fold. First, we investigate two state-of-the-art acoustic models, based respectively on deep neural networks (DNN) and convolutional neural networks (CNN), and compare their effectiveness for extracting discriminative pronunciation detection features. Second, we experiment with different types of classification methods and propose a novel integration of DNN- and CNN-based decision scores at the back-end. Third, we provide an extensive set of empirical evaluations of the two modules and their associated methods on a recently compiled corpus for learning Mandarin Chinese as a second language. The experimental results demonstrate the utility of our approach relative to several existing baselines.
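
The back-end integration of DNN- and CNN-based decision scores can be illustrated with a small sketch. The abstract does not say how the two scores are combined, so the linear interpolation below is only an assumption chosen for illustration, not the authors' method. The names `SegmentScores`, `fuse_scores`, `detect_mispronunciations`, `weight`, and `threshold` are all hypothetical, and the scores are assumed to lie in [0, 1] with higher values meaning "more likely mispronounced".

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SegmentScores:
    """Decision scores for one speech segment (assumed: higher = more likely mispronounced)."""
    dnn_score: float  # hypothetical score from a classifier using DNN-based features
    cnn_score: float  # hypothetical score from a classifier using CNN-based features


def fuse_scores(seg: SegmentScores, weight: float = 0.5) -> float:
    """Linearly interpolate the two scores; `weight` is an assumed tuning parameter."""
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return weight * seg.dnn_score + (1.0 - weight) * seg.cnn_score


def detect_mispronunciations(
    segments: List[SegmentScores], weight: float = 0.5, threshold: float = 0.5
) -> List[bool]:
    """Flag a segment as mispronounced when its fused score reaches the threshold."""
    return [fuse_scores(seg, weight) >= threshold for seg in segments]


if __name__ == "__main__":
    # Made-up scores, for illustration only.
    demo = [SegmentScores(0.9, 0.7), SegmentScores(0.2, 0.4), SegmentScores(0.6, 0.3)]
    print(detect_mispronunciations(demo))
```

In a sketch like this, both `weight` and `threshold` would be tuned on held-out data; the values used here are placeholders.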

Translated title of the contribution: Exploring combinations of various deep neural network based acoustic models and classification techniques for Mandarin mispronunciation detection
Original language: Traditional Chinese
Title of host publication: Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, ROCLING 2015
Editors: Sin-Horng Chen, Hsin-Min Wang, Jen-Tzung Chien, Hung-Yu Kao, Wen-Whei Chang, Yih-Ru Wang, Shih-Hung Wu
Publisher: The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
Pages: 103-120
Number of pages: 18
ISBN (electronic): 9789573079286
Publication status: Published - 1 Oct 2015
Event: 27th Conference on Computational Linguistics and Speech Processing, ROCLING 2015 - Hsinchu, Taiwan
Duration: 1 Oct 2015 to 2 Oct 2015

Publication series

Name: Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, ROCLING 2015

Conference

Conference: 27th Conference on Computational Linguistics and Speech Processing, ROCLING 2015
Country/Territory: Taiwan
City: Hsinchu
Period: 2015/10/01 to 2015/10/02

Keywords

  • Automatic Speech Recognition
  • Convolutional Neural Networks
  • Deep Neural Networks
  • Mispronunciation Detection

ASJC Scopus subject areas

  • Speech and Hearing
  • Language and Linguistics
