Exploring the Integration of E2E ASR and Pronunciation Modeling for English Mispronunciation Detection

Hsin Wei Wang, Bi Cheng Yan, Yung Chang Hsu, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

摘要

There has been increasing demand to develop effective computer-assisted language training (CAPT) systems, which can provide feedback on mispronunciations and facilitate second-language (L2) learners to improve their speaking proficiency through repeated practice. Due to the shortage of non-native speech for training the automatic speech recognition (ASR) module of a CAPT system, the corresponding mispronunciation detection performance is often affected by imperfect ASR. Recognizing this importance, we in this paper put forward a two-stage mispronunciation detection method. In the first stage, the speech uttered by an L2 learner is processed by an end-to-end ASR module to produce N-best phone sequence hypotheses. In the second stage, these hypotheses are fed into a pronunciation model which seeks to faithfully predict the phone sequence hypothesis that is most likely pronounced by the learner, so as to improve the performance of mispronunciation detection. Empirical experiments conducted a English benchmark dataset seem to confirm the utility of our method.

原文英語
主出版物標題ROCLING 2021 - Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing
編輯Lung-Hao Lee, Chia-Hui Chang, Kuan-Yu Chen
發行者The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
頁面124-131
頁數8
ISBN(電子)9789869576949
出版狀態已發佈 - 2021
事件33rd Conference on Computational Linguistics and Speech Processing, ROCLING 2021 - Taoyuan, 臺灣
持續時間: 2021 10月 152021 10月 16

出版系列

名字ROCLING 2021 - Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing

會議

會議33rd Conference on Computational Linguistics and Speech Processing, ROCLING 2021
國家/地區臺灣
城市Taoyuan
期間2021/10/152021/10/16

ASJC Scopus subject areas

  • 語言與語言學
  • 語言和語言學
  • 言語和聽力

指紋

深入研究「Exploring the Integration of E2E ASR and Pronunciation Modeling for English Mispronunciation Detection」主題。共同形成了獨特的指紋。

引用此