An end-to-end mispronunciation detection system for L2 English speech leveraging novel anti-phone modeling

Bi Cheng Yan, Meng Che Wu, Hsiao Tsung Hung, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

30 引文 斯高帕斯(Scopus)

摘要

Mispronunciation detection and diagnosis (MDD) is a core component of computer-assisted pronunciation training (CAPT). Most of the existing MDD approaches focus on dealing with categorical errors (viz. one canonical phone is substituted by another one, aside from those mispronunciations caused by deletions or insertions). However, accurate detection and diagnosis of non-categorial or distortion errors (viz. approximating L2 phones with L1 (first-language) phones, or erroneous pronunciations in between) still seems out of reach. In view of this, we propose to conduct MDD with a novel end-to-end automatic speech recognition (E2E-based ASR) approach. In particular, we expand the original L2 phone set with their corresponding anti-phone set, making the E2E-based MDD approach have a better capability to take in both categorical and non-categorial mispronunciations, aiming to provide better mispronunciation detection and diagnosis feedback. Furthermore, a novel transfer-learning paradigm is devised to obtain the initial model estimate of the E2E-based MDD system without resource to any phonological rules. Extensive sets of experimental results on the L2-ARCTIC dataset show that our best system can outperform the existing E2E baseline system and pronunciation scoring based method (GOP) in terms of the F1-score, by 11.05% and 27.71%, respectively.

原文英語
主出版物標題Interspeech 2020
發行者International Speech Communication Association
頁面3032-3036
頁數5
ISBN(列印)9781713820697
DOIs
出版狀態已發佈 - 2020
事件21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020 - Shanghai, 中国
持續時間: 2020 10月 252020 10月 29

出版系列

名字Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
2020-October
ISSN(列印)2308-457X
ISSN(電子)1990-9772

會議

會議21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020
國家/地區中国
城市Shanghai
期間2020/10/252020/10/29

ASJC Scopus subject areas

  • 語言與語言學
  • 人機介面
  • 訊號處理
  • 軟體
  • 建模與模擬

指紋

深入研究「An end-to-end mispronunciation detection system for L2 English speech leveraging novel anti-phone modeling」主題。共同形成了獨特的指紋。

引用此