Peppanet: Effective Mispronunciation Detection and Diagnosis Leveraging Phonetic, Phonological, and Acoustic Cues

Bi Cheng Yan, Hsin Wei Wang, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

6 引文 斯高帕斯(Scopus)

摘要

Mispronunciation detection and diagnosis (MDD) aims to detect erroneous pronunciation segments in an L2 learner's articulation and subsequently provide informative diagnostic feedback. Most existing neural methods follow a dictation-based modeling paradigm that finds out pronunciation errors and returns diagnostic feedback at the same time by aligning the recognized phone sequence uttered by an L2 learner to the corresponding canonical phone sequence of a given text prompt. However, the main downside of these methods is that the dictation process and alignment process are mostly made independent of each other. In view of this, we present a novel end-to-end neural method, dubbed PeppaNet, building on a unified structure that can jointly model the dictation process and the alignment process. The model of our method learns to directly predict the pronunciation correctness of each canonical phone of the text prompt and in turn provides its corresponding diagnostic feedback. In contrast to the conventional dictation-based methods that rely mainly on a free-phone recognition process, PeppaNet makes good use of an effective selective gating mechanism to simultaneously incorporate phonetic, phonological and acoustic cues to generate corrections that are more proper and phonetically related to the canonical pronunciations. Extensive sets of experiments conducted on the L2-ARCTIC benchmark dataset seem to show the merits of our proposed method in comparison to some recent top-of-the-line methods.

原文英語
主出版物標題2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面1045-1051
頁數7
ISBN(電子)9798350396904
DOIs
出版狀態已發佈 - 2023
事件2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Doha, 卡塔尔
持續時間: 2023 1月 92023 1月 12

出版系列

名字2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings

會議

會議2022 IEEE Spoken Language Technology Workshop, SLT 2022
國家/地區卡塔尔
城市Doha
期間2023/01/092023/01/12

ASJC Scopus subject areas

  • 電腦視覺和模式識別
  • 硬體和架構
  • 媒體技術
  • 儀器
  • 語言和語言學

指紋

深入研究「Peppanet: Effective Mispronunciation Detection and Diagnosis Leveraging Phonetic, Phonological, and Acoustic Cues」主題。共同形成了獨特的指紋。

引用此