跳至主導覽 跳至搜尋 跳過主要內容

PG-MDD: Prompt-Guided Mispronunciation Detection and Diagnosis Leveraging Articulatory Features

  • Meng Shin Lin*
  • , Bi Cheng Yan
  • , Tien Hong Lo
  • , Hsin Wei Wang
  • , Yue Yang He
  • , Wei Cheng Chao
  • , Berlin Chen
  • *此作品的通信作者

研究成果: 書貢獻/報告類型會議論文篇章

摘要

Mispronunciation detection and diagnosis (MDD) manages to pinpoint phonetic errors of L2 (second-language) learners and then provides timely and informative diagnosis on erroneous pronunciation segments. Recently, dictation-based neural methods have emerged as an appealing modeling paradigm for MDD, which simultaneously identifies pronunciation errors and provides diagnostic feedback by aligning the recognized phone sequence to the corresponding canonical phone sequence of a given text prompt. Despite their decent performance in terms of F1-score, dictation-based models still struggle to accurately detect pronunciation errors with balanced precision and recall evaluations, resulting in inferior learning efficiency for L2 learners. In view of this, we propose a novel prompt-guided dictation-based MDD model, dubbed PG-MDD, that can efficiently strike a balance the precision and recall rates while maintaining a high-performing F1-score. PG-MDD first jointly optimizes the mispronunciation detection and diagnosis processes during the training phase, while aptly guiding the diagnosis process with phone-dependent thresholds in the inference phase. In addition, a novel multi-view audio encoder is introduced to render the fine-grained articulatory cues within learners' speech. A comprehensive set of empirical experiments conducted on the L2-ARCTIC benchmark dataset suggests the practical feasibility of our method in relation to several competitive baselines.

原文英語
主出版物標題APSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024
發行者Institute of Electrical and Electronics Engineers Inc.
ISBN(電子)9798350367331
DOIs
出版狀態已發佈 - 2024
事件2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2024 - Macau, 中国
持續時間: 2024 12月 32024 12月 6

出版系列

名字APSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024

會議

會議2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2024
國家/地區中国
城市Macau
期間2024/12/032024/12/06

ASJC Scopus subject areas

  • 人工智慧
  • 電腦科學應用
  • 硬體和架構
  • 訊號處理

指紋

深入研究「PG-MDD: Prompt-Guided Mispronunciation Detection and Diagnosis Leveraging Articulatory Features」主題。共同形成了獨特的指紋。

引用此