Mandarin Chinese Mispronunciation Detection and Diagnosis Leveraging Deep Neural Network Based Acoustic Modeling and Training Techniques

Berlin Chen*, Yao Chi Hsu

*此作品的通信作者

研究成果: 書貢獻/報告類型篇章

5 引文 斯高帕斯(Scopus)

摘要

Automatic mispronunciation detection and diagnosis are two critical and integral components of a computer-assisted pronunciation training (CAPT) system, collectively facilitating second-language (L2) learners to pinpoint erroneous pronunciations in a given utterance so as to improve their spoken proficiency. In this chapter, we will first briefly introduce the latest trends and developments in mispronunciation detection and diagnosis with state-of-the-art automatic speech recognition (ASR) methodologies, especially those using deep neural network based acoustic models. Afterward, we present an effective training approach that estimates the deep neural network based acoustic models involved in the mispronunciation detection process by optimizing an objective directly linked to the ultimate performance evaluation metric. We also investigate the extent to which the subsequent mispronunciation diagnosis process can benefit from the use of these specifically trained acoustic models. For this purpose, we recast mispronunciation diagnosis as a classification problem and a set of indicative features are derived. A series of experiments on a Mandarin Chinese mispronunciation detection and diagnosis task are conducted to evaluate the performance merits of such an approach.

原文英語
主出版物標題Chinese Language Learning Sciences
發行者Springer Nature
頁面217-234
頁數18
DOIs
出版狀態已發佈 - 2019

出版系列

名字Chinese Language Learning Sciences
ISSN(列印)2520-1719
ISSN(電子)2520-1727

ASJC Scopus subject areas

  • 語言與語言學
  • 教育
  • 語言和語言學
  • 電腦科學應用

指紋

深入研究「Mandarin Chinese Mispronunciation Detection and Diagnosis Leveraging Deep Neural Network Based Acoustic Modeling and Training Techniques」主題。共同形成了獨特的指紋。

引用此