改善多細粒度的發音評測上資料不平衡的問題

Meng Shin Lin, Hsin Wei Wang, Tien Hong Lo, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

摘要

Automatic Pronunciation Assessment (APA) aims to quantify non-native (L2) learners' pronunciation proficiency in a specific language. With technological advancements, APA now evaluates various aspects of pronunciation, from phoneme level to sentence level, including accuracy, fluency, stress, and more. However, current APA methods rely on the Mean Squared Error (MSE) loss function, which struggles with imbalanced labels across different levels of granularity. This imbalance affects model generalizability and fairness, as MSE tends to underestimate rare labels. Despite these issues, existing research has not adequately addressed data imbalance. To address this gap, we draw inspiration from class-balanced loss functions in visual classification. Our approach involves resampling and introducing a trainable variable to narrow the gap between training and testing sets in imbalanced regression tasks, aiming to alleviate label imbalance effects in APA. Evaluating our method on the Speechocean762 dataset, known for significant word-level label imbalance, we observe remarkable enhancements in performance. Our proposed approach shows promise in tackling challenges stemming from imbalanced data in automatic pronunciation assessment.

貢獻的翻譯標題Addressing the issue of Data Imbalance in Multi-granularity Pronunciation Assessment
原文繁體中文
主出版物標題ROCLING 2023 - Proceedings of the 35th Conference on Computational Linguistics and Speech Processing
編輯Jheng-Long Wu, Ming-Hsiang Su, Hen-Hsen Huang, Yu Tsao, Hou-Chiang Tseng, Chia-Hui Chang, Lung-Hao Lee, Yuan-Fu Liao, Wei-Yun Ma
發行者The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
頁面134-140
頁數7
ISBN(電子)9789869576963
出版狀態已發佈 - 2023
事件35th Conference on Computational Linguistics and Speech Processing, ROCLING 2023 - Taipei City, 臺灣
持續時間: 2023 10月 202023 10月 21

出版系列

名字ROCLING 2023 - Proceedings of the 35th Conference on Computational Linguistics and Speech Processing

會議

會議35th Conference on Computational Linguistics and Speech Processing, ROCLING 2023
國家/地區臺灣
城市Taipei City
期間2023/10/202023/10/21

Keywords

  • Automatic Pronunciation Assessment
  • data imbalanced
  • regression loss function

ASJC Scopus subject areas

  • 語言與語言學
  • 言語和聽力

指紋

深入研究「改善多細粒度的發音評測上資料不平衡的問題」主題。共同形成了獨特的指紋。

引用此