跳至主導覽 跳至搜尋 跳過主要內容

ConPCO: Preserving Phoneme Characteristics For Automatic Pronunciation Assessment Leveraging Contrastive Ordinal Regularization

  • Bi Cheng Yan
  • , Yi Cheng Wang
  • , Jiun Ting Li
  • , Meng Shin Lin
  • , Hsin Wei Wang
  • , Wei Cheng Chao
  • , Berlin Chen*
  • *此作品的通信作者

研究成果: 雜誌貢獻會議論文同行評審

2   連結會在新分頁中打開 引文 斯高帕斯(Scopus)

摘要

Automatic pronunciation assessment (APA) manages to evaluate the pronunciation proficiency of a second language (L2) learner in a target language. Existing efforts typically draw on regression models for proficiency score prediction, wherein the models are trained to estimate target values without explicitly accounting for phoneme-awareness in the feature space. In this paper, we propose a contrastive phonemic ordinal regularizer (ConPCO) tailored for regression-based APA models to generate more phoneme-discriminative features while factoring in the ordinal relationships among the regression targets. The proposed ConPCO first aligns the phoneme representations of an APA model and textual embeddings of phonetic transcriptions via contrastive learning. Afterward, the phoneme characteristics are retained by regulating the distances between inter- and intra-phoneme categories in the feature space while allowing for the ordinal relationships among the output targets. We further design and develop a hierarchical APA model to evaluate the effectiveness of our regularizer. A series of experiments conducted on the speechocean762 benchmark dataset suggests the feasibility and effectiveness of our approach in relation to several competitive baselines.

ASJC Scopus subject areas

  • 軟體
  • 訊號處理
  • 電氣與電子工程

指紋

深入研究「ConPCO: Preserving Phoneme Characteristics For Automatic Pronunciation Assessment Leveraging Contrastive Ordinal Regularization」主題。共同形成了獨特的指紋。

引用此