跳至主導覽 跳至搜尋 跳過主要內容

Optimizing Automatic Speech Assessment: W-RankSim Regularization and Hybrid Feature Fusion Strategies

研究成果: 雜誌貢獻會議論文同行評審

1   !!Link opens in a new tab 引文 斯高帕斯(Scopus)

摘要

Automatic Speech Assessment (ASA) has seen notable advancements with the utilization of self-supervised features (SSL) in recent research.However, a key challenge in ASA lies in the imbalanced distribution of data, particularly evident in English test datasets.To address this challenge, we approach ASA as an ordinal classification task, introducing Weighted Vectors Ranking Similarity (W-RankSim) as a novel regularization technique.W-RankSim encourages closer proximity of weighted vectors in the output layer for similar classes, implying that feature vectors with similar labels would be gradually nudged closer to each other as they converge towards corresponding weighted vectors.Extensive experimental evaluations confirm the effectiveness of our approach in improving ordinal classification performance for ASA.Furthermore, we propose a hybrid model that combines SSL and handcrafted features, showcasing how the inclusion of handcrafted features enhances performance in an ASA system.

原文英語
頁(從 - 到)4004-4008
頁數5
期刊Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
DOIs
出版狀態已發佈 - 2024
事件25th Interspeech Conferece 2024 - Kos Island, 希腊
持續時間: 2024 9月 12024 9月 5

ASJC Scopus subject areas

  • 語言與語言學
  • 人機介面
  • 訊號處理
  • 軟體
  • 建模與模擬

指紋

深入研究「Optimizing Automatic Speech Assessment: W-RankSim Regularization and Hybrid Feature Fusion Strategies」主題。共同形成了獨特的指紋。

引用此