用於語音增強之偽影感知加權損失函數

En Lun Yu, Kuan Hsun Ho, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

摘要

The Speech Enhancement (SE) system not only enhances the perceptual quality of speech but also make the ASR performance robust in noisy enviornments when integrating with ASR systems. However, single-channel SE may generate detrimental artifacts to ASR recognition, leading to recognition errors. Recent research indicates that by introducing the novel SE loss function NAaLoss and fine-tuning the model, the generation of artifacts can be effectively reduced. Nonetheless, this approach still needs to be revised in its underlying assumptions. Therefore, we extensively analyze this method in this study and conduct numerous experiments and case studies to identify the inconsistencies. To address this, we propose an improved loss function, AaWLoss. AaWLoss successfully resolves the potential loss of noise-condition artifact suppression inherent in NAaLoss under the same settings through modifications and optimizations. Furthermore, AaWLoss achieves peak performance in suppressing artifacts under clean conditions, even adding information beneficial for ASR recognition to the enhanced clean speech.

貢獻的翻譯標題AaWLoss: An Artifact-aware Weighted Loss Function for Speech Enhancement
原文繁體中文
主出版物標題ROCLING 2023 - Proceedings of the 35th Conference on Computational Linguistics and Speech Processing
編輯Jheng-Long Wu, Ming-Hsiang Su, Hen-Hsen Huang, Yu Tsao, Hou-Chiang Tseng, Chia-Hui Chang, Lung-Hao Lee, Yuan-Fu Liao, Wei-Yun Ma
發行者The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
頁面71-78
頁數8
ISBN(電子)9789869576963
出版狀態已發佈 - 2023
事件35th Conference on Computational Linguistics and Speech Processing, ROCLING 2023 - Taipei City, 臺灣
持續時間: 2023 10月 202023 10月 21

出版系列

名字ROCLING 2023 - Proceedings of the 35th Conference on Computational Linguistics and Speech Processing

會議

會議35th Conference on Computational Linguistics and Speech Processing, ROCLING 2023
國家/地區臺灣
城市Taipei City
期間2023/10/202023/10/21

Keywords

  • noise-robust speech Recognition
  • processing artifacts
  • single-channel speech enhancement

ASJC Scopus subject areas

  • 語言與語言學
  • 言語和聽力

指紋

深入研究「用於語音增強之偽影感知加權損失函數」主題。共同形成了獨特的指紋。

引用此