The NTNU Super Monster Team (SPMT) system for the Formosa Speech Recognition Challenge 2023 - Hakka ASR

Tzu Ting Yang, Hsin Wei Wang, Meng Ting Tsai, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

摘要

This paper aims to record the progress of the NTNU Super Monster Team (SMPT) in the Formosa Speech Recognition Challenge 2023 (FSR-2023), which is the third event of the Formosa Speech in the Wild (FSW) project. The primary task was to recognize Hakka speech using a corpus of Hakka speakers in Taiwan. We present our participation results in Track 1: Taiwanese Hakka recommended characters speech recognition. Recently, the percentage of Hakka speakers in Taiwan is only about 5.5 percent of the total population, and is still decreasing year by year, which causes resistance in acquiring the corpus; due to the strong ethnic identity of the Hakka cultural group, it has a strong linguistic independence and exclusivity. In summary, the scarcity of Hakka pairedcorpus and the difficulty of learning other dialects for mutual benefit have undoubtedly aggravated the difficulty of the FSR-2023. In this study, we try to investigate the interleaving effects of various components by integrating data augmentation, self-supervised learning features, large-scale speech recognition models, and language models to improve the performance of Hakka speech recognition. This article aims to explore the impact of various modules on Hakka speech recognition performance and has ultimately achieved fruitful results. We hoped that this effort can contribute to the preservation of endangered languages in our country.

原文英語
主出版物標題ROCLING 2023 - Proceedings of the 35th Conference on Computational Linguistics and Speech Processing
編輯Jheng-Long Wu, Ming-Hsiang Su, Hen-Hsen Huang, Yu Tsao, Hou-Chiang Tseng, Chia-Hui Chang, Lung-Hao Lee, Yuan-Fu Liao, Wei-Yun Ma
發行者The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
頁面413-421
頁數9
ISBN(電子)9789869576963
出版狀態已發佈 - 2023
事件35th Conference on Computational Linguistics and Speech Processing, ROCLING 2023 - Taipei City, 臺灣
持續時間: 2023 10月 202023 10月 21

出版系列

名字ROCLING 2023 - Proceedings of the 35th Conference on Computational Linguistics and Speech Processing

會議

會議35th Conference on Computational Linguistics and Speech Processing, ROCLING 2023
國家/地區臺灣
城市Taipei City
期間2023/10/202023/10/21

ASJC Scopus subject areas

  • 語言與語言學
  • 言語和聽力

指紋

深入研究「The NTNU Super Monster Team (SPMT) system for the Formosa Speech Recognition Challenge 2023 - Hakka ASR」主題。共同形成了獨特的指紋。

引用此