A Preliminary Study on Environmental Sound Classification Leveraging Large-Scale Pretrained Model and Semi-Supervised Learning

You Sheng Tsao, Tien Hong Lo, Jiun Ting Li, Shi Yan Weng, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

摘要

With the widespread commercialization of smart devices, research on environmental sound classification has gained more and more attention in recent years. In this paper, we set out to make effective use of large-scale audio pretrained model and semi-supervised model training paradigm for environmental sound classification. To this end, an environmental sound classification method is first put forward, whose component model is built on top a large-scale audio pretrained model. Further, to simulate a low-resource sound classification setting where only limited supervised examples are made available, we instantiate the notion of transfer learning with a recently proposed training algorithm (namely, FixMatch) and a data augmentation method (namely, SpecAugment) to achieve the goal of semi-supervised model training. Experiments conducted on benchmark dataset UrbanSound8K reveal that our classification method can lead to an accuracy improvement of 2.4% in relation to a current baseline method.

原文英語
主出版物標題ROCLING 2021 - Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing
編輯Lung-Hao Lee, Chia-Hui Chang, Kuan-Yu Chen
發行者The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
頁面103-110
頁數8
ISBN(電子)9789869576949
出版狀態已發佈 - 2021
事件33rd Conference on Computational Linguistics and Speech Processing, ROCLING 2021 - Taoyuan, 臺灣
持續時間: 2021 10月 152021 10月 16

出版系列

名字ROCLING 2021 - Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing

會議

會議33rd Conference on Computational Linguistics and Speech Processing, ROCLING 2021
國家/地區臺灣
城市Taoyuan
期間2021/10/152021/10/16

ASJC Scopus subject areas

  • 語言與語言學
  • 語言和語言學
  • 言語和聽力

指紋

深入研究「A Preliminary Study on Environmental Sound Classification Leveraging Large-Scale Pretrained Model and Semi-Supervised Learning」主題。共同形成了獨特的指紋。

引用此