Few-Shot Open-Set Keyword Spotting with Multi-Stage Training

Lo Ya Li*, Tien Hong Lo, Jeih Weih Hung, Shih Chieh Huang, Berlin Chen

*此作品的通信作者

研究成果: 書貢獻/報告類型會議論文篇章

摘要

As the advance of human-computer interaction technologies continued, keyword spotting (KWS) systems have gained prominence in everyday devices. This study is dedicated to exploring innovative approaches for few-shot keyword recognition under open-set conditions, a challenging yet crucial area in speech processing. To this end, we design and develop a multi-stage training method that synergistically combines the advantages of acoustic and phonetic features, thereby substantially enhancing the ability of a KWS model. By learning multi-type features with joint training from only one dataset, our KWS model is equipped with a more robustness feature extractor to deal with few-shot KWS. Experimental results demonstrate that our model outperforms strong baselines by achieving a 15% improvement in recognition accuracy on open-set tests in a 10shot-10way setting. This research confirms the effectiveness of our multi-stage strategy and suggests promising directions for future development in keyword recognition technologies.

原文英語
主出版物標題APSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024
發行者Institute of Electrical and Electronics Engineers Inc.
ISBN(電子)9798350367331
DOIs
出版狀態已發佈 - 2024
事件2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2024 - Macau, 中国
持續時間: 2024 12月 32024 12月 6

出版系列

名字APSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024

會議

會議2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2024
國家/地區中国
城市Macau
期間2024/12/032024/12/06

ASJC Scopus subject areas

  • 人工智慧
  • 電腦科學應用
  • 硬體和架構
  • 訊號處理

指紋

深入研究「Few-Shot Open-Set Keyword Spotting with Multi-Stage Training」主題。共同形成了獨特的指紋。

引用此