TENET: A Time-Reversal Enhancement Network for Noise-Robust ASR

Fu An Chao, Shao Wei Fan Jiang, Bi Cheng Yan, Jeih Weih Hung, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

4 引文 斯高帕斯(Scopus)

摘要

Due to the unprecedented breakthroughs brought about by deep learning, speech enhancement (SE) techniques have been developed rapidly and play an important role prior to acoustic modeling so as to mitigate noise effects on speech. To increase the perceptual quality of speech, the current state-of-the-art in the realm of SE adopts adversarial training by connecting an objective metric to the discriminator. However, there is no guarantee that optimizing the perceptual quality of speech will necessarily lead to improved automatic speech recognition (ASR) performance. In this study, we present TENET††Inspired by the movie - TENET, Christopher Nolan, 2020., ∗∗Some of the enhanced audio samples can be found from https://fuann.github.io/TENET., a novel Time-reversal Enhancement NETwork, which leverages the transformation of an input noisy signal itself, i.e., the time-reversed version, in conjunction with a Siamese network and a complex dual-path Transformer to promote SE performance for noise-robust ASR. Extensive experiments conducted on the Voicebank-DEMAND dataset show that TENET can achieve stellar results compared to a few top-of-the-line methods in terms of both SE and ASR evaluation metrics. To demonstrate the model generalization ability, we further evaluate TENET on the test set of scenarios contaminated with unseen noise, and the results also confirm the superiority of this promising method.

原文英語
主出版物標題2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面55-61
頁數7
ISBN(電子)9781665437394
DOIs
出版狀態已發佈 - 2021
事件2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021 - Cartagena, 哥伦比亚
持續時間: 2021 12月 132021 12月 17

出版系列

名字2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021 - Proceedings

會議

會議2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021
國家/地區哥伦比亚
城市Cartagena
期間2021/12/132021/12/17

ASJC Scopus subject areas

  • 電腦視覺和模式識別
  • 訊號處理
  • 語言和語言學

指紋

深入研究「TENET: A Time-Reversal Enhancement Network for Noise-Robust ASR」主題。共同形成了獨特的指紋。

引用此