Enhanced Bert-Based Ranking Models for Spoken Document Retrieval

Hsiao Yun Lin, Tien Hong Lo, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

8 引文 斯高帕斯(Scopus)

摘要

The Bidirectional Encoder Representations from Transformers (BERT) model has recently achieved record-breaking success on many natural language processing (NLP) tasks such as question answering and language understanding. However, relatively little work has been done on ad-hoc information retrieval (IR), especially for spoken document retrieval (SDR). This paper adopts and extends BERT for SDR, while its contributions are at least three-fold. First, we augment BERT with extra language features such as unigram and inverse document frequency (IDF) statistics to make it more applicable to SDR. Second, we also explore the incorporation of confidence scores into document representations to see if they could help alleviate the negative effects resulting from imperfect automatic speech recognition (ASR). Third, we conduct a comprehensive set of experiments to compare our BERT-based ranking methods with other state-of-The-Art ones and investigate the synergy effect of them as well.

原文英語
主出版物標題2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面601-606
頁數6
ISBN(電子)9781728103068
DOIs
出版狀態已發佈 - 2019 12月
事件2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019 - Singapore, 新加坡
持續時間: 2019 12月 152019 12月 18

出版系列

名字2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019 - Proceedings

會議

會議2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019
國家/地區新加坡
城市Singapore
期間2019/12/152019/12/18

ASJC Scopus subject areas

  • 電腦網路與通信
  • 訊號處理
  • 語言和語言學
  • 通訊

指紋

深入研究「Enhanced Bert-Based Ranking Models for Spoken Document Retrieval」主題。共同形成了獨特的指紋。

引用此