Incorporating paragraph embeddings and density peaks clustering for spoken document summarization

Kuan Yu Chen, Kai Wun Shih, Shih Hung Liu, Berlin Chen, Hsin Min Wang

Research output: Chapter in book/report › Conference contribution

6 Citations (Scopus)

Abstract

Representation learning has emerged as an active research subject in many machine learning applications because of its excellent performance. As one instantiation, word embedding has been widely used in natural language processing. However, as far as we are aware, relatively few studies have investigated paragraph embedding methods in extractive text or speech summarization. Extractive summarization aims at selecting a set of indicative sentences from a source document to express the most important theme of the document. There is a general consensus that relevance and redundancy are both critical issues for users in a realistic summarization scenario. However, most existing methods focus on determining only the degree of relevance between sentences and a given document, while the degree of redundancy is calculated in a post-processing step. Based on these observations, this paper makes three contributions. First, we comprehensively compare word and paragraph embedding methods for spoken document summarization. Second, we propose a novel summarization framework that takes both relevance and redundancy information into account simultaneously. Consequently, a set of representative sentences can be automatically selected through a one-pass process. Third, we plug paragraph embedding methods into the proposed framework to further enhance summarization performance. Experimental results demonstrate the effectiveness of our proposed methods compared to existing state-of-the-art methods.

Original language: English
Title of host publication: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 207-214
Number of pages: 8
ISBN (electronic): 9781479972913
DOIs
Publication status: Published - Feb 10 2016
Event: IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Scottsdale, United States
Duration: Dec 13 2015 to Dec 17 2015

Publication series

Name: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings

Other

Other: IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015
Country/Territory: United States
City: Scottsdale
Period: 2015/12/13 to 2015/12/17

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
