Extractive Chinese spoken document summarization using probabilistic ranking models

Yi Ting Chen*, Suhan Yu, Hsin Min Wang, Berlin Chen

*此作品的通信作者

研究成果: 書貢獻/報告類型會議論文篇章

4 引文 斯高帕斯(Scopus)

摘要

The purpose of extractive summarization is to automatically select indicative sentences, passages, or paragraphs from an original document according to a certain target summarization ratio, and then sequence them to form a concise summary. In this paper, in contrast to conventional approaches, our objective is to deal with the extractive summarization problem under a probabilistic modeling framework. We investigate the use of the hidden Markov model (HMM) for spoken document summarization, in which each sentence of a spoken document is treated as an HMM for generating the document, and the sentences are ranked and selected according to their likelihoods. In addition, the relevance model (RM) of each sentence, estimated from a contemporary text collection, is integrated with the HMM model to improve the representation of the sentence model. The experiments were performed on Chinese broadcast news compiled in Taiwan. The proposed approach achieves noticeable performance gains over conventional summarization approaches.

原文英語
主出版物標題Chinese Spoken Language Processing - 5th International Symposium, ISCSLP 2006, Proceedings
頁面660-671
頁數12
DOIs
出版狀態已發佈 - 2006
事件5th International Symposium on Chinese Spoken Language Processing, ISCSLP 2006 - Singapore, 新加坡
持續時間: 2006 12月 132006 12月 16

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
4274 LNAI
ISSN(列印)0302-9743
ISSN(電子)1611-3349

其他

其他5th International Symposium on Chinese Spoken Language Processing, ISCSLP 2006
國家/地區新加坡
城市Singapore
期間2006/12/132006/12/16

ASJC Scopus subject areas

  • 理論電腦科學
  • 一般電腦科學

指紋

深入研究「Extractive Chinese spoken document summarization using probabilistic ranking models」主題。共同形成了獨特的指紋。

引用此