Chinese text summarization using a trainable summarizer and latent semantic analysis

Jen Yuan Yeh, Hao Ren Ke, Wei Pang Yang

Research output: Chapter in Book/Report › Conference contribution

20 Citations (Scopus)

Abstract

In this paper, two novel approaches are proposed to extract important sentences from a document to create its summary. The first is a corpus-based approach using feature analysis. It brings up three new ideas: 1) to employ ranked position to emphasize the significance of sentence position, 2) to reshape word unit to achieve higher accuracy of keyword importance, and 3) to train a score function by the genetic algorithm for obtaining a suitable combination of feature weights. The second approach combines the ideas of latent semantic analysis and text relationship maps to interpret conceptual structures of a document. Both approaches are applied to Chinese text summarization. The two approaches were evaluated by using a data corpus composed of 100 articles about politics from New Taiwan Weekly, and when the compression ratio was 30%, average recalls of 52.0% and 45.6% were achieved respectively.
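As a rough illustration of the extractive setup the abstract describes, the sketch below scores each sentence as a weighted combination of feature values, keeps the top-scoring sentences up to a given compression ratio, and measures recall against a reference summary. It is not the authors' implementation. The two features, their values, and the equal weights are hypothetical placeholders (the paper trains the weights with a genetic algorithm). The sketch also assumes that the compression ratio is the fraction of sentences kept and that recall is the fraction of reference sentences recovered; the abstract confirms neither definition.

```python
def score(features, weights):
    """Weighted linear combination of one sentence's feature values."""
    return sum(w * f for w, f in zip(weights, features))


def extract(sentence_features, weights, compression_ratio=0.3):
    """Return indices of the top-scoring sentences, in document order.

    Assumption: compression ratio = fraction of sentences kept.
    """
    n_keep = max(1, round(compression_ratio * len(sentence_features)))
    ranked = sorted(
        range(len(sentence_features)),
        key=lambda i: score(sentence_features[i], weights),
        reverse=True,
    )
    return sorted(ranked[:n_keep])


def recall(selected, reference):
    """Fraction of reference-summary sentences that were selected (assumed definition)."""
    reference = set(reference)
    return len(reference & set(selected)) / len(reference)


# Hypothetical data: 10 sentences, each with (position, keyword) feature values.
features = [
    (1.0, 0.2), (0.9, 0.8), (0.8, 0.1), (0.7, 0.3), (0.6, 0.9),
    (0.5, 0.2), (0.4, 0.1), (0.3, 0.4), (0.2, 0.3), (0.1, 0.7),
]
weights = (0.5, 0.5)  # placeholder weights, not trained

summary = extract(features, weights)
print(summary, recall(summary, reference=[0, 1, 4]))
```

A genetic algorithm, as mentioned in the abstract, would search over `weights` to maximize a fitness measure such as average recall on a training corpus; that search is omitted here.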

Original language: English
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Publisher: Springer Verlag
Pages: 76-87
Number of pages: 12
ISBN (Print): 3540002618, 9783540002611
Publication status: Published - 1 Jan 2002
Event: 5th International Conference on Asian Digital Libraries, ICADL 2002 - Singapore, Singapore
Duration: 11 Dec 2002 to 14 Dec 2002

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 2555
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 5th International Conference on Asian Digital Libraries, ICADL 2002
Country: Singapore
City: Singapore
Period: 11 Dec 2002 to 14 Dec 2002

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

  • Cite this

    Yeh, J. Y., Ke, H. R., & Yang, W. P. (2002). Chinese text summarization using a trainable summarizer and latent semantic analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 76-87). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2555). Springer Verlag.