Toward generic title generation for clustered documents

Yuen Hsien Tseng*, Chi Jen Lin, Hsiu Han Chen, Yu I. Lin

*此作品的通信作者

研究成果: 書貢獻/報告類型會議論文篇章

24 引文 斯高帕斯(Scopus)

摘要

A cluster labeling algorithm for creating generic titles based on external resources such as WordNet is proposed. Our method first extracts category-specific terms as cluster descriptors. These descriptors are then mapped to generic terms based on a hypernym search algorithm. The proposed method has been evaluated on a patent document collection and a subset of the Reuters-21578 collection. Experimental results revealed that our method performs as anticipated. Real-case applications of these generic terms show promising in assisting humans in interpreting the clustered topics. Our method is general enough such that it can be easily extended to use other hierarchical resources for adaptable label generation.

原文英語
主出版物標題Information Retrieval Technology - Third Asia Information Retrieval Symposium, AIRS 2006, Proceedings
發行者Springer Verlag
頁面145-157
頁數13
ISBN(列印)3540457801, 9783540457800
DOIs
出版狀態已發佈 - 2006
事件3rd Asia Information Retrieval Symposium, AIRS 2006 - Singapore, 新加坡
持續時間: 2006 10月 162006 10月 18

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
4182 LNCS
ISSN(列印)0302-9743
ISSN(電子)1611-3349

其他

其他3rd Asia Information Retrieval Symposium, AIRS 2006
國家/地區新加坡
城市Singapore
期間2006/10/162006/10/18

ASJC Scopus subject areas

  • 理論電腦科學
  • 一般電腦科學

指紋

深入研究「Toward generic title generation for clustered documents」主題。共同形成了獨特的指紋。

引用此