Integrating log-based and text-based methods towards automatic Web thesaurus construction

Hsiao Tieh Pu*, Lee Feng Chien


Research output: Contribution to journal, Article, peer-reviewed

1 Citation (Scopus)


This paper investigates the feasibility of automatically constructing a scalable thesaurus from the vocabulary that Web users employ to express their search interests. The proposed approach comprises two techniques: relevant term extraction and concept clustering. The former combines query-session-based and text-based methods to extract relevant terms for a given search term; the latter organizes these relevant terms into concept classes based on the results returned by search engines. Initial experiments were conducted to test the feasibility of the approach for organizing Web users' vocabularies. The results show that relevant terms can be extracted efficiently and that the resulting concept classes are better organized. The approach has considerable potential to benefit the automatic construction of large-scale thesauri for future Web IR applications.
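To make the query-session-based half of relevant term extraction concrete, the following is a minimal sketch and not the authors' method. It assumes a query log already split into sessions, each a list of query terms, and it ranks candidate terms by Jaccard similarity of session co-occurrence. The function name `relevant_terms`, the `top_k` parameter, and the choice of Jaccard are all illustrative assumptions; the abstract does not state which association measure the paper uses.

```python
from collections import Counter
from itertools import combinations


def relevant_terms(sessions, target, top_k=3):
    """Rank terms that share query sessions with `target`.

    Hypothetical helper: the scoring (Jaccard over sessions) is an
    assumption made for illustration, not taken from the paper.
    """
    term_sessions = Counter()  # number of sessions containing each term
    pair_sessions = Counter()  # number of sessions containing each term pair

    for session in sessions:
        terms = set(session)  # count each term once per session
        term_sessions.update(terms)
        pair_sessions.update(
            frozenset(pair) for pair in combinations(sorted(terms), 2)
        )

    scores = {}
    for term in term_sessions:
        if term == target:
            continue
        both = pair_sessions[frozenset((target, term))]
        if both:
            either = term_sessions[target] + term_sessions[term] - both
            scores[term] = both / either

    # Highest score first; ties broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_k]
```

Calling `relevant_terms(sessions, "jaguar")` on a small log returns `(term, score)` pairs in descending order of score. A text-based method, as described in the abstract, would supply a second set of candidates drawn from documents rather than from logs.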

Pages (from-to): 463-471
Journal: Proceedings of the ASIST Annual Meeting
Publication status: Published - Nov 2004

ASJC Scopus subject areas

  • Information Systems
  • Library and Information Sciences

