The robustness of domain lexico-taxonomy: Expanding domain lexicon with CiLin

Chu Ren Huang, Xiang Bing Li, Jia Fei Hong

Research output: Conference contribution › Conference paper › Peer-reviewed

2 Citations (Scopus)

Abstract

This paper deals with the robust expansion of the Domain Lexico-Taxonomy (DLT). A DLT is a domain taxonomy enriched with domain lexica, and it was proposed as an infrastructure for crossing domain barriers (Huang et al. 2004). The DLT proposal is based on the observation that domain lexica contain entries that are also part of a general lexicon. When entries of a general lexicon are marked with their associated domain attributes, this information has two important applications. First, the DLT serves as a seed for domain lexica. Second, the DLT offers the most reliable evidence for deciding the domain of a new text, since these lexical clues belong to the general lexicon and occur reliably in all texts. General lexicon lemmas are therefore extracted to populate domain lexica, which are situated in a domain taxonomy. Building on this previous work, we show in this paper that the original DLT can be further expanded when a new language resource is introduced. We applied CiLin, a Chinese thesaurus, and added more than 1000 new entries to the DLT. Our evaluation shows that the DLT approach is robust, since both the size and the number of domain lexica increased effectively.
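The two ideas in the abstract, expanding a domain-tagged lexicon with a thesaurus and using tagged general-lexicon lemmas as domain clues, can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not specify the expansion procedure, so the sketch assumes one plausible reading in which domain tags are copied to untagged members of the same synonym group. The function names, the toy English data, and the simple vote count are all hypothetical.

```python
from collections import Counter


def expand_with_thesaurus(dlt, synonym_groups):
    """Copy domain tags from tagged lemmas to untagged members of the same group.

    Assumption: `dlt` maps lemma -> set of domain labels, and each synonym
    group is a list of lemmas (a stand-in for a thesaurus class).
    """
    expanded = {lemma: set(domains) for lemma, domains in dlt.items()}
    for group in synonym_groups:
        # Collect every domain carried by an already-tagged member of this group.
        inherited = set()
        for lemma in group:
            inherited |= dlt.get(lemma, set())
        if not inherited:
            continue  # no seed entry in this group, so nothing to propagate
        for lemma in group:
            if lemma not in dlt:
                expanded.setdefault(lemma, set()).update(inherited)
    return expanded


def guess_domain(tokens, dlt):
    """Return the domain with the most tagged tokens, or None if none is tagged."""
    votes = Counter()
    for tok in tokens:
        for domain in dlt.get(tok, ()):
            votes[domain] += 1
    return votes.most_common(1)[0][0] if votes else None


# Toy data (hypothetical, not taken from the paper or from CiLin).
seed_dlt = {"interest": {"finance"}, "virus": {"medicine"}}
groups = [["interest", "dividend"], ["virus", "pathogen"], ["walk", "stroll"]]
expanded_dlt = expand_with_thesaurus(seed_dlt, groups)
```

In this toy run, `dividend` and `pathogen` inherit the tags of their seed synonyms, while the `walk`/`stroll` group stays untagged because it contains no seed entry. A real system would need a filtering or evaluation step to avoid propagating tags to loosely related words.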

Original language: English
Pages: 103-109
Number of pages: 7
Publication status: Published - 2005
Externally published
Event: 4th SIGHAN Workshop on Chinese Language Processing at the 2nd International Joint Conference on Natural Language Processing, SIGHAN@IJCNLP 2005 - Jeju Island, Republic of Korea
Duration: 14 Oct 2005 to 15 Oct 2005

Conference

Conference: 4th SIGHAN Workshop on Chinese Language Processing at the 2nd International Joint Conference on Natural Language Processing, SIGHAN@IJCNLP 2005
Country/Territory: Republic of Korea
City: Jeju Island
Period: 2005/10/14 to 2005/10/15

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
