Mining term networks from text collections for crime investigation

Yuen Hsien Tseng*, Zih Ping Ho, Kai Sheng Yang, Chun Cheng Chen

*此作品的通信作者

研究成果: 雜誌貢獻期刊論文同行評審

35 引文 斯高帕斯(Scopus)

摘要

An efficient term mining method to build a general term network is presented. The resulting term network can be used for entity relation visualization and exploration, which is useful in many text-mining applications such as crime exploration and investigation from vast piles of crime news or official criminal records. In the proposed method, terms from each document in a text collection are first identified. They are subjected to an analysis for pairwise association weights. The weights are then accumulated over all the documents to obtain final similarity for each term pair. Based on the resulting term similarity, a general term network for the collection is built with terms as nodes and non-zero similarities as links. In application, a list of predefined terms having similar attributes was selected to extract the desired sub-network from the general term network for entity relation visualization. This text analysis scenario based on the collective terms of the similar type or from the same topic enables evidence-based relation exploration. Some practical instances of crime exploration and investigation are demonstrated. Our application examples show that term relations, be it causality, subordination, coupling, or others, can be effectively revealed by our method and easily verified by the underlying text collection. This work contributes by presenting an integrated term-relationship mining and exploration approach and demonstrating the feasibility of the term network to the increasingly important application of crime exploration and investigation.

原文英語
頁(從 - 到)10082-10090
頁數9
期刊Expert Systems with Applications
39
發行號11
DOIs
出版狀態已發佈 - 2012 9月 1

ASJC Scopus subject areas

  • 一般工程
  • 電腦科學應用
  • 人工智慧

指紋

深入研究「Mining term networks from text collections for crime investigation」主題。共同形成了獨特的指紋。

引用此