Mining term networks from text collections for crime investigation

Yuen-Hsien Tseng, Zih Ping Ho, Kai Sheng Yang, Chun Cheng Chen

Research output: Contribution to journal (Article)

22 Citations (Scopus)

Abstract

An efficient term mining method to build a general term network is presented. The resulting term network can be used for entity relation visualization and exploration, which is useful in many text-mining applications such as crime exploration and investigation from vast piles of crime news or official criminal records. In the proposed method, terms from each document in a text collection are first identified. They are subjected to an analysis for pairwise association weights. The weights are then accumulated over all the documents to obtain final similarity for each term pair. Based on the resulting term similarity, a general term network for the collection is built with terms as nodes and non-zero similarities as links. In application, a list of predefined terms having similar attributes was selected to extract the desired sub-network from the general term network for entity relation visualization. This text analysis scenario based on the collective terms of the similar type or from the same topic enables evidence-based relation exploration. Some practical instances of crime exploration and investigation are demonstrated. Our application examples show that term relations, be it causality, subordination, coupling, or others, can be effectively revealed by our method and easily verified by the underlying text collection. This work contributes by presenting an integrated term-relationship mining and exploration approach and demonstrating the feasibility of the term network to the increasingly important application of crime exploration and investigation.
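The pipeline described in the abstract (identify terms per document, weight term pairs, accumulate the weights over all documents, then extract a sub-network from a predefined term list) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the abstract does not give the weighting formula, so a weight of 1.0 per co-occurring pair per document is assumed here. The sub-network rule (keep a link only when both endpoints are in the predefined list) is also an assumption, and the function names and toy data are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_term_network(docs_terms):
    """Accumulate pairwise association weights over all documents.

    docs_terms: iterable of per-document term lists (terms already identified).
    ASSUMPTION: each co-occurring pair contributes 1.0 per document; the
    paper's actual weighting scheme is not stated in the abstract.
    Returns a dict mapping a sorted (term_a, term_b) pair to its similarity.
    Only pairs with non-zero similarity appear, so each key is a link.
    """
    similarity = defaultdict(float)
    for terms in docs_terms:
        # Deduplicate and sort so each pair has one canonical key.
        for a, b in combinations(sorted(set(terms)), 2):
            similarity[(a, b)] += 1.0
    return dict(similarity)


def extract_subnetwork(similarity, predefined_terms):
    """Keep links whose two endpoints are both in the predefined list.

    ASSUMPTION: "both endpoints" is one plausible reading of the
    sub-network extraction step; the abstract does not specify it.
    """
    wanted = set(predefined_terms)
    return {
        pair: weight
        for pair, weight in similarity.items()
        if pair[0] in wanted and pair[1] in wanted
    }


# Toy, made-up term lists standing in for terms identified in three documents.
docs = [
    ["suspect_a", "suspect_b", "robbery"],
    ["suspect_a", "robbery"],
    ["suspect_b", "fraud"],
]
network = build_term_network(docs)
people = extract_subnetwork(network, ["suspect_a", "suspect_b"])
```

Because every link weight is a sum of per-document contributions, any link in the sub-network can be traced back to the documents in which the two terms co-occur, which is the sense in which the abstract calls relations verifiable against the underlying collection.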

Original language: English
Pages (from-to): 10082-10090
Number of pages: 9
Journal: Expert Systems with Applications
Volume: 39
Issue number: 11
DOI: 10.1016/j.eswa.2012.02.052
Publication status: Published - 2012 Sep 1

Fingerprint

Crime
Visualization
Piles

Keywords

  • Co-occurrence analysis
  • Knowledge discovery
  • Network analysis
  • Term relations
  • Visualization

ASJC Scopus subject areas

  • Engineering(all)
  • Computer Science Applications
  • Artificial Intelligence

Cite this

Mining term networks from text collections for crime investigation. / Tseng, Yuen-Hsien; Ho, Zih Ping; Yang, Kai Sheng; Chen, Chun Cheng.

In: Expert Systems with Applications, Vol. 39, No. 11, 01.09.2012, p. 10082-10090.

Tseng, Yuen-Hsien ; Ho, Zih Ping ; Yang, Kai Sheng ; Chen, Chun Cheng. / Mining term networks from text collections for crime investigation. In: Expert Systems with Applications. 2012 ; Vol. 39, No. 11. pp. 10082-10090.
@article{63e9cad39072433dad98fd3483bd1256,
title = "Mining term networks from text collections for crime investigation",
abstract = "An efficient term mining method to build a general term network is presented. The resulting term network can be used for entity relation visualization and exploration, which is useful in many text-mining applications such as crime exploration and investigation from vast piles of crime news or official criminal records. In the proposed method, terms from each document in a text collection are first identified. They are subjected to an analysis for pairwise association weights. The weights are then accumulated over all the documents to obtain final similarity for each term pair. Based on the resulting term similarity, a general term network for the collection is built with terms as nodes and non-zero similarities as links. In application, a list of predefined terms having similar attributes was selected to extract the desired sub-network from the general term network for entity relation visualization. This text analysis scenario based on the collective terms of the similar type or from the same topic enables evidence-based relation exploration. Some practical instances of crime exploration and investigation are demonstrated. Our application examples show that term relations, be it causality, subordination, coupling, or others, can be effectively revealed by our method and easily verified by the underlying text collection. This work contributes by presenting an integrated term-relationship mining and exploration approach and demonstrating the feasibility of the term network to the increasingly important application of crime exploration and investigation.",
keywords = "Co-occurrence analysis, Knowledge discovery, Network analysis, Term relations, Visualization",
author = "Tseng, Yuen-Hsien and Ho, {Zih Ping} and Yang, {Kai Sheng} and Chen, {Chun Cheng}",
year = "2012",
month = "9",
day = "1",
doi = "10.1016/j.eswa.2012.02.052",
language = "English",
volume = "39",
pages = "10082--10090",
journal = "Expert Systems with Applications",
issn = "0957-4174",
publisher = "Elsevier Limited",
number = "11",

}

TY - JOUR

T1 - Mining term networks from text collections for crime investigation

AU - Tseng, Yuen-Hsien

AU - Ho, Zih Ping

AU - Yang, Kai Sheng

AU - Chen, Chun Cheng

PY - 2012/9/1

Y1 - 2012/9/1

N2 - An efficient term mining method to build a general term network is presented. The resulting term network can be used for entity relation visualization and exploration, which is useful in many text-mining applications such as crime exploration and investigation from vast piles of crime news or official criminal records. In the proposed method, terms from each document in a text collection are first identified. They are subjected to an analysis for pairwise association weights. The weights are then accumulated over all the documents to obtain final similarity for each term pair. Based on the resulting term similarity, a general term network for the collection is built with terms as nodes and non-zero similarities as links. In application, a list of predefined terms having similar attributes was selected to extract the desired sub-network from the general term network for entity relation visualization. This text analysis scenario based on the collective terms of the similar type or from the same topic enables evidence-based relation exploration. Some practical instances of crime exploration and investigation are demonstrated. Our application examples show that term relations, be it causality, subordination, coupling, or others, can be effectively revealed by our method and easily verified by the underlying text collection. This work contributes by presenting an integrated term-relationship mining and exploration approach and demonstrating the feasibility of the term network to the increasingly important application of crime exploration and investigation.

KW - Co-occurrence analysis

KW - Knowledge discovery

KW - Network analysis

KW - Term relations

KW - Visualization

UR - http://www.scopus.com/inward/record.url?scp=84862828907&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84862828907&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2012.02.052

DO - 10.1016/j.eswa.2012.02.052

M3 - Article

AN - SCOPUS:84862828907

VL - 39

SP - 10082

EP - 10090

JO - Expert Systems with Applications

JF - Expert Systems with Applications

SN - 0957-4174

IS - 11

ER -