Development of a text retrieval and mining system for Taiwanese historical people

Shun Hong Sie, Hao Ren Ke, Su Bing Chang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

Personage is an important kind of entities in study of history. Comprehensive understanding of personage biographies is beneficial for researching into historical events. This article introduces the development of a text retrieval and mining system for Taiwanese historical people - Taiwan Biographical Database (TBDB). It describes the characteristics of personages in TBDB, highlights the system architecture and preliminary achievement of TBDB, and proposes a method to recognize named entities in the personage biographies, specifically poetry societies, which achieves the recall rate 96% and the precision rate 65%. Finally, this article elaborates on the lessons learned through the creation of TBDB, and the future plans.

Original languageEnglish
Title of host publicationProceedings of the 2017 Pacific Neighborhood Consortium Annual Conference and Joint Meetings
Subtitle of host publicationData Informed Society, PNC 2017
EditorsSophy Shu-Jiun Chen, Feng-Tyan Lin, Da-Wei Wang, Ling-Jyh Chen
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages56-62
Number of pages7
ISBN (Electronic)9789869531702
DOIs
Publication statusPublished - 2017 Dec 13
Event2017 Pacific Neighborhood Consortium Annual Conference and Joint Meetings, PNC 2017 - Tainan, Taiwan
Duration: 2017 Nov 72017 Nov 9

Publication series

NameProceedings of the 2017 Pacific Neighborhood Consortium Annual Conference and Joint Meetings: Data Informed Society, PNC 2017
Volume2017-December

Conference

Conference2017 Pacific Neighborhood Consortium Annual Conference and Joint Meetings, PNC 2017
Country/TerritoryTaiwan
CityTainan
Period2017/11/072017/11/09

Keywords

  • Taiwan Biographical Database (TBDB)
  • name entity recognition
  • social network analysis (SNA)
  • text mining
  • text retrieval

ASJC Scopus subject areas

  • Information Systems
  • Information Systems and Management
  • Computer Networks and Communications
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Development of a text retrieval and mining system for Taiwanese historical people'. Together they form a unique fingerprint.

Cite this