Annotating text segments using a web-based categorization approach

Hsin Chen Chiao, Hsiao Tieh Pu, Lee Feng Chien

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Conventional automatic text annotation tools mostly extract named entities from texts and annotate them with information about persons, locations, and dates, etc. Such kind of entity type information, however, is insufficient for machines to understand the context or facts contained in the texts. This paper presents a general text categorization approach to categorize text segments into broader subject categories, such as categorizing a text string into a category of paper title in Mathematics or a category of conference name in Computer Science. Experimental results confirm its wide applicability to various digital library applications.

Original languageEnglish
Title of host publicationDigital Libraries
Subtitle of host publicationImplementing Strategies and Sharing Experiences - 8th International Conference on Asian Digital Libraries, ICADL 2005, Proceedings
Pages323-331
Number of pages9
DOIs
Publication statusPublished - 2005 Dec 1
Event8th International Conference on Asian Digital Libraries, ICADL 2005 - Bangkok, Thailand
Duration: 2005 Dec 122005 Dec 15

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3815 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other8th International Conference on Asian Digital Libraries, ICADL 2005
CountryThailand
CityBangkok
Period05/12/1205/12/15

Fingerprint

Digital libraries
Categorization
Web-based
Computer science
Text Categorization
Digital Libraries
Date
Annotation
Person
Computer Science
Strings
Text
Experimental Results

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Chiao, H. C., Pu, H. T., & Chien, L. F. (2005). Annotating text segments using a web-based categorization approach. In Digital Libraries: Implementing Strategies and Sharing Experiences - 8th International Conference on Asian Digital Libraries, ICADL 2005, Proceedings (pp. 323-331). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3815 LNCS). https://doi.org/10.1007/11599517_37

Annotating text segments using a web-based categorization approach. / Chiao, Hsin Chen; Pu, Hsiao Tieh; Chien, Lee Feng.

Digital Libraries: Implementing Strategies and Sharing Experiences - 8th International Conference on Asian Digital Libraries, ICADL 2005, Proceedings. 2005. p. 323-331 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3815 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chiao, HC, Pu, HT & Chien, LF 2005, Annotating text segments using a web-based categorization approach. in Digital Libraries: Implementing Strategies and Sharing Experiences - 8th International Conference on Asian Digital Libraries, ICADL 2005, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 3815 LNCS, pp. 323-331, 8th International Conference on Asian Digital Libraries, ICADL 2005, Bangkok, Thailand, 05/12/12. https://doi.org/10.1007/11599517_37
Chiao HC, Pu HT, Chien LF. Annotating text segments using a web-based categorization approach. In Digital Libraries: Implementing Strategies and Sharing Experiences - 8th International Conference on Asian Digital Libraries, ICADL 2005, Proceedings. 2005. p. 323-331. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/11599517_37
Chiao, Hsin Chen ; Pu, Hsiao Tieh ; Chien, Lee Feng. / Annotating text segments using a web-based categorization approach. Digital Libraries: Implementing Strategies and Sharing Experiences - 8th International Conference on Asian Digital Libraries, ICADL 2005, Proceedings. 2005. pp. 323-331 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{bdb5f26148154a7aa5c636f29995af72,
title = "Annotating text segments using a web-based categorization approach",
abstract = "Conventional automatic text annotation tools mostly extract named entities from texts and annotate them with information about persons, locations, and dates, etc. Such kind of entity type information, however, is insufficient for machines to understand the context or facts contained in the texts. This paper presents a general text categorization approach to categorize text segments into broader subject categories, such as categorizing a text string into a category of paper title in Mathematics or a category of conference name in Computer Science. Experimental results confirm its wide applicability to various digital library applications.",
author = "Chiao, {Hsin Chen} and Pu, {Hsiao Tieh} and Chien, {Lee Feng}",
year = "2005",
month = "12",
day = "1",
doi = "10.1007/11599517_37",
language = "English",
isbn = "3540308504",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "323--331",
booktitle = "Digital Libraries",

}

TY - GEN

T1 - Annotating text segments using a web-based categorization approach

AU - Chiao, Hsin Chen

AU - Pu, Hsiao Tieh

AU - Chien, Lee Feng

PY - 2005/12/1

Y1 - 2005/12/1

N2 - Conventional automatic text annotation tools mostly extract named entities from texts and annotate them with information about persons, locations, and dates, etc. Such kind of entity type information, however, is insufficient for machines to understand the context or facts contained in the texts. This paper presents a general text categorization approach to categorize text segments into broader subject categories, such as categorizing a text string into a category of paper title in Mathematics or a category of conference name in Computer Science. Experimental results confirm its wide applicability to various digital library applications.

AB - Conventional automatic text annotation tools mostly extract named entities from texts and annotate them with information about persons, locations, and dates, etc. Such kind of entity type information, however, is insufficient for machines to understand the context or facts contained in the texts. This paper presents a general text categorization approach to categorize text segments into broader subject categories, such as categorizing a text string into a category of paper title in Mathematics or a category of conference name in Computer Science. Experimental results confirm its wide applicability to various digital library applications.

UR - http://www.scopus.com/inward/record.url?scp=33744932463&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33744932463&partnerID=8YFLogxK

U2 - 10.1007/11599517_37

DO - 10.1007/11599517_37

M3 - Conference contribution

AN - SCOPUS:33744932463

SN - 3540308504

SN - 9783540308508

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 323

EP - 331

BT - Digital Libraries

ER -