Hierarchical topic-based communities construction for authors in a literature database

Chien Liang Wu, Jia Ling Koh

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

In this paper, given a set of research papers with only title and author information, a mining strategy is proposed to discover and organize the communities of authors according to both the co-author relationships and research topics of their published papers. The proposed method applies the CONGA algorithm to discover collaborative communities from the network constructed from the co-author relationship. To further group the collaborative communities of authors according to research interests, the CiteSeerX is used as an external source to discover the hidden hierarchical relationships among the topics covered by the papers. In order to evaluate whether the constructed topic-based collaborative community is semantically meaningful, the first part of evaluation is to measure the consistency between the terms appearing in the published papers of a topic-based collaborative community and the terms in the documents related to the specific topic retrieved from other external source. The experimental results show that 81.61% of the topic-based collaborative communities satisfy the consistency requirement. On the other hand, the accuracy of the discovered sub-concept relationship is verified by checking the Wikipedia categories. It is shown that 75.96% of the sub-concept terms are properly assigned in the concept hierarchy.

Original languageEnglish
Title of host publicationTrends in Applied Intelligent Systems - 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, Proceedings
Pages514-524
Number of pages11
EditionPART 2
DOIs
Publication statusPublished - 2010 Dec 1
Event23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligence Systems, IEA/AIE 2010 - Cordoba, Spain
Duration: 2010 Jun 12010 Jun 4

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume6097 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligence Systems, IEA/AIE 2010
CountrySpain
CityCordoba
Period10/6/110/6/4

Fingerprint

Term
Concept Hierarchy
Wikipedia
Community
Mining
Relationships
Evaluate
Requirements
Evaluation
Experimental Results
Concepts
Strategy

Keywords

  • Bibliographic database
  • Community Mining
  • Social Network

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Wu, C. L., & Koh, J. L. (2010). Hierarchical topic-based communities construction for authors in a literature database. In Trends in Applied Intelligent Systems - 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, Proceedings (PART 2 ed., pp. 514-524). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6097 LNAI, No. PART 2). https://doi.org/10.1007/978-3-642-13025-0_53

Hierarchical topic-based communities construction for authors in a literature database. / Wu, Chien Liang; Koh, Jia Ling.

Trends in Applied Intelligent Systems - 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, Proceedings. PART 2. ed. 2010. p. 514-524 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6097 LNAI, No. PART 2).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Wu, CL & Koh, JL 2010, Hierarchical topic-based communities construction for authors in a literature database. in Trends in Applied Intelligent Systems - 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, Proceedings. PART 2 edn, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 2, vol. 6097 LNAI, pp. 514-524, 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligence Systems, IEA/AIE 2010, Cordoba, Spain, 10/6/1. https://doi.org/10.1007/978-3-642-13025-0_53
Wu CL, Koh JL. Hierarchical topic-based communities construction for authors in a literature database. In Trends in Applied Intelligent Systems - 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, Proceedings. PART 2 ed. 2010. p. 514-524. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 2). https://doi.org/10.1007/978-3-642-13025-0_53
Wu, Chien Liang ; Koh, Jia Ling. / Hierarchical topic-based communities construction for authors in a literature database. Trends in Applied Intelligent Systems - 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, Proceedings. PART 2. ed. 2010. pp. 514-524 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 2).
@inproceedings{946fe25e6d004e79b30058e19826181a,
title = "Hierarchical topic-based communities construction for authors in a literature database",
abstract = "In this paper, given a set of research papers with only title and author information, a mining strategy is proposed to discover and organize the communities of authors according to both the co-author relationships and research topics of their published papers. The proposed method applies the CONGA algorithm to discover collaborative communities from the network constructed from the co-author relationship. To further group the collaborative communities of authors according to research interests, the CiteSeerX is used as an external source to discover the hidden hierarchical relationships among the topics covered by the papers. In order to evaluate whether the constructed topic-based collaborative community is semantically meaningful, the first part of evaluation is to measure the consistency between the terms appearing in the published papers of a topic-based collaborative community and the terms in the documents related to the specific topic retrieved from other external source. The experimental results show that 81.61{\%} of the topic-based collaborative communities satisfy the consistency requirement. On the other hand, the accuracy of the discovered sub-concept relationship is verified by checking the Wikipedia categories. It is shown that 75.96{\%} of the sub-concept terms are properly assigned in the concept hierarchy.",
keywords = "Bibliographic database, Community Mining, Social Network",
author = "Wu, {Chien Liang} and Koh, {Jia Ling}",
year = "2010",
month = "12",
day = "1",
doi = "10.1007/978-3-642-13025-0_53",
language = "English",
isbn = "3642130240",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
number = "PART 2",
pages = "514--524",
booktitle = "Trends in Applied Intelligent Systems - 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, Proceedings",
edition = "PART 2",

}

TY - GEN

T1 - Hierarchical topic-based communities construction for authors in a literature database

AU - Wu, Chien Liang

AU - Koh, Jia Ling

PY - 2010/12/1

Y1 - 2010/12/1

N2 - In this paper, given a set of research papers with only title and author information, a mining strategy is proposed to discover and organize the communities of authors according to both the co-author relationships and research topics of their published papers. The proposed method applies the CONGA algorithm to discover collaborative communities from the network constructed from the co-author relationship. To further group the collaborative communities of authors according to research interests, the CiteSeerX is used as an external source to discover the hidden hierarchical relationships among the topics covered by the papers. In order to evaluate whether the constructed topic-based collaborative community is semantically meaningful, the first part of evaluation is to measure the consistency between the terms appearing in the published papers of a topic-based collaborative community and the terms in the documents related to the specific topic retrieved from other external source. The experimental results show that 81.61% of the topic-based collaborative communities satisfy the consistency requirement. On the other hand, the accuracy of the discovered sub-concept relationship is verified by checking the Wikipedia categories. It is shown that 75.96% of the sub-concept terms are properly assigned in the concept hierarchy.

AB - In this paper, given a set of research papers with only title and author information, a mining strategy is proposed to discover and organize the communities of authors according to both the co-author relationships and research topics of their published papers. The proposed method applies the CONGA algorithm to discover collaborative communities from the network constructed from the co-author relationship. To further group the collaborative communities of authors according to research interests, the CiteSeerX is used as an external source to discover the hidden hierarchical relationships among the topics covered by the papers. In order to evaluate whether the constructed topic-based collaborative community is semantically meaningful, the first part of evaluation is to measure the consistency between the terms appearing in the published papers of a topic-based collaborative community and the terms in the documents related to the specific topic retrieved from other external source. The experimental results show that 81.61% of the topic-based collaborative communities satisfy the consistency requirement. On the other hand, the accuracy of the discovered sub-concept relationship is verified by checking the Wikipedia categories. It is shown that 75.96% of the sub-concept terms are properly assigned in the concept hierarchy.

KW - Bibliographic database

KW - Community Mining

KW - Social Network

UR - http://www.scopus.com/inward/record.url?scp=79551542903&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79551542903&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-13025-0_53

DO - 10.1007/978-3-642-13025-0_53

M3 - Conference contribution

AN - SCOPUS:79551542903

SN - 3642130240

SN - 9783642130243

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 514

EP - 524

BT - Trends in Applied Intelligent Systems - 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, Proceedings

ER -