Structure clustering for Chinese patent documents

Su Hsien Huang, Hao-Ren Ke, Wei Pang Yang

Research output: Contribution to journalArticle

22 Citations (Scopus)

Abstract

This paper aims to cluster Chinese patent documents with the structures. Both the explicit and implicit structures are analyzed to represent by the proposed structure expression. Accordingly, an unsupervised clustering algorithm called structured self-organizing map (SOM) is adopted to cluster Chinese patent documents with both similar content and structure. Structured SOM clusters the similar content of each sub-part structure, and then propagates the similarity to upper level ones. Experimental result showed the maps size and number of patents are proportional to the computing time, which implies the width and depth of structure affects the performance of structured SOM. Structured clustering of patents is helpful in many applications. In the lawsuit of copyright, companies are easy to find claim conflict in the existent patents to contradict the accusation. Moreover, decision-maker of a company can be advised to avoid hot-spot aspects of patents, which can save a lot of R&D effort.

Original languageEnglish
Pages (from-to)2290-2297
Number of pages8
JournalExpert Systems with Applications
Volume34
Issue number4
DOIs
Publication statusPublished - 2008 May 1

Fingerprint

Self organizing maps
Clustering algorithms
Industry

Keywords

  • Chinese patent
  • Metadata
  • Structure clustering
  • Structure expression

ASJC Scopus subject areas

  • Engineering(all)
  • Computer Science Applications
  • Artificial Intelligence

Cite this

Structure clustering for Chinese patent documents. / Huang, Su Hsien; Ke, Hao-Ren; Yang, Wei Pang.

In: Expert Systems with Applications, Vol. 34, No. 4, 01.05.2008, p. 2290-2297.

Research output: Contribution to journalArticle

Huang, Su Hsien ; Ke, Hao-Ren ; Yang, Wei Pang. / Structure clustering for Chinese patent documents. In: Expert Systems with Applications. 2008 ; Vol. 34, No. 4. pp. 2290-2297.
@article{68641fcd0a90444ea98eb0a32f7e343c,
title = "Structure clustering for Chinese patent documents",
abstract = "This paper aims to cluster Chinese patent documents with the structures. Both the explicit and implicit structures are analyzed to represent by the proposed structure expression. Accordingly, an unsupervised clustering algorithm called structured self-organizing map (SOM) is adopted to cluster Chinese patent documents with both similar content and structure. Structured SOM clusters the similar content of each sub-part structure, and then propagates the similarity to upper level ones. Experimental result showed the maps size and number of patents are proportional to the computing time, which implies the width and depth of structure affects the performance of structured SOM. Structured clustering of patents is helpful in many applications. In the lawsuit of copyright, companies are easy to find claim conflict in the existent patents to contradict the accusation. Moreover, decision-maker of a company can be advised to avoid hot-spot aspects of patents, which can save a lot of R&D effort.",
keywords = "Chinese patent, Metadata, Structure clustering, Structure expression",
author = "Huang, {Su Hsien} and Hao-Ren Ke and Yang, {Wei Pang}",
year = "2008",
month = "5",
day = "1",
doi = "10.1016/j.eswa.2007.03.012",
language = "English",
volume = "34",
pages = "2290--2297",
journal = "Expert Systems with Applications",
issn = "0957-4174",
publisher = "Elsevier Limited",
number = "4",

}

TY - JOUR

T1 - Structure clustering for Chinese patent documents

AU - Huang, Su Hsien

AU - Ke, Hao-Ren

AU - Yang, Wei Pang

PY - 2008/5/1

Y1 - 2008/5/1

N2 - This paper aims to cluster Chinese patent documents with the structures. Both the explicit and implicit structures are analyzed to represent by the proposed structure expression. Accordingly, an unsupervised clustering algorithm called structured self-organizing map (SOM) is adopted to cluster Chinese patent documents with both similar content and structure. Structured SOM clusters the similar content of each sub-part structure, and then propagates the similarity to upper level ones. Experimental result showed the maps size and number of patents are proportional to the computing time, which implies the width and depth of structure affects the performance of structured SOM. Structured clustering of patents is helpful in many applications. In the lawsuit of copyright, companies are easy to find claim conflict in the existent patents to contradict the accusation. Moreover, decision-maker of a company can be advised to avoid hot-spot aspects of patents, which can save a lot of R&D effort.

AB - This paper aims to cluster Chinese patent documents with the structures. Both the explicit and implicit structures are analyzed to represent by the proposed structure expression. Accordingly, an unsupervised clustering algorithm called structured self-organizing map (SOM) is adopted to cluster Chinese patent documents with both similar content and structure. Structured SOM clusters the similar content of each sub-part structure, and then propagates the similarity to upper level ones. Experimental result showed the maps size and number of patents are proportional to the computing time, which implies the width and depth of structure affects the performance of structured SOM. Structured clustering of patents is helpful in many applications. In the lawsuit of copyright, companies are easy to find claim conflict in the existent patents to contradict the accusation. Moreover, decision-maker of a company can be advised to avoid hot-spot aspects of patents, which can save a lot of R&D effort.

KW - Chinese patent

KW - Metadata

KW - Structure clustering

KW - Structure expression

UR - http://www.scopus.com/inward/record.url?scp=38649115318&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=38649115318&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2007.03.012

DO - 10.1016/j.eswa.2007.03.012

M3 - Article

AN - SCOPUS:38649115318

VL - 34

SP - 2290

EP - 2297

JO - Expert Systems with Applications

JF - Expert Systems with Applications

SN - 0957-4174

IS - 4

ER -