TY - JOUR
T1 - Structure clustering for Chinese patent documents
AU - Huang, Su Hsien
AU - Ke, Hao Ren
AU - Yang, Wei Pang
N1 - Funding Information:
This research was supported by the Software Technology for Advanced Network Application project of Institute for Information Industry in 2004 and sponsored by MOEA, ROC.
PY - 2008/5
Y1 - 2008/5
N2 - This paper aims to cluster Chinese patent documents with the structures. Both the explicit and implicit structures are analyzed to represent by the proposed structure expression. Accordingly, an unsupervised clustering algorithm called structured self-organizing map (SOM) is adopted to cluster Chinese patent documents with both similar content and structure. Structured SOM clusters the similar content of each sub-part structure, and then propagates the similarity to upper level ones. Experimental result showed the maps size and number of patents are proportional to the computing time, which implies the width and depth of structure affects the performance of structured SOM. Structured clustering of patents is helpful in many applications. In the lawsuit of copyright, companies are easy to find claim conflict in the existent patents to contradict the accusation. Moreover, decision-maker of a company can be advised to avoid hot-spot aspects of patents, which can save a lot of R&D effort.
AB - This paper aims to cluster Chinese patent documents with the structures. Both the explicit and implicit structures are analyzed to represent by the proposed structure expression. Accordingly, an unsupervised clustering algorithm called structured self-organizing map (SOM) is adopted to cluster Chinese patent documents with both similar content and structure. Structured SOM clusters the similar content of each sub-part structure, and then propagates the similarity to upper level ones. Experimental result showed the maps size and number of patents are proportional to the computing time, which implies the width and depth of structure affects the performance of structured SOM. Structured clustering of patents is helpful in many applications. In the lawsuit of copyright, companies are easy to find claim conflict in the existent patents to contradict the accusation. Moreover, decision-maker of a company can be advised to avoid hot-spot aspects of patents, which can save a lot of R&D effort.
KW - Chinese patent
KW - Metadata
KW - Structure clustering
KW - Structure expression
UR - http://www.scopus.com/inward/record.url?scp=38649115318&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38649115318&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2007.03.012
DO - 10.1016/j.eswa.2007.03.012
M3 - Article
AN - SCOPUS:38649115318
SN - 0957-4174
VL - 34
SP - 2290
EP - 2297
JO - Expert Systems with Applications
JF - Expert Systems with Applications
IS - 4
ER -