A multi-level hierarchical index structure for supporting efficient similarity search on tag sets

Jia Ling Koh*, Nonhlanhla Shongwe, Chung Wen Cho

*此作品的通信作者

研究成果: 書貢獻/報告類型會議論文篇章

2 引文 斯高帕斯(Scopus)

摘要

Social communication websites has been an emerging type of a Web service that helps users to share their resources. For providing efficient similarity search of tag set in a social tagging system, we propose a multi-level hierarchical index structure to group similar tag sets. Not only the algorithms of similarity searches of tag sets, but also the algorithms of deletion and updating of tag sets by using the constructed index structure are provided. Furthermore, we define a modified hamming distance function on tag sets, which consider the semantically relatedness when comparing the members for evaluating the similarity of two tag sets. This function is more applicable to evaluate the similarity search of two tag sets. A systematic performance study is performed to verify the effectiveness and the efficiency of the proposed strategies. The experiment results show that the proposed MHIB approach further improves the pruning effect of the previous work which constructs a two-level index structure. Especially, the MHIB approach is well scalable with respect to the three parameters when using either the hamming distance or the modified hamming distance for similarity measure. Although the insertion operation of the MHIB approach requires higher cost than the naïve method, with the assistant of the constructed inverted list of clusters, it performs faster than the previous work. Besides, the cost of performing deletion operation by using the MHIB approach is much less than the other two approaches and so is the update operation.

原文英語
主出版物標題6th International Conference on Research Challenges in Information Science, RCIS 2012 - Conference Proceedings
DOIs
出版狀態已發佈 - 2012
事件6th International Conference on Research Challenges in Information Science, RCIS 2012 - Valencia, 西班牙
持續時間: 2012 5月 162012 5月 18

出版系列

名字Proceedings - International Conference on Research Challenges in Information Science
ISSN(列印)2151-1349
ISSN(電子)2151-1357

其他

其他6th International Conference on Research Challenges in Information Science, RCIS 2012
國家/地區西班牙
城市Valencia
期間2012/05/162012/05/18

ASJC Scopus subject areas

  • 電腦科學應用
  • 資訊系統
  • 軟體

指紋

深入研究「A multi-level hierarchical index structure for supporting efficient similarity search on tag sets」主題。共同形成了獨特的指紋。

引用此