Fast visual retrieval using accelerated sequence matching

Mei-Chen Yeh, Kwang Ting Cheng

Research output: Contribution to journalArticle

19 Citations (Scopus)

Abstract

We present an approach to represent, match, and index various types of visual data, with the primary goal of enabling effective and computationally efficient searches. In this approach, an image/video is represented by an ordered list of feature descriptors. Similarities between such representations are then measured by the approximate string matching technique. This approach unifies visual appearance and the ordering information in a holistic manner with joint consideration of visual-order consistency between the query and the reference instances, and can be used for automatically identifying local alignments between two pieces of visual data. This capability is essential for tasks such as video copy detection where only small portions of the query and the reference videos are similar. To deal with large volumes of data, we further show that this approach can be significantly accelerated along with a dedicated indexing structure. Extensive experiments on various visual retrieval and classification tasks demonstrate the superior performance of the proposed techniques compared to existing solutions.

Original languageEnglish
Article number5643930
Pages (from-to)320-329
Number of pages10
JournalIEEE Transactions on Multimedia
Volume13
Issue number2
DOIs
Publication statusPublished - 2011 Apr 1

Fingerprint

Experiments

Keywords

  • Image classification
  • similarity measure
  • string matching
  • video retrieval

ASJC Scopus subject areas

  • Signal Processing
  • Media Technology
  • Computer Science Applications
  • Electrical and Electronic Engineering

Cite this

Fast visual retrieval using accelerated sequence matching. / Yeh, Mei-Chen; Cheng, Kwang Ting.

In: IEEE Transactions on Multimedia, Vol. 13, No. 2, 5643930, 01.04.2011, p. 320-329.

Research output: Contribution to journalArticle

@article{ec4af1b6db504622bc58b4ccae22cc0c,
title = "Fast visual retrieval using accelerated sequence matching",
abstract = "We present an approach to represent, match, and index various types of visual data, with the primary goal of enabling effective and computationally efficient searches. In this approach, an image/video is represented by an ordered list of feature descriptors. Similarities between such representations are then measured by the approximate string matching technique. This approach unifies visual appearance and the ordering information in a holistic manner with joint consideration of visual-order consistency between the query and the reference instances, and can be used for automatically identifying local alignments between two pieces of visual data. This capability is essential for tasks such as video copy detection where only small portions of the query and the reference videos are similar. To deal with large volumes of data, we further show that this approach can be significantly accelerated along with a dedicated indexing structure. Extensive experiments on various visual retrieval and classification tasks demonstrate the superior performance of the proposed techniques compared to existing solutions.",
keywords = "Image classification, similarity measure, string matching, video retrieval",
author = "Mei-Chen Yeh and Cheng, {Kwang Ting}",
year = "2011",
month = "4",
day = "1",
doi = "10.1109/TMM.2010.2094999",
language = "English",
volume = "13",
pages = "320--329",
journal = "IEEE Transactions on Multimedia",
issn = "1520-9210",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "2",

}

TY - JOUR

T1 - Fast visual retrieval using accelerated sequence matching

AU - Yeh, Mei-Chen

AU - Cheng, Kwang Ting

PY - 2011/4/1

Y1 - 2011/4/1

N2 - We present an approach to represent, match, and index various types of visual data, with the primary goal of enabling effective and computationally efficient searches. In this approach, an image/video is represented by an ordered list of feature descriptors. Similarities between such representations are then measured by the approximate string matching technique. This approach unifies visual appearance and the ordering information in a holistic manner with joint consideration of visual-order consistency between the query and the reference instances, and can be used for automatically identifying local alignments between two pieces of visual data. This capability is essential for tasks such as video copy detection where only small portions of the query and the reference videos are similar. To deal with large volumes of data, we further show that this approach can be significantly accelerated along with a dedicated indexing structure. Extensive experiments on various visual retrieval and classification tasks demonstrate the superior performance of the proposed techniques compared to existing solutions.

AB - We present an approach to represent, match, and index various types of visual data, with the primary goal of enabling effective and computationally efficient searches. In this approach, an image/video is represented by an ordered list of feature descriptors. Similarities between such representations are then measured by the approximate string matching technique. This approach unifies visual appearance and the ordering information in a holistic manner with joint consideration of visual-order consistency between the query and the reference instances, and can be used for automatically identifying local alignments between two pieces of visual data. This capability is essential for tasks such as video copy detection where only small portions of the query and the reference videos are similar. To deal with large volumes of data, we further show that this approach can be significantly accelerated along with a dedicated indexing structure. Extensive experiments on various visual retrieval and classification tasks demonstrate the superior performance of the proposed techniques compared to existing solutions.

KW - Image classification

KW - similarity measure

KW - string matching

KW - video retrieval

UR - http://www.scopus.com/inward/record.url?scp=79952927234&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79952927234&partnerID=8YFLogxK

U2 - 10.1109/TMM.2010.2094999

DO - 10.1109/TMM.2010.2094999

M3 - Article

AN - SCOPUS:79952927234

VL - 13

SP - 320

EP - 329

JO - IEEE Transactions on Multimedia

JF - IEEE Transactions on Multimedia

SN - 1520-9210

IS - 2

M1 - 5643930

ER -