Audio Fingerprint Extraction for Content Identification

Yu Shiu*, Chia Hung Yeh, C. C.Jay Kuo

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

1 Citation (Scopus)


In this work, we present an audio content identification system that identifies some unknown audio material by comparing its fingerprint with those extracted off-line and saved in the music database. We will describe in detail the procedure to extract audio fingerprints and demonstrate that they are robust to noise and content-preserving manipulations. The main feature in the proposed system is the zero-crossing rate extracted with the octave-band filter bank. The zero-crossing rate can be used to describe the dominant frequency in each subband with a very low computational cost. The size of audio fingerprint is small and can be efficiently stored along with the compressed files in the database. It is also robust to many modifications such as tempo change and time-alignment distortion. Besides, the octave-band filter bank is used to enhance the robustness to distortion, especially those localized on some frequency regions.

Original languageEnglish
Pages (from-to)55-64
Number of pages10
JournalProceedings of SPIE - The International Society for Optical Engineering
Publication statusPublished - 2003
Externally publishedYes
EventInternet Multimedia Management Systems IV - Orlando, FL, United States
Duration: 2003 Sept 92003 Sept 11


  • Audio Database Management
  • Audio Fingerprint
  • Audio Identification
  • Audio Processing
  • Zero-crossing Rate

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering


Dive into the research topics of 'Audio Fingerprint Extraction for Content Identification'. Together they form a unique fingerprint.

Cite this