Audio Fingerprint Extraction for Content Identification

Yu Shiu*, Chia Hung Yeh, C. C.Jay Kuo

*此作品的通信作者

研究成果: 雜誌貢獻會議論文同行評審

1 引文 斯高帕斯(Scopus)

摘要

In this work, we present an audio content identification system that identifies some unknown audio material by comparing its fingerprint with those extracted off-line and saved in the music database. We will describe in detail the procedure to extract audio fingerprints and demonstrate that they are robust to noise and content-preserving manipulations. The main feature in the proposed system is the zero-crossing rate extracted with the octave-band filter bank. The zero-crossing rate can be used to describe the dominant frequency in each subband with a very low computational cost. The size of audio fingerprint is small and can be efficiently stored along with the compressed files in the database. It is also robust to many modifications such as tempo change and time-alignment distortion. Besides, the octave-band filter bank is used to enhance the robustness to distortion, especially those localized on some frequency regions.

原文英語
頁(從 - 到)55-64
頁數10
期刊Proceedings of SPIE - The International Society for Optical Engineering
5242
DOIs
出版狀態已發佈 - 2003
對外發佈
事件Internet Multimedia Management Systems IV - Orlando, FL, 美国
持續時間: 2003 9月 92003 9月 11

ASJC Scopus subject areas

  • 電子、光磁材料
  • 凝聚態物理學
  • 電腦科學應用
  • 應用數學
  • 電氣與電子工程

指紋

深入研究「Audio Fingerprint Extraction for Content Identification」主題。共同形成了獨特的指紋。

引用此