Automatic Music Transcription Leveraging Generalized Cepstral Features and Deep Learning

Yu Te Wu, Berlin Chen, Li Su

研究成果: 書貢獻/報告類型會議論文篇章

11 引文 斯高帕斯(Scopus)

摘要

Spectral features are limited in modeling musical signals with multiple concurrent pitches due to the challenge to suppress the interference of the harmonic peaks from one pitch to another. In this paper, we show that using multiple features represented in both the frequency and time domains with deep learning modeling can reduce such interference. These features are derived systematically from conventional pitch detection functions that relate to one another through the discrete Fourier transform and a nonlinear scaling function. Neural networks modeled with these features outperform state-of-the-art methods while using less training data.

原文英語
主出版物標題2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面401-405
頁數5
ISBN(列印)9781538646588
DOIs
出版狀態已發佈 - 2018 9月 10
事件2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Calgary, 加拿大
持續時間: 2018 4月 152018 4月 20

出版系列

名字ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
2018-April
ISSN(列印)1520-6149

會議

會議2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018
國家/地區加拿大
城市Calgary
期間2018/04/152018/04/20

ASJC Scopus subject areas

  • 軟體
  • 訊號處理
  • 電氣與電子工程

指紋

深入研究「Automatic Music Transcription Leveraging Generalized Cepstral Features and Deep Learning」主題。共同形成了獨特的指紋。

引用此