Polyphonic Music Transcription with Semantic Segmentation

Yu Te Wu, Berlin Chen, Li Su

研究成果: 書貢獻/報告類型會議論文篇章

24 引文 斯高帕斯(Scopus)

摘要

The multi-instrument transcription task refers to joint recognition of instrument and pitch of every event in polyphonic music signals generated by one or more classes of music instruments. In this paper, we leverage multi-object semantic segmentation techniques to solve this problem. We design a time-frequency representation, which has multiple channels to jointly represent the harmonic structure and pitch saliency of a pitch activation. The transcription task therefore becomes a pixel-wise multi-task classification problem including pitch activity detection and instrument recognition. Experiments on both single- and multi-instrument data verify the competitiveness of the proposed method.

原文英語
主出版物標題2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面166-170
頁數5
ISBN(電子)9781479981311
DOIs
出版狀態已發佈 - 2019 5月
事件44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Brighton, 英国
持續時間: 2019 5月 122019 5月 17

出版系列

名字ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
2019-May
ISSN(列印)1520-6149

會議

會議44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019
國家/地區英国
城市Brighton
期間2019/05/122019/05/17

ASJC Scopus subject areas

  • 軟體
  • 訊號處理
  • 電氣與電子工程

指紋

深入研究「Polyphonic Music Transcription with Semantic Segmentation」主題。共同形成了獨特的指紋。

引用此