TY - GEN
T1 - Automatic Music Transcription Leveraging Generalized Cepstral Features and Deep Learning
AU - Wu, Yu Te
AU - Chen, Berlin
AU - Su, Li
N1 - Publisher Copyright: © 2018 IEEE.
PY - 2018/9/10
Y1 - 2018/9/10
AB - Spectral features are limited in modeling musical signals with multiple concurrent pitches because it is difficult to suppress the interference of the harmonic peaks of one pitch with another. In this paper, we show that combining multiple features represented in both the frequency and time domains with deep learning can reduce such interference. These features are derived systematically from conventional pitch detection functions that are related to one another through the discrete Fourier transform and a nonlinear scaling function. Neural networks trained on these features outperform state-of-the-art methods while using less training data.
KW - Automatic music transcription
KW - Cepstrum
KW - Convolutional neural networks
KW - Deep learning
UR - http://www.scopus.com/inward/record.url?scp=85054289665&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054289665&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2018.8462079
DO - 10.1109/ICASSP.2018.8462079
M3 - Conference contribution
AN - SCOPUS:85054289665
SN - 9781538646588
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 401
EP - 405
BT - 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018
Y2 - 15 April 2018 through 20 April 2018
ER -