Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Deep learning approaches for beat and downbeat tracking have brought advancements. However, these approaches continue to rely on hand-crafted, subsampled spectral features as input, restricting the information available to the model. In this work, we propose WaveBeat, an end-to-end approach for joint beat and downbeat tracking operating directly on waveforms. This method forgoes engineered spectral features, and instead, produces beat and downbeat predictions directly from the waveform, the first of its kind for this task. Our model utilizes temporal convolutional networks (TCNs) operating on waveforms that achieve a very large receptive field (= 30 s) at audio sample rates in a memory efficient manner by employing rapidly growing dilation factors with fewer layers. With a straightforward data augmentation strategy, our method outperforms previous state-of-the-art methods on some datasets, while producing comparable results on others, demonstrating the potential for time domain approaches.
Author (s): Steinmetz, Christian J.; Reiss, Joshua D.
Affiliation:
Queen Mary University of London, UK
(See document for exact affiliation information.)
AES Convention: 151
Paper Number:655
Publication Date:
2021-10-06
Import into BibTeX
Session subject:
Applications in audio
Permalink: https://aes2.org/publications/elibrary-page/?id=21578
(280KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Steinmetz, Christian J.; Reiss, Joshua D.; 2021; WaveBeat: End-to-end beat and downbeat tracking in the time domain [PDF]; Queen Mary University of London, UK; Paper 655; Available from: https://aes2.org/publications/elibrary-page/?id=21578
Steinmetz, Christian J.; Reiss, Joshua D.; WaveBeat: End-to-end beat and downbeat tracking in the time domain [PDF]; Queen Mary University of London, UK; Paper 655; 2021 Available: https://aes2.org/publications/elibrary-page/?id=21578