Electronic dance music (EDM), produced using computers and electronic instruments, is a collection of musical sub-genres that emphasise timbre and rhythm over melody and harmony. It is usually presented through the medium of DJing, where tracks are curated and mixed sequentially to offer unique listening and dancing experiences. However, whereas key and tempo can be read from annotations, DJs still rely on audition rather than metadata to examine and select tracks with complementary audio content. In this work, we investigate the use of deep learning-based representations (Complex Autoencoder and OpenL3) for analysing and visualising audio content on a corpus of DJ mixes with approximate transition timestamps and compare them with signal processing-based representations (joint time-frequency scattering transform and mel-frequency cepstral coefficients). Representations are computed once per second and visualised with UMAP dimensionality reduction. We propose heuristics based on the identification of observed patterns in visualisations and time-sensitive Euclidean distances in the representation space to compute DJ transition lengths, transition smoothness, and inter-song, song-to-song, and full-mix audio content consistency using audio representations along with rough DJ transition timestamps. Our method enables the visualisation of variations within music tracks, facilitating the analysis of DJ mixes and individual EDM tracks. This approach supports musicians in making informed creative decisions based on such visualisations. We share our code, dataset annotations, computed audio representations, and trained CAE model. We encourage researchers and music enthusiasts alike to analyse their own music using our tools: https://github.com/alexjameswilliams/EDMAudioRepresentations.
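To make the idea of Euclidean distances over per-second representations concrete, the following is a minimal sketch and not the authors' implementation (their code is in the repository linked above). It assumes each second of audio is already summarised as a fixed-length feature vector; the function names, the `lag` and `half_window` parameters, and the toy data are all hypothetical and chosen for illustration only.

```python
import math


def lagged_distances(frames, lag=1):
    """Euclidean distance between each per-second frame and the frame
    `lag` seconds later (hypothetical helper, not from the paper)."""
    if lag < 1:
        raise ValueError("lag must be >= 1")
    return [
        math.dist(frames[i], frames[i + lag])
        for i in range(len(frames) - lag)
    ]


def mean_distance_around(frames, timestamp, half_window=2, lag=1):
    """Mean lagged distance in a window centred on an approximate
    transition timestamp, given in whole seconds (assumed heuristic)."""
    distances = lagged_distances(frames, lag)
    start = max(0, timestamp - half_window)
    stop = min(len(distances), timestamp + half_window + 1)
    window = distances[start:stop]
    if not window:
        raise ValueError("timestamp lies outside the analysed range")
    return sum(window) / len(window)


# Toy data: four one-second frames in a 2-D representation space,
# with an abrupt change between the second and third frame.
frames = [(0.0, 0.0), (0.0, 0.0), (3.0, 4.0), (3.0, 4.0)]
per_second_change = lagged_distances(frames)
change_near_transition = mean_distance_around(frames, timestamp=1, half_window=1)
```

Under these assumptions, a low mean distance around a transition timestamp would suggest a smoother change in audio content, while a sharp peak would suggest an abrupt one. How the paper actually weights time and defines smoothness is described in the paper itself.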
Author(s): Barthet, Mathieu; Lattner, Stefan; Saitis, Charalampos; Tian, Haokun; Williams, Alexander
Affiliation:
Queen Mary University of London, London, UK; Sony CLS, Paris, France; Queen Mary University of London, London, UK; Queen Mary University of London, London, UK; Queen Mary University of London, London, UK
(See document for exact affiliation information.)
Publication Date:
2024-05-28
Permalink: https://aes2.org/publications/elibrary-page/?id=22438
Barthet, Mathieu; Lattner, Stefan; Saitis, Charalampos; Tian, Haokun; Williams, Alexander; 2024; Deep Learning-based Audio Representations for the Analysis and Visualisation of Electronic Dance Music DJ Mixes [PDF]; Queen Mary University of London; Sony CLS, Paris, France; Queen Mary University London, London, UK; Queen Mary University London, London, UK; Queen Mary University London, London, UK; Paper 5; Available from: https://aes2.org/publications/elibrary-page/?id=22438
Barthet, Mathieu; Lattner, Stefan; Saitis, Charalampos; Tian, Haokun; Williams, Alexander; Deep Learning-based Audio Representations for the Analysis and Visualisation of Electronic Dance Music DJ Mixes [PDF]; Queen Mary University of London; Sony CLS, Paris, France; Queen Mary University London, London, UK; Queen Mary University London, London, UK; Queen Mary University London, London, UK; Paper 5; 2024 Available: https://aes2.org/publications/elibrary-page/?id=22438
@article{barthet2024deep,
author={Barthet, Mathieu and Lattner, Stefan and Saitis, Charalampos and Tian, Haokun and Williams, Alexander},
journal={Journal of the Audio Engineering Society},
title={Deep Learning-based Audio Representations for the Analysis and Visualisation of Electronic Dance Music {DJ} Mixes},
year={2024},
number={5},
month={may},
}
TY – paper
TI – Deep Learning-based Audio Representations for the Analysis and Visualisation of Electronic Dance Music DJ Mixes
AU – Barthet, Mathieu
AU – Lattner, Stefan
AU – Saitis, Charalampos
AU – Tian, Haokun
AU – Williams, Alexander
PY – 2024
JO – Journal of the Audio Engineering Society
VL – 5
Y1 – May 2024