We present the engineering underlying a consumer application that helps music industry professionals find audio clips and samples of personal interest within their large audio libraries, which typically consist of heterogeneously labeled clips supplied by various vendors. We enable users to train an indexing system on their own custom tags (e.g., instruments, genres, moods) by means of convolutional neural networks operating on spectrograms. Since the intended users are not data scientists and may not possess the required computational resources (i.e., graphics processing units, GPUs), our primary contributions are (i) an intuitive user experience for a local client application that helps users create representative spectrogram datasets, and (ii) "seamless" integration with a cloud-based GPU server for efficient neural network training.
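To make the "spectrogram" input concrete, the following is a minimal sketch of how an audio clip can be turned into a log-magnitude spectrogram, the kind of 2-D array a convolutional network could consume. It is not the authors' pipeline, which this page does not describe: the function name `log_spectrogram`, the frame length, the hop size, and the 1 kHz test tone are all illustrative assumptions.

```python
import numpy as np


def log_spectrogram(signal, frame_len=1024, hop=256):
    """Return a log-magnitude spectrogram of shape (n_freq_bins, n_frames).

    Hypothetical helper for illustration; frame_len and hop are assumed values.
    """
    signal = np.asarray(signal, dtype=np.float64)
    # Pad very short clips so that at least one full frame exists.
    if signal.size < frame_len:
        signal = np.pad(signal, (0, frame_len - signal.size))
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    # Slice the clip into overlapping, windowed frames.
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # Real FFT per frame gives frame_len // 2 + 1 frequency bins.
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Convert to decibels; the small offset avoids log(0).
    return (20.0 * np.log10(magnitude + 1e-10)).T


# Synthetic one-second 1 kHz tone standing in for a real audio clip.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000.0 * t)

spec = log_spectrogram(tone)
peak_bin = int(np.argmax(spec.mean(axis=1)))
peak_hz = peak_bin * sample_rate / 1024
print(spec.shape, peak_hz)
```

In a tagging workflow of the kind the abstract outlines, arrays like `spec` would be saved per clip alongside the user's tag and used as training examples; how the application itself stores or augments them is not stated here.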
Authors: Hawley, Scott; Bagley, Jason; Porter, Brett; Traynham, Daisey
Affiliations: Belmont University, Nashville, TN, USA; Art+Logic, Pasadena, CA, USA; Art+Logic, Fanwood, NJ, USA
(See document for exact affiliation information.)
AES Convention: 147
Paper Number: 562
Publication Date: 2019-10-06
Session subject: Applications in Audio
Permalink: https://aes2.org/publications/elibrary-page/?id=20585

Hawley, Scott; Bagley, Jason; Porter, Brett; Traynham, Daisey; 2019; Vibrary: A Consumer-Trainable Music Tagging Utility; Belmont University, Nashville, TN, USA; Art+Logic, Pasadena, CA, USA; Art+Logic, Fanwood, NJ, USA; Paper 562; Available from: https://aes2.org/publications/elibrary-page/?id=20585