You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
In recent years, machine learning has been widely adopted to automate the audio mixing process. Automatic mixing systems have been applied to various audio effects such as gain adjustment, equalization, and reverberation. These systems can be controlled through visual interfaces, audio examples being provided, usage of knobs, and semantic descriptors. Using semantic descriptors or textual information to control these systems is an effective way for artists to communicate their creative goals. In this paper, the novel idea of using word embeddings to represent semantic descriptors is explored. Word embeddings are generally obtained by training neural networks on large corpora of written text. These embeddings serve as the input layer of the neural network to create a translation from words to equalizer (EQ) settings. Using this technique, the machine learning model can also generate EQ settings for semantic descriptors that it has not seen before. The EQ settings of humans are compared with the predictions of the neural network to evaluate the quality of predictions. The results showed that the embedding layer enables the neural network to understand semantic descriptors. It was observed that the models with embedding layers perform better than those without embedding layers but still not as well as human labels.
Author (s): Venkatesh, Satvik; Moffat, David; Miranda, Eduardo Reck
Affiliation:
Interdisciplinary Centre for Computer Music Research, University of Plymouth, Plymouth, UK; Plymouth Marine Laboratory, Plymouth, UK; Interdisciplinary Centre for Computer Music Research, University of Plymouth, Plymouth, UK
(See document for exact affiliation information.)
Publication Date:
2022-09-06
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=21887
(831KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Venkatesh, Satvik; Moffat, David; Miranda, Eduardo Reck; 2022; Word Embeddings for Automatic Equalization in Audio Mixing [PDF]; Interdisciplinary Centre for Computer Music Research, University of Plymouth, Plymouth, UK; Plymouth Marine Laboratory, Plymouth, UK; Interdisciplinary Centre for Computer Music Research, University of Plymouth, Plymouth, UK; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=21887
Venkatesh, Satvik; Moffat, David; Miranda, Eduardo Reck; Word Embeddings for Automatic Equalization in Audio Mixing [PDF]; Interdisciplinary Centre for Computer Music Research, University of Plymouth, Plymouth, UK; Plymouth Marine Laboratory, Plymouth, UK; Interdisciplinary Centre for Computer Music Research, University of Plymouth, Plymouth, UK; Paper ; 2022 Available: https://aes2.org/publications/elibrary-page/?id=21887
@article{venkatesh2022word,
author={venkatesh satvik and moffat david and miranda eduardo reck},
journal={journal of the audio engineering society},
title={word embeddings for automatic equalization in audio mixing},
year={2022},
volume={70},
issue={9},
pages={753-763},
month={september},}