You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Artificial intelligence (AI) has seen significant advancement in recent years, leading to increasing interest in integrating these techniques to solve both existing and emerging problems in audio engineering. In this paper, the authors investigate current trends in the application of AI for audio engineering, outlining open problems and applications in the research field. The paper begins by providing an overview of AI-based algorithm development in the context of audio, discussing problem selection and taxonomy. Next, human-centric AI challenges and how they relate to audio engineering are explored, including ethics, trustworthiness, explainability, and interaction, emphasizing the need for ethically sound and human-centered AI systems. Subsequently, technical challenges that arise when applying modern AI techniques to audio are examined, including robust generalization, audio quality, high sample rates, and real-time processing with low latency. Finally, the authors outline applications of AI in audio engineering, covering the development of machine learning–powered audio effects, synthesizers, automated mixing systems, and spatial audio, speech enhancement, dialog separation, and music generation. Emphasized are the need for a balanced approach that integrates humancentric concerns with technological advancements, advocating for responsible and effective application of AI.
Author (s): Steinmetz, Christian J.; Uhle, Christian; Everardo, Flavio; Mitcheltree, Christopher; McElveen, J. Keith; Jot, Jean-Marc; Wichern, Gordon
Affiliation:
Centre for Digital Music, Queen Mary University of London, London, UK; Centre for Digital Music, Queen Mary University of London, London, UK; Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany; International Audio Laboratories Erlangen, Erlangen, Germany; Tecnológico de Monterrey Campus Puebla, Puebla, Mexico; Wave Sciences, Charleston, SC; Virtuel Works, LLC, Aptos, CA; Mitsubishi Electric Research Laboratories (MERL), Cambridge, MA
(See document for exact affiliation information.)
Publication Date:
2025-07-07
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=22921
(836KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Steinmetz, Christian J.; Uhle, Christian; Everardo, Flavio; Mitcheltree, Christopher; McElveen, J. Keith; Jot, Jean-Marc; Wichern, Gordon; 2025; Audio Signal Processing in the Artificial Intelligence Era: Challenges and Directions [PDF]; Centre for Digital Music, Queen Mary University of London, London, UK; Centre for Digital Music, Queen Mary University of London, London, UK; Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany; International Audio Laboratories Erlangen, Erlangen, Germany; Tecnológico de Monterrey Campus Puebla, Puebla, Mexico; Wave Sciences, Charleston, SC; Virtuel Works, LLC, Aptos, CA; Mitsubishi Electric Research Laboratories (MERL), Cambridge, MA; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=22921
Steinmetz, Christian J.; Uhle, Christian; Everardo, Flavio; Mitcheltree, Christopher; McElveen, J. Keith; Jot, Jean-Marc; Wichern, Gordon; Audio Signal Processing in the Artificial Intelligence Era: Challenges and Directions [PDF]; Centre for Digital Music, Queen Mary University of London, London, UK; Centre for Digital Music, Queen Mary University of London, London, UK; Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany; International Audio Laboratories Erlangen, Erlangen, Germany; Tecnológico de Monterrey Campus Puebla, Puebla, Mexico; Wave Sciences, Charleston, SC; Virtuel Works, LLC, Aptos, CA; Mitsubishi Electric Research Laboratories (MERL), Cambridge, MA; Paper ; 2025 Available: https://aes2.org/publications/elibrary-page/?id=22921
@article{steinmetz2025audio,
author={steinmetz christian j. and uhle christian and everardo flavio and mitcheltree christopher and mcelveen j. keith and jot jean-marc and wichern gordon},
journal={journal of the audio engineering society},
title={audio signal processing in the artificial intelligence era: challenges and directions},
year={2025},
volume={73},
issue={7/8},
pages={406-428},
month={july},}
TY – paper
TI – Audio Signal Processing in the Artificial Intelligence Era: Challenges and Directions
SP – 406 EP – 428
AU – Steinmetz, Christian J.
AU – Uhle, Christian
AU – Everardo, Flavio
AU – Mitcheltree, Christopher
AU – McElveen, J. Keith
AU – Jot, Jean-Marc
AU – Wichern, Gordon
PY – 2025
JO – Journal of the Audio Engineering Society
VO – 73
IS – 7/8
Y1 – July 2025