AES E-Library

Audio Signal Processing in the Artificial Intelligence Era: Challenges and Directions

Artificial intelligence (AI) has seen significant advancement in recent years, leading to increasing interest in integrating these techniques to solve both existing and emerging problems in audio engineering. In this paper, the authors investigate current trends in the application of AI for audio engineering, outlining open problems and applications in the research field. The paper begins by providing an overview of AI-based algorithm development in the context of audio, discussing problem selection and taxonomy. Next, human-centric AI challenges and how they relate to audio engineering are explored, including ethics, trustworthiness, explainability, and interaction, emphasizing the need for ethically sound and human-centered AI systems. Subsequently, technical challenges that arise when applying modern AI techniques to audio are examined, including robust generalization, audio quality, high sample rates, and real-time processing with low latency. Finally, the authors outline applications of AI in audio engineering, covering the development of machine learning–powered audio effects, synthesizers, automated mixing systems, and spatial audio, speech enhancement, dialog separation, and music generation. Emphasized are the need for a balanced approach that integrates humancentric concerns with technological advancements, advocating for responsible and effective application of AI.

 

Author (s):
Affiliation: (See document for exact affiliation information.)
Publication Date:
Permalink: https://aes2.org/publications/elibrary-page/?id=22921


(836KB)


Download Now

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type:
E-Libary location:
16938
Choose your country of residence from this list:










Skip to content