AES E-Library

A Machine Learning approach to modal control in small rooms

Reducing the decay time of low-frequency resonances in small rooms remains a critical challenge in audio engineering, as these resonances can degrade sound quality, leading to a "boomy" or "muddy" audio experience. Traditional equalization methods, while effective in certain contexts, rely heavily on static equalization filters, making them suboptimal for complex scenarios. This paper explores a novel machine learning-based approach to suppress low-frequency resonances by leveraging a modified Wave-U-Net architecture. The model incorporates psychoacoustic principles, specifically perceptual modal thresholds, to ensure that resonance suppression is guided by human auditory perception while preserving the natural characteristics of the audio. A large dataset of synthetic Room Impulse Responses was generated using a physically informed parametric model for cuboid rooms with randomized dimensions and absorption profiles. The model was trained on amplitude-modulated white noise bursts to simulate broadband excitation. Evaluation on unseen synthetic data demonstrates effective suppression of low-frequency resonances below perceptual thresholds. Limitations in generalization are discussed, along with future directions that include retraining on musical stimuli and incorporating perceptual loss metrics.

 

Author (s):
Affiliation: (See document for exact affiliation information.)
Publication Date:
Session subject:
Permalink: https://aes2.org/publications/elibrary-page/?id=23015


(1359KB)


Download Now

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type:
E-Libary location:
16938
Choose your country of residence from this list:










Skip to content