You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Noise reduction techniques based on deep learning have demonstrated impressive performance in enhancing the overall quality of recorded speech. While these approaches are highly performant, their application in audio engineering can be limited due to a number of factors. These include operation only on speech without support for music, lack of real-time capability, lack of interpretable control parameters, operation at lower sample rates, and a tendency to introduce artifacts. On the other hand, signal processing-based noise reduction algorithms offer fine-grained control and operation on a broad range of content, however, they often require manual operation to achieve the best results. To address the limitations of both approaches, in this work we introduce a method that leverages a signal processing-based denoiser that when combined with a neural network controller, enables fully automatic and high-fidelity noise reduction on both speech and music signals. We evaluate our proposed method with objective metrics and a perceptual listening test. Our evaluation reveals that speech enhancement models can be extended to music, however training the model to remove only stationary noise is critical. Furthermore, our proposed approach achieves performance on par with the deep learning models, while being significantly more efficient and introducing fewer artifacts in some cases. Listening examples are available online: https://tape.it/research/denoiser
Author (s): Steinmetz, Christian J.; Walther, Thomas; Reiss, Joshua D.
Affiliation:
Centre for Digital Music, Queen Mary University of London; Tape It Music GmbH; Centre for Digital Music, Queen Mary University of London
(See document for exact affiliation information.)
AES Convention: 155
Paper Number:153
Publication Date:
2023-10-06
Import into BibTeX
Session subject:
Signal Processing
Permalink: https://aes2.org/publications/elibrary-page/?id=22307
(564KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Steinmetz, Christian J.; Walther, Thomas; Reiss, Joshua D.; 2023; High-Fidelity Noise Reduction with Differentiable Signal Processing [PDF]; Centre for Digital Music, Queen Mary University of London; Tape It Music GmbH; Centre for Digital Music, Queen Mary University of London; Paper 153; Available from: https://aes2.org/publications/elibrary-page/?id=22307
Steinmetz, Christian J.; Walther, Thomas; Reiss, Joshua D.; High-Fidelity Noise Reduction with Differentiable Signal Processing [PDF]; Centre for Digital Music, Queen Mary University of London; Tape It Music GmbH; Centre for Digital Music, Queen Mary University of London; Paper 153; 2023 Available: https://aes2.org/publications/elibrary-page/?id=22307