You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
The multiple-channel[1] sound source enhancement methods have made a great progress in recent years, espe-cially when combined with the learning-based algorithms. However, the performance of these techniques is lim-ited by the completeness of the training dataset, which may degrade in mismatched environments. In this paper, we propose a reconstruction Model based Self-supervised Learning (RMSL) method for sound source enhance-ment. A reconstruction module is used to integrate the estimated target signal and noise components to regenerate the multi-channel mixed signals, and it is connected with a separating model to form a closed loop.In this case, the optimization of the separation model can be achieved by continuously iterating the separation-reconstruction process. We use the separation error, the reconstruction error, and the signal-noise independence error as loss functions in the self-supervised learning process. This method is applied to the state-of-the-art sound source separation model (ADL-MVDR) and evaluated under different scenarios. Experimental results demonstrate that the proposed method can improve the performance of ADL-MVDR algorithm under different number of sound sources, bringing about 0.5 dB to 1 dB Si-SNR gain, while maintaining good clarity and intelligibility in practical application.
Author (s): Gao, Shan; Wu, Xihong; Qu, Tianshu
Affiliation:
Peking University; Peking University; Peking University
(See document for exact affiliation information.)
AES Convention: 156
Paper Number:10683
Publication Date:
2024-06-06
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=22496
(11218KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Gao, Shan; Wu, Xihong; Qu, Tianshu; 2024; DOA-Informed Self-Supervised Learning Method for Sound Source Enhancement [PDF]; Peking University; Peking University; Peking University; Paper 10683; Available from: https://aes2.org/publications/elibrary-page/?id=22496
Gao, Shan; Wu, Xihong; Qu, Tianshu; DOA-Informed Self-Supervised Learning Method for Sound Source Enhancement [PDF]; Peking University; Peking University; Peking University; Paper 10683; 2024 Available: https://aes2.org/publications/elibrary-page/?id=22496