E-library page

AES E-Library

Single-Channel Audio Source Separation Using Deep Neural Network Ensembles

Deep neural networks (DNNs) are often used to tackle the single channel source separation (SCSS) problem by predicting time-frequency masks. The predicted masks are then used to separate the sources from the mixed signal. Different types of masks produce separated sources with different levels of distortion and interference. Some types of masks produce separated sources with low distortion, while other masks produce low interference between the separated sources. In this paper a combination of different DNNs’ predictions (masks) is used for SCSS to achieve better quality of the separated sources than using each DNN individually. We train four different DNNs by minimizing four different cost functions to predict four different masks. The first and second DNNs are trained to approximate reference binary and soft masks. The third DNN is trained to predict a mask from the reference sources directly. The last DNN is trained similarly to the third DNN but with an additional discriminative constraint to maximize the differences between the estimated sources. Our experimental results show that combining the predictions of different DNNs achieves separated sources with better quality than using each DNN individually.

Author (s): Grais, Emad M.; Roma, Gerard; Simpson, Andrew J. R.; Plumbley, Mark D.
Affiliation: University of Surrey, Guildford, Surrey, UK (See document for exact affiliation information.)
AES Convention: 140 Paper Number:9494
Publication Date: 2016-05-06 Import into BibTeX
Session subject: Audio Signal Processing: Coding, Encoding, and Perception
Permalink: https://aes2.org/publications/elibrary-page/?id=18193

(241KB)

This paper costs $33 for non-members and is free for AES members and E-Libary subscribers.

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type: Convention Paper
E-Libary location: (CD 140Papers) TMP/conv/140/

Learn more about the AES E-Library

AES Conventions

AES Conferences

AES Training & Development

Gift Membership

AES Membership Benefits

Gift Membership

AES Membership Benefits

Become a Sustaining Member

AES Membership Benefits

AES Inside Track

Current Standards

Standards Blog

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

AES E-Library

Single-Channel Audio Source Separation Using Deep Neural Network Ensembles

Choose your country of residence from this list:

AES E-Library

Login Institutions

Single-Channel Audio Source Separation Using Deep Neural Network Ensembles

Choose your country of residence from this list: