E-library page

AES E-Library

Conditioned Source Separation by Attentively Aggregating Frequency Transformations With Self-Conditioning

Label-conditioned source separation extracts the target source, specified by an input symbol, from an input mixture track. A recently proposed label-conditioned source separation model called Latent Source Attentive Frequency Transformation (LaSAFT)--Gated Point-Wise Convolutional Modulation (GPoCM)--Net introduced a block for latent source analysis called LaSAFT. Employing LaSAFT blocks, it established state-of-the-art performance on several tasks of the MUSDB18 benchmark. This paper enhances the LaSAFT block by exploiting a self-conditioning method. Whereas the existing method only cares about the symbolic relationships between the target source symbol and latent sources, ignoring audio content, the new approach also considers audio content. The enhanced block computes the attention mask conditioning on the label and the input audio feature map. Here, it is shown that the conditioned U-Net employing the enhanced LaSAFT blocks outperforms the previous model. It is also shown that the present model performs the audio-query--based separation with a slight modification.

Author (s): Choi, Woosung; Jeong, Yeong-Seok; Kim, Jinsung; Chung, Jaehwa; Jung, Soonyoung; Reiss, Joshua D.
Affiliation: Department of Computer Science and Engineering, Korea University, Republic of Korea; Department of Computer Science and Engineering, Korea University, Republic of Korea; Department of Computer Science and Engineering, Korea University, Republic of Korea; Department of Computer Science, Korea National Open University, Republic of Korea; Department of Computer Science and Engineering, Korea University, Republic of Korea; Centre for Digital Music, Queen Mary University of London, London, UK (See document for exact affiliation information.)
Publication Date: 2022-09-06 Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=21880

(882KB)

This paper costs $33 for non-members and is free for AES members and E-Libary subscribers.

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type: Journal Article
E-Libary location: (CD JAES70) TMP/JAES70/9/

Learn more about the AES E-Library

AES Conventions

AES Conferences

AES Training & Development

Gift Membership

AES Membership Benefits

Gift Membership

AES Membership Benefits

Become a Sustaining Member

AES Membership Benefits

AES Inside Track

Current Standards

Standards Blog

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

AES E-Library

Conditioned Source Separation by Attentively Aggregating Frequency Transformations With Self-Conditioning

Choose your country of residence from this list:

AES E-Library

Login Institutions

Conditioned Source Separation by Attentively Aggregating Frequency Transformations With Self-Conditioning

Choose your country of residence from this list: