E-library page

AES E-Library

Deep Learning Based Voice Extraction and Primary-Ambience Decomposition for Stereo to Surround Upmixing

Surround systems have gained popularity in home entertainment despite the fact that most of the cinematic content is delivered in two-channel stereo format. Although there are several upmixing options, it has proven challenging to deliver an upmixed signal that approximates the original directionality and timbre intended by the mixing artist. The aim of this work is to design a two-to-five channels upmixer using a novel upmixing strategy combining voice extraction and primary-ambience decomposition. Results from a modified-MUSHRA test show that our proposed upmixer outperforms established alternatives for cinematic upmixing in perceived spatial and timbral quality.

Author (s): Paez Amaro, Ricardo Thaddeus; Tejeda Ocampo, Carlos; Souza Blanes, Ema; Bharitkar, Sunil; Madrid Herrera, Luis
Affiliation: Samsung Research Tijuana, Mexico; Samsung Research Tijuana, Mexico; Samsung Research America, Mountain View, CA, USA; Samsung Research America, Mountain View, CA, USA; Samsung Research Tijuana, Mexico (See document for exact affiliation information.)
AES Convention: 154 Paper Number:62
Publication Date: 2023-05-06 Import into BibTeX
Session subject: Neural Networks
Permalink: https://aes2.org/publications/elibrary-page/?id=22087

(886KB)

This paper costs $33 for non-members and is free for AES members and E-Libary subscribers.

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type: Express Paper
E-Libary location: TMP/conv/154/

Learn more about the AES E-Library

AES Conventions

AES Conferences

AES Training & Development

Gift Membership

AES Membership Benefits

Gift Membership

AES Membership Benefits

Become a Sustaining Member

AES Membership Benefits

AES Inside Track

Current Standards

Standards Blog

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

AES E-Library

Deep Learning Based Voice Extraction and Primary-Ambience Decomposition for Stereo to Surround Upmixing

Choose your country of residence from this list:

AES E-Library

Login Institutions

Deep Learning Based Voice Extraction and Primary-Ambience Decomposition for Stereo to Surround Upmixing

Choose your country of residence from this list: