Surround systems have gained popularity in home entertainment even though most cinematic content is delivered in two-channel stereo format. Although several upmixing options exist, it has proven challenging to deliver an upmixed signal that approximates the original directionality and timbre intended by the mixing artist. The aim of this work is to design a two-to-five-channel upmixer using a novel upmixing strategy that combines voice extraction and primary-ambience decomposition. Results from a modified MUSHRA test show that the proposed upmixer outperforms established alternatives for cinematic upmixing in perceived spatial and timbral quality.
Author(s): Paez Amaro, Ricardo Thaddeus; Tejeda Ocampo, Carlos; Souza Blanes, Ema; Bharitkar, Sunil; Madrid Herrera, Luis
Affiliation:
Samsung Research Tijuana, Mexico; Samsung Research Tijuana, Mexico; Samsung Research America, Mountain View, CA, USA; Samsung Research America, Mountain View, CA, USA; Samsung Research Tijuana, Mexico
(See document for exact affiliation information.)
AES Convention: 154
Paper Number: 62
Publication Date: 2023-05-06
Session subject: Neural Networks
Permalink: https://aes2.org/publications/elibrary-page/?id=22087
Paez Amaro, Ricardo Thaddeus; Tejeda Ocampo, Carlos; Souza Blanes, Ema; Bharitkar, Sunil; Madrid Herrera, Luis; 2023; Deep Learning Based Voice Extraction and Primary-Ambience Decomposition for Stereo to Surround Upmixing [PDF]; Samsung Research Tijuana, Mexico; Samsung Research Tijuana, Mexico; Samsung Research America, Mountain View, CA, USA; Samsung Research America, Mountain View, CA, USA; Samsung Research Tijuana, Mexico; Paper 62; Available from: https://aes2.org/publications/elibrary-page/?id=22087
Paez Amaro, Ricardo Thaddeus; Tejeda Ocampo, Carlos; Souza Blanes, Ema; Bharitkar, Sunil; Madrid Herrera, Luis; Deep Learning Based Voice Extraction and Primary-Ambience Decomposition for Stereo to Surround Upmixing [PDF]; Samsung Research Tijuana, Mexico; Samsung Research Tijuana, Mexico; Samsung Research America, Mountain View, CA, USA; Samsung Research America, Mountain View, CA, USA; Samsung Research Tijuana, Mexico; Paper 62; 2023 Available: https://aes2.org/publications/elibrary-page/?id=22087
@article{paez2023deep,
author={Paez Amaro, Ricardo Thaddeus and Tejeda Ocampo, Carlos and Souza Blanes, Ema and Bharitkar, Sunil and Madrid Herrera, Luis},
journal={Journal of the Audio Engineering Society},
title={Deep Learning Based Voice Extraction and Primary-Ambience Decomposition for Stereo to Surround Upmixing},
year={2023},
number={62},
month={may},
}
TY  - paper
TI  - Deep Learning Based Voice Extraction and Primary-Ambience Decomposition for Stereo to Surround Upmixing
AU  - Paez Amaro, Ricardo Thaddeus
AU  - Tejeda Ocampo, Carlos
AU  - Souza Blanes, Ema
AU  - Bharitkar, Sunil
AU  - Madrid Herrera, Luis
PY  - 2023
JO  - Journal of the Audio Engineering Society
VL  - 62
Y1  - May 2023