E-library page

AES E-Library

Stereo Speech Enhancement Using Custom Mid-Side Signals and Monaural Processing

Speech enhancement (SE) systems typically operate on monaural input and are used for applications including voice communications and capture cleanup for user-generated content. Recent advancements and changes in the devices used for these applications are likely to lead to an increase in the amount of two-channel content for the same applications. However, SE systems are typically designed for monaural input; stereo results produced using trivial methods such as channel-independent or mid-side processing may be unsatisfactory, including substantial speech distortions. To address this, the authors propose a system that creates a novel representation of stereo signals called custom mid-side signals (CMSS). CMSS allow benefits of mid-side signals for center-panned speech to be extended to a much larger class of input signals. This, in turn, allows any existing monaural SE system to operate as an efficient stereo system by processing the custom mid signal. This paper describes how the parameters needed for CMSS can be efficiently estimated by a component of the spatio-level--filtering source separation system. Subjective listening using state-of-the-art deep learning--based SE systems on stereo content with various speech mixing styles shows that CMSS processing leads to improved speech quality at approximately half the cost of channel-independent processing.

Author (s): Master, Aaron S.; Lu, Lie; Swedlow, Nathan
Affiliation: Dolby Laboratories, Inc., San Francisco, CA (See document for exact affiliation information.)
Publication Date: 2023-07-06 Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=22148

(580KB)

This paper costs $33 for non-members and is free for AES members and E-Libary subscribers.

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type: Journal Article
E-Libary location: (CD JAES71) TMP/JAES71/7/

Learn more about the AES E-Library

About AES

Code of Conduct

AES Conventions

AES Conferences

AES Training & Development

Gift Membership

AES Membership Benefits

Gift Membership

AES Membership Benefits

Become a Sustaining Member

AES Membership Benefits

AES Inside Track

Current Standards

Standards Blog

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

AES E-Library

Stereo Speech Enhancement Using Custom Mid-Side Signals and Monaural Processing

Choose your country of residence from this list:

AES E-Library

Login Institutions

Stereo Speech Enhancement Using Custom Mid-Side Signals and Monaural Processing

Choose your country of residence from this list: