E-library page

AES E-Library

MPEG Unified Speech and Audio Coding

Unified Speech and Audio Coding is the newest MPEG audio standard, published in late 2011. It achieves consistently state-of-the-art compression performance for any mix of speech and music content. MPEG-1 and MPEG-2 Layer III and MPEG-4 Advanced Audio Coding (AAC) use perceptually shaped quantization noise as the primary tool for achieving compression; MPEG-4 High-Efficiency AAC adds parametric coding of the upper spectrum region (using the Spectral Band Replication tool); and MPEG-D MPEG Surround adds parametric coding of the sound stage (using level, time and coherence parameters in the time/frequency domain). The common thread in all of these MPEG standards is that they model and exploit how humans perceive sound. MPEG-D Unified Speech and Audio Coding incorporates all of these models of sound perception and additionally incorporates a model of sound production, specifically that of human speech. The paper gives an overview of the architecture of the Unified Speech and Audio Coding algorithm and how the various compression tools operate in response to the instantaneous statistics of arbitrary mixed-content signals. There is a brief description of the tools giving the greatest compression performance and results of subjective listening tests showing the performance of the standard relative to state-of-the-art benchmark coders.

Author (s): Quackenbush, Schuyler
Affiliation: Audio Research Labs, Scotch Plains, NJ, USA (See document for exact affiliation information.)
Publication Date: 2011-09-06 Import into BibTeX
Session subject: Keynote
Permalink: https://aes2.org/publications/elibrary-page/?id=16115

(422KB)

This paper costs $33 for non-members and is free for AES members and E-Libary subscribers.

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type: Conference Paper
E-Libary location: (CD 43rdPapers) TMP/conf/43/

Learn more about the AES E-Library

About AES

Code of Conduct

AES Conventions

AES Conferences

AES Training & Development

Gift Membership

AES Membership Benefits

Gift Membership

AES Membership Benefits

Become a Sustaining Member

AES Membership Benefits

AES Inside Track

Current Standards

Standards Blog

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

AES E-Library

MPEG Unified Speech and Audio Coding

Choose your country of residence from this list:

AES E-Library

Login Institutions

MPEG Unified Speech and Audio Coding

Choose your country of residence from this list: