Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Unified Speech and Audio Coding is the newest MPEG audio standard, published in late 2011. It achieves consistently state-of-the-art compression performance for any mix of speech and music content. MPEG-1 and MPEG-2 Layer III and MPEG-4 Advanced Audio Coding (AAC) use perceptually shaped quantization noise as the primary tool for achieving compression; MPEG-4 High-Efficiency AAC adds parametric coding of the upper spectrum region (using the Spectral Band Replication tool); and MPEG-D MPEG Surround adds parametric coding of the sound stage (using level, time and coherence parameters in the time/frequency domain). The common thread in all of these MPEG standards is that they model and exploit how humans perceive sound. MPEG-D Unified Speech and Audio Coding incorporates all of these models of sound perception and additionally incorporates a model of sound production, specifically that of human speech. The paper gives an overview of the architecture of the Unified Speech and Audio Coding algorithm and how the various compression tools operate in response to the instantaneous statistics of arbitrary mixed-content signals. There is a brief description of the tools giving the greatest compression performance and results of subjective listening tests showing the performance of the standard relative to state-of-the-art benchmark coders.
Author (s): Quackenbush, Schuyler
Affiliation:
Audio Research Labs, Scotch Plains, NJ, USA
(See document for exact affiliation information.)
Publication Date:
2011-09-06
Import into BibTeX
Session subject:
Keynote
Permalink: https://aes2.org/publications/elibrary-page/?id=16115
(432KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Quackenbush, Schuyler; 2011; MPEG Unified Speech and Audio Coding [PDF]; Audio Research Labs, Scotch Plains, NJ, USA; Paper 1-0; Available from: https://aes2.org/publications/elibrary-page/?id=16115
Quackenbush, Schuyler; MPEG Unified Speech and Audio Coding [PDF]; Audio Research Labs, Scotch Plains, NJ, USA; Paper 1-0; 2011 Available: https://aes2.org/publications/elibrary-page/?id=16115