Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Since early perceptual audio coders such as mp3, the underlying psychoacoustic model that controls the encoding process has not undergone many dramatic changes. Meanwhile, modern audio coders have been equipped with semi-parametric or parametric coding tools such as audio bandwidth extension. Thereby, the initial psychoacoustic model used in a perceptual coder, just considering added quantization noise, became partly unsuitable. We propose the use of an improved psychoacoustic excitation model based on an existing model proposed by Dau et al. in 1997. This modulation-based model is essentially independent from the input waveform by calculating an internal auditory representation. Using the example of MPEG-H 3D Audio and its semi-parametric Intelligent Gap Filling (IGF) tool, we demonstrate that we can successfully control the IGF parameter selection process to achieve overall improved perceptual quality.
Author (s): Disch, Sascha; van de Par, Steven; Niedermeier, Andreas; Burdiel Pérez, Elena; Berasategui Ceberio, Ane; Edler, Bernd
Affiliation:
University of Oldenburg, Oldenburg, Germany; Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany; Friedrich Alexander University, International Audio Laboratories Erlangen, Erlangen, Germany
(See document for exact affiliation information.)
AES Convention: 145
Paper Number:10029
Publication Date:
2018-10-06
Import into BibTeX
Session subject:
Perception – Part 1
Permalink: https://aes2.org/publications/elibrary-page/?id=19755
(1957KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Disch, Sascha; van de Par, Steven; Niedermeier, Andreas; Burdiel Pérez, Elena; Berasategui Ceberio, Ane; Edler, Bernd; 2018; Improved Psychoacoustic Model for Efficient Perceptual Audio Codecs [PDF]; University of Oldenburg, Oldenburg, Germany; Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany; Friedrich Alexander University, International Audio Laboratories Erlangen, Erlangen, Germany; Paper 10029; Available from: https://aes2.org/publications/elibrary-page/?id=19755
Disch, Sascha; van de Par, Steven; Niedermeier, Andreas; Burdiel Pérez, Elena; Berasategui Ceberio, Ane; Edler, Bernd; Improved Psychoacoustic Model for Efficient Perceptual Audio Codecs [PDF]; University of Oldenburg, Oldenburg, Germany; Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany; Friedrich Alexander University, International Audio Laboratories Erlangen, Erlangen, Germany; Paper 10029; 2018 Available: https://aes2.org/publications/elibrary-page/?id=19755
@article{disch2018improved,
author={disch sascha and van de par steven and niedermeier andreas and burdiel pérez elena and berasategui ceberio ane and edler bernd},
journal={journal of the audio engineering society},
title={improved psychoacoustic model for efficient perceptual audio codecs},
year={2018},
number={10029},
month={october},}
TY – paper
TI – Improved Psychoacoustic Model for Efficient Perceptual Audio Codecs
AU – Disch, Sascha
AU – van de Par, Steven
AU – Niedermeier, Andreas
AU – Burdiel Pérez, Elena
AU – Berasategui Ceberio, Ane
AU – Edler, Bernd
PY – 2018
JO – Journal of the Audio Engineering Society
VL – 10029
Y1 – October 2018