Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
In this and the companion paper Part II, the authors present the Perceptual Objective Listening Quality Assessment (POLQA), the third-generation speech quality measurement algorithm, standardized by the International Telecommunication Union in 2011 as Recommendation P.863. In contrast to the previous standard (P.862 Perceptual Evaluation of Speech Quality), a more complex temporal alignment was developed allowing for the alignment of a wide variety of complex distortions for which P.862 was known to fail, such as multiple delay variations within utterances as well as temporal stretching and compression of the degraded signal. When this new algorithm is used in combination with the advanced perceptual model described in Part II, it provides a new measurement standard for predicting Mean Opinion Scores that outperforms the older PESQ standard, especially for wideband and super wideband speech signals (7 and 14 kHz audio bandwidth). Part I provides the basics of the POLQA approach and outlines the core elements of the temporal alignment.
Author (s): Beerends, John G.; Schmidmer, Christian; Berger, Jens; Obermann, Matthias; Ullmann, Raphael; Pomy, Joachim; Keyhl, Michael
Affiliation:
TNO, Delft, The Netherlands; OPTICOM GmbH, Erlangen, Germany; SwissQual AG, Zuchwil, Switzerland
(See document for exact affiliation information.)
Publication Date:
2013-06-06
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=16829
(526KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Beerends, John G.; Schmidmer, Christian; Berger, Jens; Obermann, Matthias; Ullmann, Raphael; Pomy, Joachim; Keyhl, Michael; 2013; Perceptual Objective Listening Quality Assessment (POLQA), The Third Generation ITU-T Standard for End-to-End Speech Quality Measurement Part I—Temporal Alignment [PDF]; TNO, Delft, The Netherlands; OPTICOM GmbH, Erlangen, Germany; SwissQual AG, Zuchwil, Switzerland; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=16829
Beerends, John G.; Schmidmer, Christian; Berger, Jens; Obermann, Matthias; Ullmann, Raphael; Pomy, Joachim; Keyhl, Michael; Perceptual Objective Listening Quality Assessment (POLQA), The Third Generation ITU-T Standard for End-to-End Speech Quality Measurement Part I—Temporal Alignment [PDF]; TNO, Delft, The Netherlands; OPTICOM GmbH, Erlangen, Germany; SwissQual AG, Zuchwil, Switzerland; Paper ; 2013 Available: https://aes2.org/publications/elibrary-page/?id=16829
@article{beerends2013perceptual,
author={beerends john g. and schmidmer christian and berger jens and obermann matthias and ullmann raphael and pomy joachim and keyhl michael},
journal={journal of the audio engineering society},
title={perceptual objective listening quality assessment (polqa), the third generation itu-t standard for end-to-end speech quality measurement part i—temporal alignment},
year={2013},
volume={61},
issue={6},
pages={366-384},
month={july},}
TY – paper
TI – Perceptual Objective Listening Quality Assessment (POLQA), The Third Generation ITU-T Standard for End-to-End Speech Quality Measurement Part I—Temporal Alignment
SP – 366 EP – 384
AU – Beerends, John G.
AU – Schmidmer, Christian
AU – Berger, Jens
AU – Obermann, Matthias
AU – Ullmann, Raphael
AU – Pomy, Joachim
AU – Keyhl, Michael
PY – 2013
JO – Journal of the Audio Engineering Society
VO – 61
IS – 6
Y1 – July 2013