A general method for predicting the subjective quality of speech codecs has been developed. This method uses the concept of an internal sound representation. A model of the human auditory system is used to calculate the internal representation of the input and output signals of a speech codec. The transformation from the physical domain to the psychophysical (internal) domain is performed by way of three operations: frequency warping, time-frequency smearing, and level compression. These operations allow modeling of the masking behavior of the human auditory system both at and above the masked threshold. It is shown that for the determination of speech codec quality, no time-frequency smearing has to be applied. This is in contrast with the results found for music codecs, for which the applied model parameters were in line with psychoacoustic data. Nevertheless, the perceptual speech-quality measure (PSQM) can be used to predict the quality of speech codecs. The PSQM was optimized using the ETSI GSM speech codec test and validated with the subjective results of the CCITT LD-CELP (G.728) speech codec test. The correlation between the objective mean opinion scores (MOS) predicted by the PSQM and the subjective MOS results of the CCITT LD-CELP database was very high (0.99), with a low standard deviation (0.14). The predictions made with the PSQM are compared with the predictions of four other speech-quality measures. The results show that the PSQM has the highest correlation and the lowest standard deviation.
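The transformation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The Bark-scale formula (Traunmüller's approximation), the compression exponent of 0.23, the 16-band layout, the 8 kHz sample rate, and all function names are assumptions chosen for illustration. Following the abstract's finding for speech, the sketch applies no time-frequency smearing.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum of a real frame (bins 0 .. N/2)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2 / n)
    return spectrum


def hz_to_bark(freq_hz):
    """Traunmueller's approximation of the Bark scale (an assumed choice)."""
    return 26.81 * freq_hz / (1960.0 + freq_hz) - 0.53


def warp_to_bark(spectrum, sample_rate, n_bands):
    """Frequency warping: pool DFT-bin power into equal-width Bark bands."""
    n_fft = 2 * (len(spectrum) - 1)
    z_min = hz_to_bark(0.0)
    z_max = hz_to_bark(sample_rate / 2.0)
    bands = [0.0] * n_bands
    for k, power in enumerate(spectrum):
        z = hz_to_bark(k * sample_rate / n_fft)
        index = min(int((z - z_min) / (z_max - z_min) * n_bands), n_bands - 1)
        bands[index] += power
    return bands


def compress(bands, exponent=0.23):
    """Level compression: a power law (the exponent is illustrative only)."""
    return [b ** exponent for b in bands]


def internal_representation(frame, sample_rate=8000, n_bands=16):
    """Warp, then compress; no smearing step is applied in this sketch."""
    return compress(warp_to_bark(power_spectrum(frame), sample_rate, n_bands))


def frame_distance(reference, degraded):
    """Mean absolute difference between two internal representations."""
    a = internal_representation(reference)
    b = internal_representation(degraded)
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


if __name__ == "__main__":
    # Hypothetical test signal: a 500 Hz tone and an attenuated copy of it.
    reference = [math.sin(2 * math.pi * 500 * i / 8000) for i in range(64)]
    degraded = [0.8 * x for x in reference]
    print(frame_distance(reference, reference))
    print(frame_distance(reference, degraded))
```

Comparing a frame with itself gives a distance of zero, and any level or spectral change in the codec output gives a positive distance. A real measure would also need time alignment, level calibration, and a mapping from distance to MOS, none of which is sketched here.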
Author(s): Beerends, John G.; Stemerdink, Jan A.
Affiliation:
PTT Research, Leidschendam, The Netherlands
(See document for exact affiliation information.)
Publication Date:
1994-03-06
Permalink: https://aes2.org/publications/elibrary-page/?id=6957
Beerends, John G.; Stemerdink, Jan A.; 1994; A Perceptual Speech-Quality Measure Based on a Psychoacoustic Sound Representation [PDF]; PTT Research, Leidschendam, The Netherlands; Paper; Available from: https://aes2.org/publications/elibrary-page/?id=6957
@article{beerends1994a,
  author={Beerends, John G. and Stemerdink, Jan A.},
  journal={Journal of the Audio Engineering Society},
  title={A Perceptual Speech-Quality Measure Based on a Psychoacoustic Sound Representation},
  year={1994},
  volume={42},
  number={3},
  pages={115--123},
  month={March},
}