E-library page

AES E-Library

A Perceptual Speech-Quality Measure Based on a Psychoacoustic Sound Representation

A general method for predicting the subjective quality of speech codecs has been developed. This method uses the concept of an internal sound representation. A model of the human auditory system is used to calculate the internal representation of the input and output signals of a speech codec. The transformation from the physical domain to the psychophysical (internal) domain is performed by way of three operations-frequency warping, time-frequency smearing, and level compression. These operations allow modeling of the masking behavior of the human auditory system both at and above masked threshold. It is shown that for the determination of speech codec quality, no time-frequency smearing has to be applied. This is in contrast with the results found for music codecs, for which the applied model parameters were in line with psychoacoustic data. Nevertheless the perceptual speech-quality measure (PSQM) can be used to predict the quality of speech codecs. The PSQM was optimized using the ETSI GSM speech codec test. The PSQM is validated is validated with the subjective results of the CCITT LD-CELP (G.728) speech codec test. Correlation between the predicted objective mean opinion scores (MOS), using the PSQM, and the subjective MOS results of the CCITT LD-CELP database was very high (0.99) with a low standard deviation (0.14). The predictions made with the PSQM are compared with predictions of four other speech-quality measures. The results show that the PSQM has the highest correlation with the lowest standard deviation.

Author (s): Beerends, John G.; Stemerdink, Jan A.
Affiliation: PTT Research, Leidschendam, The Netherlands (See document for exact affiliation information.)
Publication Date: 1994-03-06 Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=6957

(951KB)

This paper costs $33 for non-members and is free for AES members and E-Libary subscribers.

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type: Journal Article
E-Libary location: (CD aes5) JRNL8997/1994/

Learn more about the AES E-Library

About AES

Code of Conduct

AES Conventions

AES Conferences

AES Training & Development

Gift Membership

AES Membership Benefits

Gift Membership

AES Membership Benefits

Become a Sustaining Member

AES Membership Benefits

AES Inside Track

Current Standards

Standards Blog

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

AES E-Library

A Perceptual Speech-Quality Measure Based on a Psychoacoustic Sound Representation

Choose your country of residence from this list:

AES E-Library

Login Institutions

A Perceptual Speech-Quality Measure Based on a Psychoacoustic Sound Representation

Choose your country of residence from this list: