AES E-Library

Overview of speech quality metrics in terms of automated evaluation of signal denoising in a presence of non-stationary noise

Recent developments in neural network-based speech enhancement call for a robust objective metric for comparing the performance of different noise suppression algorithms on non-stationary noise, one that closely matches the results of costly and time-consuming subjective user tests. The article describes the results of a comparison between subjective scores obtained using the MUSHRA methodology and automated evaluation with objective metrics, i.e. POLQA, 3QUEST, STOI and ESTOI, on a set of recordings processed by two different denoising algorithms for close and far speaker distances. The correlation coefficient between the subjective scores and each examined metric is calculated. The results are based on recordings simulated with an in-house simulation toolchain, using impulse responses from an actual laptop device used in a quiet, low-reverberation room.
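As a rough illustration of the correlation step mentioned in the abstract, the sketch below computes a Pearson correlation coefficient between per-recording subjective scores and the values of one objective metric. The abstract does not say which correlation coefficient the authors used, so Pearson is an assumption here (a rank-based coefficient such as Spearman would be an equally plausible choice). The function name `pearson` and the variables `mushra_scores` and `metric_values` are hypothetical, and the numbers are placeholders chosen only to make the example run. They are not data from the paper.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of products of deviations from the means (unnormalised covariance)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Placeholder values only (NOT results from the paper):
# one subjective score and one objective metric value per recording.
mushra_scores = [20.0, 40.0, 60.0, 80.0]   # hypothetical MUSHRA scores (0-100 scale)
metric_values = [1.5, 2.5, 3.5, 4.5]       # hypothetical objective metric outputs

r = pearson(mushra_scores, metric_values)
print(f"Pearson r = {r:.3f}")
```

In a comparison like the one described, this calculation would be repeated once per objective metric, and the metric whose values correlate most strongly with the subjective scores would be the better proxy for listening tests.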

 

Permalink: https://aes2.org/publications/elibrary-page/?id=20911



E-Library location: 16938