Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Speech de-noising algorithms often suffer from introduction of artifacts, either by removal of parts of the speech signal, or imperfect noise reduction causing the remaining noise to sound unnatural and disturbing. This contribution proposes to spatially distribute monaural noisy speech signals based on single-channel source separation, in order to improve the perceived speech quality. Stereo up-mixing is utilized on the estimated speech and noise sources instead of simply suppressing the noise. This paper investigates the case of non-negative matrix factorization (NMF) speech enhancement applied to high levels of non-stationary noise. NMF-based and spectral subtraction speech enhancement algorithms are evaluated in a listening test in terms of speech intelligibility, presence of interfering noises and overall quality with respect to the unprocessed signal. In the result, the listening test provides evidence for superior noise reduction by NMF, yet also a drop in perceived speech quality that is not covered by the employed set of common objective metrics. However, stereo up-mixing of NMF-separated speech and noise delivers high subjective noise reduction while preserving the perceived speech quality.
Author (s): Kirst, Christian; Weninger, Felix; Joder, Cyril; Grosche, Peter; Geiger, Jürgen; Schuller, Björn
Affiliation:
HUAWEI Technologies Duesseldorf GmbH, European Research Center, Germany; Technische Universität München, Munich, Germany
(See document for exact affiliation information.)
Publication Date:
2014-01-06
Import into BibTeX
Session subject:
Intelligent Audio Effects
Permalink: https://aes2.org/publications/elibrary-page/?id=17090
(1383KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Kirst, Christian; Weninger, Felix; Joder, Cyril; Grosche, Peter; Geiger, Jürgen; Schuller, Björn; 2014; On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise [PDF]; HUAWEI Technologies Duesseldorf GmbH, European Research Center, Germany; Technische Universität München, Munich, Germany; Paper 4-2; Available from: https://aes2.org/publications/elibrary-page/?id=17090
Kirst, Christian; Weninger, Felix; Joder, Cyril; Grosche, Peter; Geiger, Jürgen; Schuller, Björn; On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise [PDF]; HUAWEI Technologies Duesseldorf GmbH, European Research Center, Germany; Technische Universität München, Munich, Germany; Paper 4-2; 2014 Available: https://aes2.org/publications/elibrary-page/?id=17090
@article{kirst2014on-line,
author={kirst christian and weninger felix and joder cyril and grosche peter and geiger jürgen and schuller björn},
journal={journal of the audio engineering society},
title={on-line nmf-based stereo up-mixing of speech improves perceived reduction of non-stationary noise},
year={2014},
number={4-2},
month={january},}
TY – paper
TI – On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise
AU – Kirst, Christian
AU – Weninger, Felix
AU – Joder, Cyril
AU – Grosche, Peter
AU – Geiger, Jürgen
AU – Schuller, Björn
PY – 2014
JO – Journal of the Audio Engineering Society
VL – 4-2
Y1 – January 2014