You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Automated speaker recognition attains impressive reliability when tested under controlled laboratory acoustic conditions. However, the environmental noise that inevitably exits in many real-world speech samples causes considerable degradation of recognition accuracy due to the so-called “channel mismatch” that occurs between the enrollment and recognition phases. A new online training method is proposed to improve robustness of speaker recognition in noisy conditions. An estimate of the signal-to-noise ratio and an emulated ambient noise spectral profile found in the silence intervals of the speech signal are used to re-enroll the reference model for a claimed speaker to generate a new noisy reference model. Based on a large number of tests using two datasets for speech samples contaminated with cafeteria babble and street noise, the proposed method shows promising improvement. When the signal-to-noise ratio is higher than 20 dB, typical speaker recognition algorithms normally function well, and the use of the proposed online training does not offer any benefit. When the signal-to-noise ratio is below 15 dB, the proposed method improves robustness of recognition. However, the new method shows limitations with speech samples that have been contaminated with interior train noise. Train noise contains slow time-varying components that require prolonged observation to create a reliable estimate.
Author (s): Al-Noori, Ahmed H.Y.; Duncan, Philip
Affiliation:
School of Computing Science and Engineering, University of Salford, Salford, UK
(See document for exact affiliation information.)
Publication Date:
2019-04-06
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=20450
(972KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Al-Noori, Ahmed H.Y.; Duncan, Philip; 2019; Robust Speaker Recognition in Noisy Conditions by Means of Online Training with Noise Profiles [PDF]; School of Computing Science and Engineering, University of Salford, Salford, UK; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=20450
Al-Noori, Ahmed H.Y.; Duncan, Philip; Robust Speaker Recognition in Noisy Conditions by Means of Online Training with Noise Profiles [PDF]; School of Computing Science and Engineering, University of Salford, Salford, UK; Paper ; 2019 Available: https://aes2.org/publications/elibrary-page/?id=20450
@article{al-noori2019robust,
author={al-noori ahmed h.y. and duncan philip},
journal={journal of the audio engineering society},
title={robust speaker recognition in noisy conditions by means of online training with noise profiles},
year={2019},
volume={67},
issue={4},
pages={174-189},
month={april},}