You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Speaker recognition systems can typically attain high performance in ideal conditions. However, significant degradations in accuracy are found in channel-mismatched scenarios. Non-stationary environmental noises and their variations are listed at the top of speaker recognition challenges. Gammtone frequency cepstral coefficient method (GFCC) has been developed to improve the robustness of speaker recognition. This paper presents systematic comparisons between performance of GFCC and conventional MFCC-based speaker verification systems with a purposely collected noisy speech data set. Furthermore, the current work extends the experiments to include investigations into language independency features in recognition phases. The results show that GFCC has better verification performance in noisy environments than MFCC. However, the GFCC shows a higher sensitivity to language mismatch between enrollment and recognition phase.
Author (s): Al-Noori, Ahmed; Li, Francis F.; Duncan, Philip J.
Affiliation:
University of Salford, Salford, Greater Manchester, UK
(See document for exact affiliation information.)
AES Convention: 140
Paper Number:9577
Publication Date:
2016-05-06
Import into BibTeX
Session subject:
Human Factors and Interfaces
Permalink: https://aes2.org/publications/elibrary-page/?id=18275
(1415KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Al-Noori, Ahmed; Li, Francis F.; Duncan, Philip J.; 2016; Robustness of Speaker Recognition from Noisy Speech Samples and Mismatched Languages [PDF]; University of Salford, Salford, Greater Manchester, UK; Paper 9577; Available from: https://aes2.org/publications/elibrary-page/?id=18275
Al-Noori, Ahmed; Li, Francis F.; Duncan, Philip J.; Robustness of Speaker Recognition from Noisy Speech Samples and Mismatched Languages [PDF]; University of Salford, Salford, Greater Manchester, UK; Paper 9577; 2016 Available: https://aes2.org/publications/elibrary-page/?id=18275