AES E-Library

Real-Time Reverb Reduction for Improved Automatic Speech Recognition in Far-Field

In the paper, methods of real-time reverb reduction based on Generalized Weighted Prediction Error (GWPE) were presented. It was shown that usage of the proposed audio processing routines highly improve the accuracy of Automatic Speech Recognition (ASR) system namely word error rates (WERs) are reduced 11.36% when the user stands 5 meters from the microphone array. The obtained results are close to the ones that are achieved by the offline GWPE implementation (12.06%). Thanks to optimizations and parameters tuning, computational complexity of the proposed realization of GWPE was highly reduced and it achieves RTFs lower than 1.0 (computation time is shorter than signal duration) when using one core of CPU.

 

Author (s):
Affiliation: (See document for exact affiliation information.)
AES Convention: Paper Number:
Publication Date:
Session subject:
Permalink: https://aes2.org/publications/elibrary-page/?id=18677


(1246KB)


Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type:
E-Libary location:
16938
Choose your country of residence from this list:










Skip to content