AES E-Library

Introducing the "Cockpit Party Problem": Blind Source Separation Enhances Aircraft Cockpit Speech Transcription

Cockpit voice recorders (CVRs) are one of the two mandatory flight recording devices embarked in commercial aircraft. Its analysis is crucial to understand the context of an air incident or accident. However, in such scenarios, when the audio recordings are usable, CVR may contain strong mixtures of crew member speech signals, radio communications, and cockpit alarms. However, contrary to the “cocktail party problem” that blind source separation (BSS) aims to tackle, modeling CVR mixtures—that are here named the “cockpit party problem”—was never done before. In this paper, the authors thus propose a CVR mixture model and highlight its limitations. While not trivial—even in a two-source scenario—BSS methods can be applied to real CVR recordings. It is found that taking into account several BSS outputs provided by various methods may help audio analysts to transcribe the CVR data. That is, near 90% of unintelligible words can be transcribed from CVR recordings processed by BSS methods.

 

Author (s):
Affiliation: (See document for exact affiliation information.)
Publication Date:
Permalink: https://aes2.org/publications/elibrary-page/?id=22792


(795KB)


Download Now

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type:
E-Libary location:
16938
Choose your country of residence from this list:










Skip to content