E-library page

AES E-Library

Speech Loudness in Broadcasting and Streaming

The introduction and regulation of loudness in broadcasting and streaming brought clear benefits to the audience, e.g., a level of uniformity across programs and channels. Yet, speech loudness is frequently reported as being too low in certain passages, which can hinder the full understanding and enjoyment of movies and TV programs. This paper proposes expanding the set of loudness-based measures typically used in the industry. We focus on speech loudness, and we show that, when clean speech is not available, Deep Neural Networks (DNNs) can be used to isolate the speech signal and so to accurately estimate speech loudness, providing a more precise estimate compared to speech-gated loudness. Moreover, we define critical passages, i.e., passages in which speech is likely to be hard to understand. Critical passages are defined based on the local Speech Loudness Deviation (SLD) and the local Speech-to-Background Loudness Difference (SBLD), as SLD and SBLD significantly contribute to intelligibility and listening effort. In contrast to other more comprehensive measures of intelligibility and listening effort, SLD and SBLD can be straightforwardly measured, are intuitive, and, most importantly, can be easily controlled by adjusting the speech level in the mix or by enabling personalization at the users end. Finally, examples are provided that show how the detection of critical passages can support the evaluation and control of the speech signal during and after content production.

Author (s): Torcoli, Matteo; Halimeh, Mhd Modar; Leitz, Thomas; Grewe, Yannik; Kratschmer, Michael; Murtaza, Adrian; Fuchs, Harald; Habets, Emanuel; Neugebauer, Bernhard
Affiliation: Fraunhofer IIS; DSP Solutions; Fraunhofer IIS; Fraunhofer IIS; Fraunhofer IIS; Fraunhofer IIS; Fraunhofer IIS; Fraunhofer IIS; Fraunhofer IIS (See document for exact affiliation information.)
AES Convention: 156 Paper Number:10698
Publication Date: 2024-06-06 Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=22511

(14430KB)

This paper costs $33 for non-members and is free for AES members and E-Libary subscribers.

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type: Convention Paper
E-Libary location: TMP/conv/156/

Learn more about the AES E-Library

About AES

Code of Conduct

AES Conventions

AES Conferences

AES Training & Development

Gift Membership

AES Membership Benefits

Gift Membership

AES Membership Benefits

Become a Sustaining Member

AES Membership Benefits

AES Inside Track

Current Standards

Standards Blog

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

AES E-Library

Speech Loudness in Broadcasting and Streaming

Choose your country of residence from this list:

AES E-Library

Login Institutions

Speech Loudness in Broadcasting and Streaming

Choose your country of residence from this list: