Object-based audio promises format-agnostic reproduction and extensive personalization of spatial audio content. However, in practical listening scenarios, such as consumer audio, ideal reproduction is typically not possible. To maximize the quality of the listening experience, a different approach is required, for example modifying metadata to account for the reproduction layout or personalization choices. This paper proposes a novel system architecture for semantically informed rendering (SIR) that combines object audio rendering with high-level processing of object metadata. In many cases, this processing uses novel, advanced metadata describing the objects to optimally adapt the audio scene to the reproduction system or listener preferences. The proposed system is evaluated with several adaptation strategies, including semantically motivated downmix to layouts with few loudspeakers, manipulation of perceptual attributes, perceptual reverberation compensation, and orchestration of mobile devices for immersive reproduction. These examples demonstrate how SIR can significantly improve the media experience and provide advanced personalization controls, for example by maintaining smooth object trajectories on systems with few loudspeakers or by providing personalized envelopment levels. An example implementation of the proposed system architecture is described and provided as an open, extensible software framework that combines object-based audio rendering and high-level processing of advanced object metadata.
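The split the abstract describes, metadata processing first and object rendering second, can be illustrated with a minimal Python sketch. This is not the authors' framework or its API. Every name here (`AudioObject`, `importance`, `clamp_to_layout`, `drop_unimportant`, `render_stereo_gains`, `run_pipeline`) is hypothetical, the azimuth convention (positive = left) is an assumption, and the renderer is a plain constant-power stereo panner standing in for a real object renderer.

```python
import math
from dataclasses import dataclass, replace
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class AudioObject:
    """Hypothetical object metadata; field names are illustrative only."""
    name: str
    azimuth_deg: float  # assumed convention: positive = left of centre
    level: float        # linear gain
    importance: int     # assumed semantic metadata, 0 (low) to 10 (high)


# A metadata processor maps a scene (list of objects) to an adapted scene.
Processor = Callable[[List[AudioObject]], List[AudioObject]]


def clamp_to_layout(max_azimuth_deg: float) -> Processor:
    """Pull objects into the angular range the loudspeaker layout covers."""
    def process(objects: List[AudioObject]) -> List[AudioObject]:
        return [
            replace(o, azimuth_deg=max(-max_azimuth_deg,
                                       min(max_azimuth_deg, o.azimuth_deg)))
            for o in objects
        ]
    return process


def drop_unimportant(min_importance: int) -> Processor:
    """Remove objects whose semantic importance is below a threshold."""
    def process(objects: List[AudioObject]) -> List[AudioObject]:
        return [o for o in objects if o.importance >= min_importance]
    return process


def render_stereo_gains(obj: AudioObject,
                        half_angle_deg: float = 30.0) -> Tuple[float, float]:
    """Constant-power pan between loudspeakers at +/- half_angle_deg."""
    p = (obj.azimuth_deg + half_angle_deg) / (2.0 * half_angle_deg)
    p = max(0.0, min(1.0, p))          # 0 = fully right, 1 = fully left
    theta = p * math.pi / 2.0
    return obj.level * math.sin(theta), obj.level * math.cos(theta)


def run_pipeline(objects: List[AudioObject],
                 processors: List[Processor]) -> Dict[str, Tuple[float, float]]:
    """Apply metadata processors in order, then render the adapted scene."""
    for process in processors:
        objects = process(objects)
    return {o.name: render_stereo_gains(o) for o in objects}


scene = [
    AudioObject("dialogue", azimuth_deg=0.0, level=1.0, importance=10),
    AudioObject("ambience", azimuth_deg=110.0, level=0.5, importance=2),
]

# Adapt a surround scene to a stereo pair at +/-30 degrees.
gains = run_pipeline(scene, [clamp_to_layout(30.0)])

# A listener preference could instead strip low-importance objects.
gains_reduced = run_pipeline(scene, [drop_unimportant(5), clamp_to_layout(30.0)])
```

Because the renderer only ever sees the adapted metadata, adaptation strategies can be swapped or chained without touching the rendering code, which is the separation the abstract emphasizes.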
Authors: Franck, Andreas; Francombe, Jon; Woodcock, James; Hughes, Richard; Coleman, Philip; Menzies, Dylan; Cox, Trevor J.; Jackson, Philip J.B.; Fazi, Filippo Maria
Affiliation:
Institute of Sound and Vibration Research, University of Southampton, Southampton, Hampshire, UK; BBC Research and Development, Dock House, MediaCityUK, Salford, UK; Acoustics Research Centre, University of Salford, Salford, UK; Institute of Sound Recording, University of Surrey, Guildford, Surrey, UK; Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, Surrey, UK
(See document for exact affiliation information.)
Publication Date:
2019-07-06
Permalink: https://aes2.org/publications/elibrary-page/?id=20488
Franck, Andreas; Francombe, Jon; Woodcock, James; Hughes, Richard; Coleman, Philip; Menzies, Dylan; Cox, Trevor J.; Jackson, Philip J.B.; Fazi, Filippo Maria; 2019; A System Architecture for Semantically Informed Rendering of Object-Based Audio; Institute of Sound and Vibration Research, University of Southampton, Southampton, Hampshire, UK; BBC Research and Development, Dock House, MediaCityUK, Salford, UK; Acoustics Research Centre, University of Salford, Salford, UK; Institute of Sound Recording, University of Surrey, Guildford, Surrey, UK; Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, Surrey, UK; Paper; Available from: https://aes2.org/publications/elibrary-page/?id=20488
@article{franck2019a,
author={Franck, Andreas and Francombe, Jon and Woodcock, James and Hughes, Richard and Coleman, Philip and Menzies, Dylan and Cox, Trevor J. and Jackson, Philip J.B. and Fazi, Filippo Maria},
journal={Journal of the Audio Engineering Society},
title={A System Architecture for Semantically Informed Rendering of Object-Based Audio},
year={2019},
volume={67},
number={7/8},
pages={498--509},
month={July},
}
TY – paper
TI – A System Architecture for Semantically Informed Rendering of Object-Based Audio
SP – 498
EP – 509
AU – Franck, Andreas
AU – Francombe, Jon
AU – Woodcock, James
AU – Hughes, Richard
AU – Coleman, Philip
AU – Menzies, Dylan
AU – Cox, Trevor J.
AU – Jackson, Philip J.B.
AU – Fazi, Filippo Maria
PY – 2019
JO – Journal of the Audio Engineering Society
VO – 67
IS – 7/8
Y1 – July 2019