E-library page

AES E-Library

S^3MASH: Spatial Sound Scene Matching using Single-Channel Audio

This paper describes a novel approach for recording and binaurally reproducing spatial sound scenes using the audio from a single microphone. This is realised by recording the sound scene using both a microphone array, which potentially comprises more affordable and lower quality capsules, and a monophonic microphone, possibly featuring a higher quality capsule. By adopting a perceptually motivated sound-field model and estimating the models spatial parameters, it is possible to define target time-frequency-dependent binaural spatial covariance matrices (SCMs). The actual binaural signals can then be synthesised using an adaptive SCM matching renderer, which takes only the higher-quality monophonic audio signal as input. A perceptual study was conducted to compare this novel processing approach, using a tetrahedral array and an omnidirectional microphone, against binaural renderings achieved through traditional Ambisonic means, when using four- and 32-channel arrays. The results show that, despite utilising only a monophonic signal for the spatialisation, the proposed approach yielded binaural renderings that are perceptually in-between the two conventional Ambisonic array renderings, with regards to their perceived spatial accuracy.

Author (s): Gonzalez, Raimundo; McCormack, Leo; Politis, Archontis
Affiliation: Department of Information and Communications Engineering, Aalto University, Espoo, Finland; Department of Information and Communications Engineering, Aalto University, Espoo, Finland; Faculty of Information Technology and Communication Sciences, Tampere University, Finland (See document for exact affiliation information.)
Publication Date: 2024-08-05 Import into BibTeX
Session subject: Audio for Virtual and Augmented Reality
Permalink: https://aes2.org/publications/elibrary-page/?id=22664

(4439KB)

This paper costs $33 for non-members and is free for AES members and E-Libary subscribers.

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type: conference Paper
E-Libary location: TMP/conf/2024/2499de99-0164-4a7e-b321-6ba8b944abcb/

Learn more about the AES E-Library

About AES

Code of Conduct

AES Conventions

AES Conferences

AES Training & Development

Gift Membership

AES Membership Benefits

Gift Membership

AES Membership Benefits

Become a Sustaining Member

AES Membership Benefits

AES Inside Track

Current Standards

Standards Blog

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

AES E-Library

S^3MASH: Spatial Sound Scene Matching using Single-Channel Audio

Choose your country of residence from this list:

AES E-Library

Login Institutions

S^3MASH: Spatial Sound Scene Matching using Single-Channel Audio

Choose your country of residence from this list: