You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
This paper investigates audiovisual congruence in virtual reality with both horizontal and vertical offsets between audio and visual rendering. Audiovisual congruence and localization errors are assessed using loudspeaker playback and nonindividualized headphone rendering. To account for the influence of different types of visual information on congruence, presentations of a loudspeaker model and 3D human avatar were compared. Therefore, a new dataset of audiovisual speech was recorded. Results show that human avatar rendering increases perceived congruence, and experienced listeners have an increased tendency to respond with “incongruent” when a loudspeaker model is shown but not when the human avatar is presented. Moreover, a correlation is found between localization precision and audiovisual congruence for horizontally offset stimuli and avatar presentation. For vertical offsets, the angular range of congruence is generally large, and localization errors are high, so no correlation can be observed between the two. The paper contributes congruence ranges for audiovisual speech in virtual reality, which also has implications for augmented reality telepresence use.
Author (s): Hofmann, Anja; Meyer-Kahlen, Nils; Schlecht, Sebastian J.; Lokki, Tapio
Affiliation:
Aalto Acoustics Lab, Department of Information and Communications Engineering, Aalto University, and Media Lab
(See document for exact affiliation information.)
Publication Date:
2024-10-15
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=22771
(935KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Hofmann, Anja; Meyer-Kahlen, Nils; Schlecht, Sebastian J.; Lokki, Tapio; 2024; Audiovisual Congruence and Localization Performance in Virtual Reality: 3D Loudspeaker Model vs. Human Avatar [PDF]; Aalto Acoustics Lab, Department of Information and Communications Engineering, Aalto University, and Media Lab; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=22771
Hofmann, Anja; Meyer-Kahlen, Nils; Schlecht, Sebastian J.; Lokki, Tapio; Audiovisual Congruence and Localization Performance in Virtual Reality: 3D Loudspeaker Model vs. Human Avatar [PDF]; Aalto Acoustics Lab, Department of Information and Communications Engineering, Aalto University, and Media Lab; Paper ; 2024 Available: https://aes2.org/publications/elibrary-page/?id=22771
@article{hofmann2024audiovisual,
author={hofmann anja and meyer-kahlen nils and schlecht sebastian j. and lokki tapio},
journal={journal of the audio engineering society},
title={audiovisual congruence and localization performance in virtual reality: 3d loudspeaker model vs. human avatar},
year={2024},
volume={72},
issue={10},
pages={679-690},
month={april},}
TY – paper
TI – Audiovisual Congruence and Localization Performance in Virtual Reality: 3D Loudspeaker Model vs. Human Avatar
SP – 679 EP – 690
AU – Hofmann, Anja
AU – Meyer-Kahlen, Nils
AU – Schlecht, Sebastian J.
AU – Lokki, Tapio
PY – 2024
JO – Journal of the Audio Engineering Society
VO – 72
IS – 10
Y1 – April 2024