Variation in the acoustic conditions of a room is a major obstacle to robust sound event classification. The room impulse response characterizes how a sound wave propagates from source to receiver, and with it the overall perceptual quality and intelligibility of the recorded sound. This study presents the Room Acoustic Adversarial Neural Network (RAANN) method, which makes sound event classification more robust to changes in acoustic conditions by exploiting knowledge of the room acoustics during learning. With RAANN, the weighted F1 score for the classification task improved by 1.54 percentage points, and the standard deviation in performance dropped from 1.74 to 1.07 percentage points for acoustic conditions that were harder than those seen during the learning phase. The Clarity Index over the first 25 ms emerged as a good metric for acoustic estimation in RAANN training.
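For readers unfamiliar with the Clarity Index, the sketch below computes it in its usual textbook form: the ratio, in decibels, of the impulse response energy before a time boundary to the energy after it, here with the boundary at 25 ms. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the exact computation used in the paper, the function names `clarity_index_db` and `synthetic_rir` are hypothetical, the impulse response is assumed to start at the direct-sound arrival, and the exponentially decaying test signal is a toy stand-in for a measured response.

```python
import math


def clarity_index_db(rir, sample_rate, early_ms=25.0):
    """Early-to-late energy ratio (dB) of a room impulse response.

    Assumption: `rir` is a sequence of samples whose first sample
    coincides with the arrival of the direct sound.
    """
    split = int(round(sample_rate * early_ms / 1000.0))
    early = sum(x * x for x in rir[:split])   # energy in the first early_ms
    late = sum(x * x for x in rir[split:])    # energy in the remainder
    if late == 0.0:
        return math.inf
    if early == 0.0:
        return -math.inf
    return 10.0 * math.log10(early / late)


def synthetic_rir(sample_rate, rt60_s, duration_s):
    """Toy impulse response (hypothetical helper): an exponential decay
    whose energy falls by 60 dB after rt60_s seconds."""
    decay = 3.0 * math.log(10.0) / rt60_s     # amplitude decay rate in 1/s
    n = int(sample_rate * duration_s)
    return [((-1) ** i) * math.exp(-decay * i / sample_rate) for i in range(n)]


if __name__ == "__main__":
    dry = clarity_index_db(synthetic_rir(16000, 0.3, 1.0), 16000)
    wet = clarity_index_db(synthetic_rir(16000, 1.2, 2.0), 16000)
    print(f"C25, short reverberation: {dry:.2f} dB")
    print(f"C25, long reverberation:  {wet:.2f} dB")
```

In this toy setup the less reverberant response yields the higher value, which is the kind of ordering that lets a single number stand in for how difficult an acoustic condition is.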
Author(s): Upadhyaya, Sreenivasa; Buyens, Wim; Desmet, Wim; Karsmakers, Peter
Affiliation:
KU Leuven, Department of Computer Science, DTAI, Kleinhoefstraat 4, B-2440 Geel, Belgium; SoundTalks NV, Interleuvenlaan 15/c, B-3001 Leuven, Belgium; KU Leuven, Department of Mechanical Engineering, LMSD, Celestijnenlaan 300, Leuven, Belgium
(See document for exact affiliation information.)
Publication Date:
2024-11-13
Permalink: https://aes2.org/publications/elibrary-page/?id=22777
Upadhyaya, Sreenivasa; Buyens, Wim; Desmet, Wim; Karsmakers, Peter; 2024; Room Acoustic Adversarial Neural Network for Robust Sound Event Classification [PDF]; KU Leuven, Department of Computer Science, DTAI, Kleinhoefstraat 4, B-2440 Geel, Belgium; SoundTalks NV, Interleuvenlaan 15/c, B-3001 Leuven, Belgium; KU Leuven, Department of Mechanical Engineering, LMSD, Celestijnenlaan 300, Leuven, Belgium; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=22777
Upadhyaya, Sreenivasa; Buyens, Wim; Desmet, Wim; Karsmakers, Peter; Room Acoustic Adversarial Neural Network for Robust Sound Event Classification [PDF]; KU Leuven, Department of Computer Science, DTAI, Kleinhoefstraat 4, B-2440 Geel, Belgium; SoundTalks NV, Interleuvenlaan 15/c, B-3001 Leuven, Belgium; KU Leuven, Department of Mechanical Engineering, LMSD, Celestijnenlaan 300, Leuven, Belgium; Paper ; 2024 Available: https://aes2.org/publications/elibrary-page/?id=22777
@article{upadhyaya2024room,
  author={Upadhyaya, Sreenivasa and Buyens, Wim and Desmet, Wim and Karsmakers, Peter},
  journal={Journal of the Audio Engineering Society},
  title={Room Acoustic Adversarial Neural Network for Robust Sound Event Classification},
  year={2024},
  volume={72},
  number={11},
  pages={754--766},
  month={december},
}