You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Acoustic Scene Classification (ASC) is a fundamental task in machine listening, aiming to categorize the acoustic environment in which an audio recording was made. Although Convolutional Neural Networks (CNNs) are still the dominant approach in ASC today, increasing attention has been given to optimizing model efficiency, particularly for model deployment in resource-limited computational platforms such as hearing aids. In this study, we explore Reservoir Computing (RC) as an alternative lightweight approach for ASC and systematically compare its performance against multiple CNN architectures, including ResNet-34, MobileNet-V2, EfficientNet-B2, and TF-SepNet-20. Our results indicate that CNN models consistently outperform RC in classification accuracy, even when trained without data augmentation. However, RC exhibits significantly lower computational cost, requires less training time, and achieves the fastest inference speed among all models. This trade-off between accuracy and efficiency suggests that model selection should be guided by application requirements: CNNs are preferable when accuracy is paramount, while RC provides an alternative for low-power real-time applications such as hearing aids.
Author (s): He, Yuxuan; Hosseini, Alireza Molla Ali; Abeßer, Jakob; Jaurigue, Lina; Raake, Alexander; Lüdge, Kathy
Affiliation:
Technische Universität Ilmenau; Technische Universität Ilmenau; Technische Universität Ilmenau; Technische Universität Ilmenau; Technische Universität Ilmenau
(See document for exact affiliation information.)
Publication Date:
2025-09-02
Import into BibTeX
Session subject:
Artificial Intelligence and Machine Learning for Audio
Permalink: https://aes2.org/publications/elibrary-page/?id=23010
(5441KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
He, Yuxuan; Hosseini, Alireza Molla Ali; Abeßer, Jakob; Jaurigue, Lina; Raake, Alexander; Lüdge, Kathy; 2025; From CNN to Reservoir Computing: A New Perspective on Acoustic Scene Classification [PDF]; Technische Universität Ilmenau; Technische Universität Ilmenau; Technische Universität Ilmenau; Technische Universität Ilmenau; Technische Universität Ilmenau; Paper 21; Available from: https://aes2.org/publications/elibrary-page/?id=23010
He, Yuxuan; Hosseini, Alireza Molla Ali; Abeßer, Jakob; Jaurigue, Lina; Raake, Alexander; Lüdge, Kathy; From CNN to Reservoir Computing: A New Perspective on Acoustic Scene Classification [PDF]; Technische Universität Ilmenau; Technische Universität Ilmenau; Technische Universität Ilmenau; Technische Universität Ilmenau; Technische Universität Ilmenau; Paper 21; 2025 Available: https://aes2.org/publications/elibrary-page/?id=23010
@article{he2025from,
author={he yuxuan and hosseini alireza molla ali and abeßer jakob and jaurigue lina and raake alexander and lüdge kathy},
journal={journal of the audio engineering society},
title={from cnn to reservoir computing: a new perspective on acoustic scene classification},
year={2025},
number={21},
month={september},}
TY – paper
TI – From CNN to Reservoir Computing: A New Perspective on Acoustic Scene Classification
AU – He, Yuxuan
AU – Hosseini, Alireza Molla Ali
AU – Abeßer, Jakob
AU – Jaurigue, Lina
AU – Raake, Alexander
AU – Lüdge, Kathy
PY – 2025
JO – Journal of the Audio Engineering Society
VL – 21
Y1 – September 2025