You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Locating the right sound effect efficiently is an important yet challenging topic for audio production. Most current sound-searching systems rely on pre-annotated audio labels created by humans, which can be time-consuming to produce and prone to inaccuracies, limiting the efficiency of audio production. Recent works on text and audio multimodal neural networks have led to the development of contrastive language-audio pretraining (CLAP), which learns a shared embedding space for text descriptions and audio samples. Using this idea, we built a CLAP-based sound searching system (CLAP-Search) that does not rely on human annotations. To evaluate the effectiveness of CLAP-Search, we conducted comparative experiments with a widely used sound effect searching platform, the BBC Sound Effect Library. Our study evaluates user performance, cognitive load, and satisfaction through ecologically valid tasks based on professional sound-searching workflows. Our result shows that CLAP-Search demonstrated significantly enhanced productivity and reduced frustration while maintaining comparable cognitive demands. We also qualitatively analyzed the participants feedback, which offered valuable perspectives on the design of future AI-assisted sound search systems.
Author (s): Liu, Haohe;
Deacon, Thomas;
Wang, Wenwu;
Paradis, Matt;
Plumbley, Mark D.;
Affiliation:
University of Surrey; University of Surrey; University of Surrey; British Broadcasting Corporation
(See document for exact affiliation information.)
AES Convention: 159
Paper Number:10233
Publication Date:
2025-10-14
DOI:

Liu, Haohe; Deacon, Thomas; Wang, Wenwu; Paradis, Matt; Plumbley, Mark D.; 2025; Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows [PDF]; University of Surrey; University of Surrey; University of Surrey; British Broadcasting Corporation; Paper 10233; Available from: https://aes.org/publications/elibrary-page/?id=23077
Liu, Haohe; Deacon, Thomas; Wang, Wenwu; Paradis, Matt; Plumbley, Mark D.; Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows [PDF]; University of Surrey; University of Surrey; University of Surrey; British Broadcasting Corporation; Paper 10233; 2025 Available: https://aes.org/publications/elibrary-page/?id=23077
@inproceedings{Liu2025exploring,
title={{Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows}},
author={Liu, Haohe and Deacon, Thomas and Wang, Wenwu and Paradis, Matt and Plumbley, Mark D.},
year={2025},
month={jun},
booktitle={Journal of the Audio Engineering Society},
publisher={},
number={10233},
organization={AES},
}
Notifications