You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Locating the right sound effect efficiently is an important yet challenging topic for audio production. Most current sound-searching systems rely on pre-annotated audio labels created by humans, which can be time-consuming to produce and prone to inaccuracies, limiting the efficiency of audio production. Recent works on text and audio multimodal neural networks have led to the development of contrastive language-audio pretraining (CLAP), which learns a shared embedding space for text descriptions and audio samples. Using this idea, we built a CLAP-based sound searching system (CLAP-Search) that does not rely on human annotations. To evaluate the effectiveness of CLAP-Search, we conducted comparative experiments with a widely used sound effect searching platform, the BBC Sound Effect Library. Our study evaluates user performance, cognitive load, and satisfaction through ecologically valid tasks based on professional sound-searching workflows. Our result shows that CLAP-Search demonstrated significantly enhanced productivity and reduced frustration while maintaining comparable cognitive demands. We also qualitatively analyzed the participants feedback, which offered valuable perspectives on the design of future AI-assisted sound search systems.
Author (s): Liu, Haohe; Deacon, Thomas; Wang, Wenwu; Paradis, Matt; Plumbley, Mark D.
Affiliation:
University of Surrey; University of Surrey; University of Surrey; British Broadcasting Corporation
(See document for exact affiliation information.)
AES Convention: 159
Paper Number:10233
Publication Date:
2025-10-14
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=23077
(1328KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Liu, Haohe; Deacon, Thomas; Wang, Wenwu; Paradis, Matt; Plumbley, Mark D.; 2025; Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows [PDF]; University of Surrey; University of Surrey; University of Surrey; British Broadcasting Corporation; Paper 10233; Available from: https://aes2.org/publications/elibrary-page/?id=23077
Liu, Haohe; Deacon, Thomas; Wang, Wenwu; Paradis, Matt; Plumbley, Mark D.; Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows [PDF]; University of Surrey; University of Surrey; University of Surrey; British Broadcasting Corporation; Paper 10233; 2025 Available: https://aes2.org/publications/elibrary-page/?id=23077
@article{liu2025exploring,
author={liu haohe and deacon thomas and wang wenwu and paradis matt and plumbley mark d.},
journal={journal of the audio engineering society},
title={exploring the user experience of ai-assisted sound searching systems for creative workflows},
year={2025},
number={10233},
month={october},}