With the growing capability of recording and storage devices, the problem of indexing large audio databases has received much attention. Most of this effort is dedicated to automatic inferences from indexed metadata. In contrast, browsing audio databases effectively has received less attention. This report studies the relevance of a semantic organization of sounds to ease the browsing of a sound database. For such a task, semantic access to data is traditionally implemented by a keyword selection process. However, various limitations of written language, such as word polysemy, ambiguities, or translation issues, may bias the browsing process. Two sound presentation strategies organized sounds spatially to reflect an underlying semantic hierarchy. For the sake of comparison, the authors also considered a display whose spatial organization was based only on acoustic cues. These three displays were evaluated in terms of search speed in a crowdsourcing experiment using two different corpora: environmental sounds from urban environments and sounds produced by musical instruments. Coherent results demonstrate the usefulness of an implicit semantic organization for representing sounds in terms of both search speed and learning efficiency.
Author(s): Lafay, Grégoire; Misdariis, Nicolas; Lagrange, Mathieu; Rossignol, Mathias
Affiliation:
IRCCyN, Ecole Centrale de Nantes, France; STMS Ircam-CNRS-UPMC, Paris, France
(See document for exact affiliation information.)
Publication Date:
2016-09-06
Permalink: https://aes2.org/publications/elibrary-page/?id=18371
Lafay, Grégoire; Misdariis, Nicolas; Lagrange, Mathieu; Rossignol, Mathias; 2016; Semantic Browsing of Sound Databases without Keywords; IRCCyN, Ecole Centrale de Nantes, France; STMS Ircam-CNRS-UPMC, Paris, France; Paper; Available from: https://aes2.org/publications/elibrary-page/?id=18371
@article{lafay2016semantic,
author={Lafay, Grégoire and Misdariis, Nicolas and Lagrange, Mathieu and Rossignol, Mathias},
journal={Journal of the Audio Engineering Society},
title={Semantic Browsing of Sound Databases without Keywords},
year={2016},
volume={64},
number={9},
pages={628--635},
month={September}
}