Vocal Percussion Transcription (VPT) aims to detect vocal percussion sound events in a beatboxing performance and classify them into the correct drum instrument class (kick, snare, or hi-hat). In an online (real-time) setting, however, algorithms must classify these events within just a few milliseconds of detecting them. The purpose of this study was to investigate which phoneme-to-instrument mappings are the most robust for online transcription. We based our decision on three evaluation criteria: frequency of use of phonemes among different performers, spectral similarity to reference drum sounds, and classification separability. Mappings recommended under these criteria should feel natural for performers to articulate while enabling the classification algorithms to achieve the best possible performance. Based on the final results, we provided a detailed discussion of which phonemes to choose for different contexts and applications.
Authors: Delgado, Alejandro; Saitis, Charalampos; Sandler, Mark
Affiliation: Roli Ltd., London, UK; Queen Mary University of London, London, UK
(See document for exact affiliation information.)
AES Convention: 151
Paper Number: 10529
Publication Date: 2021-10-06
Session subject: Audio Signal Processing
Permalink: https://aes2.org/publications/elibrary-page/?id=21493
Delgado, Alejandro; Saitis, Charalampos; Sandler, Mark; 2021; Phoneme Mappings for Online Vocal Percussion Transcription [PDF]; Roli Ltd., London, UK; Queen Mary University of London, London, UK; Paper 10529; Available from: https://aes2.org/publications/elibrary-page/?id=21493