E-library page

AES E-Library

Using Neural Networks to Compute Time Offsets from Musical Instruments

This research proposes an approach for computing the time offsets between audio sequences that contain musical sounds from different instruments produced in a distributed way and which have a set of weak features that are not useful as alignment points. It is therefore necessary to apply transformations in order to find a set of distinctive features to compute the offset values in a suitable way. The main issue that occurs with such a system is nonlinearity that does not allow the delay to be predicted by using a linear function. To solve this problem, the authors propose a set of long short-term memory (LSTM) layers to create a neural network model capable of learning such features transformations in a supervised approach, using a gradient-descent optimizer. This demonstrates the use of a recurrence matrix to extract timing information from a set of transformed features given by the neural network output. With this approach, the algorithm can classify up to 60% of a specific combination from the MedleyDB data set, and reduce the search space to five possibilities with accuracy up to 90% while keeping the precision of 10 ms. This performance is equal or better than state-of-the-art methods.

Author (s): Pereira, Igor; Distante, Cosimo; Silveira, Luiz F.; Gonçalves, Luiz
Affiliation: Institute of Applied Sciences and Intelligent Systems, Lecce, Italy; Federal University of Rio Grande do Norte, Natal, Brazil (See document for exact affiliation information.)
Publication Date: 2020-03-06 Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=20726

(246KB)

This paper costs $33 for non-members and is free for AES members and E-Libary subscribers.

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type: Journal Article
E-Libary location: (CD JAES68) TMP/JAES68/3/

Learn more about the AES E-Library

About AES

Code of Conduct

AES Conventions

AES Conferences

AES Training & Development

Gift Membership

AES Membership Benefits

Gift Membership

AES Membership Benefits

Become a Sustaining Member

AES Membership Benefits

AES Inside Track

Current Standards

Standards Blog

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

AES E-Library

Using Neural Networks to Compute Time Offsets from Musical Instruments

Choose your country of residence from this list:

AES E-Library

Login Institutions

Using Neural Networks to Compute Time Offsets from Musical Instruments

Choose your country of residence from this list: