You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Instrument sound synthesis using deep neural networks has received numerous improvements over the last couple of years. Among them, the Differentiable Digital Signal Processing (DDSP) framework has modernized the spectral modeling paradigm by including signal-based synthesizers and effects into fully differentiable architectures. The present work extends the applications of DDSP to the task of polyphonic sound synthesis, with the proposal of a differentiable piano synthesizer conditioned on MIDI inputs. The model architecture is motivated by high-level acoustic modeling knowledge of the instrument, which, along with the sound structure priors inherent to the DDSP components, makes for a lightweight, interpretable, and realistic-sounding piano model. A subjective listening test has revealed that the proposed approach achieves better sound quality than a state-of-the-art neural-based piano synthesizer, but physical-modeling-based models still hold the best quality. Leveraging its interpretability and modularity, a qualitative analysis of the model behavior was also conducted: it highlights where additional modeling knowledge and optimization procedures could be inserted in order to improve the synthesis quality and the manipulation of sound properties. Eventually, the proposed differentiable synthesizer can be further used with other deep learning models for alternative musical tasks handling polyphonic audio and symbolic data.
Author (s): Renault, Lenny; Mignot, Rémi; Roebel, Axel
Affiliation:
STMS - UMR9912, IRCAM, Sorbonne Université, CNRS, Ministére de la Culture, Paris, France; STMS - UMR9912, IRCAM, Sorbonne Université, CNRS, Ministére de la Culture, Paris, France; STMS - UMR9912, IRCAM, Sorbonne Université, CNRS, Ministére de la Culture, Paris, France
(See document for exact affiliation information.)
Publication Date:
2023-09-06
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=22231
(774KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Renault, Lenny; Mignot, Rémi; Roebel, Axel; 2023; DDSP-Piano: A Neural Sound Synthesizer Informed by Instrument Knowledge [PDF]; STMS - UMR9912, IRCAM, Sorbonne Université, CNRS, Ministére de la Culture, Paris, France; STMS - UMR9912, IRCAM, Sorbonne Université, CNRS, Ministére de la Culture, Paris, France; STMS - UMR9912, IRCAM, Sorbonne Université, CNRS, Ministére de la Culture, Paris, France; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=22231
Renault, Lenny; Mignot, Rémi; Roebel, Axel; DDSP-Piano: A Neural Sound Synthesizer Informed by Instrument Knowledge [PDF]; STMS - UMR9912, IRCAM, Sorbonne Université, CNRS, Ministére de la Culture, Paris, France; STMS - UMR9912, IRCAM, Sorbonne Université, CNRS, Ministére de la Culture, Paris, France; STMS - UMR9912, IRCAM, Sorbonne Université, CNRS, Ministére de la Culture, Paris, France; Paper ; 2023 Available: https://aes2.org/publications/elibrary-page/?id=22231
@article{renault2023ddsp-piano:,
author={renault lenny and mignot rémi and roebel axel},
journal={journal of the audio engineering society},
title={ddsp-piano: a neural sound synthesizer informed by instrument knowledge},
year={2023},
volume={71},
issue={9},
pages={552-565},
month={september},}