You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Because the spectral envelope of a sound is a crucial aspect of timbre perception, the authors propose a quantitative model of spectral envelope perception using a set of orthogonal basis functions, analogous to the three primary colors in vision. The goal is find a quantitative mapping between the physical description of the spectral envelope and its perception. This allows for a meaningful and reliable way of controlling timbre in sonification. This paper presents a quantitative metric to describe the multidimensionality of spectral envelope perception, i.e., the perception that is specifically related to the spectral element of timbre. Mel-frequency cepstral coefficients (MFCC) were chosen as a metric for spectral envelope perception because of their linearity, orthogonality, and multidimensionality. Quantitative data from two experiments illustrate the linear relationship between the subjective perception of spectrally-varied synthetic sounds and the MFCC.
Author (s): Terasawa, Hiroko; Berger, Jonathan; Makino, Shoji
Affiliation:
Life Science Center of TARA, University of Tsukuba, Tsukuba, Ibaraki, Japan; JST, PRESTO (Information Science and Humans), Chiyoda-ku, Tokyo, Japan; CCRMA, Department of Music, Stanford University, Stanford, CA, USA
(See document for exact affiliation information.)
Publication Date:
2012-09-06
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=16372
(520KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Terasawa, Hiroko; Berger, Jonathan; Makino, Shoji; 2012; In Search of a Perceptual Metric for Timbre: Dissimilarity Judgments among Synthetic Sounds with MFCC-Derived Spectral Envelopes [PDF]; Life Science Center of TARA, University of Tsukuba, Tsukuba, Ibaraki, Japan; JST, PRESTO (Information Science and Humans), Chiyoda-ku, Tokyo, Japan; CCRMA, Department of Music, Stanford University, Stanford, CA, USA; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=16372
Terasawa, Hiroko; Berger, Jonathan; Makino, Shoji; In Search of a Perceptual Metric for Timbre: Dissimilarity Judgments among Synthetic Sounds with MFCC-Derived Spectral Envelopes [PDF]; Life Science Center of TARA, University of Tsukuba, Tsukuba, Ibaraki, Japan; JST, PRESTO (Information Science and Humans), Chiyoda-ku, Tokyo, Japan; CCRMA, Department of Music, Stanford University, Stanford, CA, USA; Paper ; 2012 Available: https://aes2.org/publications/elibrary-page/?id=16372
@article{terasawa2012in,
author={terasawa hiroko and berger jonathan and makino shoji},
journal={journal of the audio engineering society},
title={in search of a perceptual metric for timbre: dissimilarity judgments among synthetic sounds with mfcc-derived spectral envelopes},
year={2012},
volume={60},
issue={9},
pages={674-685},
month={october},}
TY – paper
TI – In Search of a Perceptual Metric for Timbre: Dissimilarity Judgments among Synthetic Sounds with MFCC-Derived Spectral Envelopes
SP – 674 EP – 685
AU – Terasawa, Hiroko
AU – Berger, Jonathan
AU – Makino, Shoji
PY – 2012
JO – Journal of the Audio Engineering Society
VO – 60
IS – 9
Y1 – October 2012