You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
The end-to-end framework has been introduced into the binaural localization modeling and achieved higher localization accuracy than the other models, however, the reasonability and interpretability for applying the related neural networks remain unclear. It has been well documented that the auditory system relies on binaural cues for sound localization, and the equalization and cancellation (EC) theory describes how the binaural cues are extracted. In this paper, an end-to-end binaural localization model is proposed based on the EC theory. In the proposed model, a convolution neural network(CNN) with a specifically designed activation function is used to implement the EC theory. The proposed model was trained in synthesized rooms and evaluated in real rooms. Experiment results show that CNN kernels learned by the proposed model are corresponding to binaural cues, and the proposed model outperforms the current end-to-end model by a 10.73% improvement in localization accuracy and a 12.91%improvement in root mean square error(RMSE).
Author (s): Song, Tao; Zhang, Wenwen; Chen, Jing
Affiliation:
Peking University; Beijing University of Posts and Telecommunications
(See document for exact affiliation information.)
AES Convention: 152
Paper Number:10576
Publication Date:
2022-05-06
Import into BibTeX
Session subject:
Binaural Audio
Permalink: https://aes2.org/publications/elibrary-page/?id=21689
(438KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Song, Tao; Zhang, Wenwen; Chen, Jing; 2022; An End-to-End Binaural Sound Localization Model Based on the Equalization and Cancellation Theory [PDF]; Peking University; Beijing University of Posts and Telecommunications; Paper 10576; Available from: https://aes2.org/publications/elibrary-page/?id=21689
Song, Tao; Zhang, Wenwen; Chen, Jing; An End-to-End Binaural Sound Localization Model Based on the Equalization and Cancellation Theory [PDF]; Peking University; Beijing University of Posts and Telecommunications; Paper 10576; 2022 Available: https://aes2.org/publications/elibrary-page/?id=21689