You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Deep neural networks can be employed for estimating the direction of arrival (DOA) of individual sound sources from audio signals. Existing methods mostly focus on estimating the DOA of each source on individual frames, without utilizing the motion information of the sources. This paper proposes a method for estimating trajectories of sources, leveraging the differential of trajectories across different time scales. Additionally, a neural network is employed for enhancing the trajectories wrongly estimated especially for sound sources with low-energy. Experimental evaluations conducted on simulated dataset validate that the proposed method achieves more precise localization and tracking performance and encounters less interference when the sound source energy is low.
Author (s): Wu, Donghang; Wu, Xihong; Qu, Tianshu
Affiliation:
Peking University
(See document for exact affiliation information.)
AES Convention: 156
Paper Number:10687
Publication Date:
2024-06-06
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=22500
(10186KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Wu, Donghang; Wu, Xihong; Qu, Tianshu; 2024; Exploiting Motion information in Sound Source Localization and Tracking [PDF]; Peking University; Paper 10687; Available from: https://aes2.org/publications/elibrary-page/?id=22500
Wu, Donghang; Wu, Xihong; Qu, Tianshu; Exploiting Motion information in Sound Source Localization and Tracking [PDF]; Peking University; Paper 10687; 2024 Available: https://aes2.org/publications/elibrary-page/?id=22500