A novel dictionary learning approach that uses Mel-scale frequency warping to detect overlapping acoustic events is proposed. The study explored several dictionary learning schemes to improve the detection of overlapping acoustic events. The nonnegative matrix factorization (NMF) framework was used to calculate the gain of each event in an overlapped signal because of its low computational load. In this paper, we propose frequency warping for better sound representation and apply dictionary learning with a holistic-based representation, namely nonnegative K-SVD (NK-SVD), to resolve the basis-sharing problem raised by part-based representations. By using the Mel scale in dictionary learning, we show that low-frequency components carry more information than high-frequency components and that the signal can be processed with low complexity. The proposed holistic-based representation also avoids the permutation problem among different acoustic events. Based on these benefits, we confirm that the proposed combination of Mel-scale warping and NK-SVD delivers significantly better results than conventional methods.
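The two building blocks named in the abstract, Mel-scale frequency warping and NMF-style gain estimation against a fixed dictionary, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the triangular filter shape, the Euclidean multiplicative update, and all parameter values are assumptions chosen for illustration, and the NK-SVD dictionary learning step itself is not shown (the dictionary `W` is simply taken as given).

```python
import numpy as np

def hz_to_mel(f):
    # Common Mel-scale formula (O'Shaughnessy variant).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft, sr):
    """Triangular Mel filters, shape (n_mels, n_fft // 2 + 1). Hypothetical helper."""
    n_bins = n_fft // 2 + 1
    fft_freqs = np.linspace(0.0, sr / 2.0, n_bins)
    # n_mels + 2 edge frequencies, equally spaced on the Mel scale.
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sr / 2.0), n_mels + 2))
    fb = np.zeros((n_mels, n_bins))
    for m in range(n_mels):
        lo, mid, hi = edges[m], edges[m + 1], edges[m + 2]
        rising = (fft_freqs - lo) / (mid - lo)
        falling = (hi - fft_freqs) / (hi - mid)
        fb[m] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def estimate_gains(V, W, n_iter=200, eps=1e-12):
    """Nonnegative gains H such that V ~= W @ H, with the dictionary W held fixed.

    Uses the multiplicative update for the Euclidean NMF cost; this choice of
    cost function is an assumption, not taken from the paper.
    """
    rng = np.random.default_rng(0)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    WtV = W.T @ V
    WtW = W.T @ W
    for _ in range(n_iter):
        H *= WtV / (WtW @ H + eps)
    return H
```

In use, a magnitude spectrogram `S` of shape `(n_fft // 2 + 1, n_frames)` would be warped with `V = mel_filterbank(40, 512, 16000) @ S`, and `estimate_gains(V, W)` would then return one gain row per dictionary atom. Because the filters are narrow at low frequencies and wide at high frequencies, the warped representation keeps more resolution where the abstract says most of the information lies, while shrinking the feature dimension.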
Author(s): Choi, Hyeonsik; Lee, Keunsang; Keum, Minseok; Han, David; Ko, Hanseok
Affiliation: LG Electronics, South Korea; SELVAS AI, South Korea; Army Research Laboratory, Adelphi, MD, USA; Korea University, Seoul, South Korea
(See document for exact affiliation information.)
AES Convention: 149
Paper Number: 10395
Publication Date: 2020-10-06
Session subject: Audio Processing
Permalink: https://aes2.org/publications/elibrary-page/?id=20932
Choi, Hyeonsik; Lee, Keunsang; Keum, Minseok; Han, David; Ko, Hanseok; 2020; Overlapping Acoustic Event Detection via Perceptually Inspired the Holistic-based Representation Method [PDF]; LG Electronics, South Korea; SELVAS AI, South Korea; Army Research Laboratory, Adelphi, MD, USA; Korea University, Seoul, South Korea; Paper 10395; Available from: https://aes2.org/publications/elibrary-page/?id=20932
Choi, Hyeonsik; Lee, Keunsang; Keum, Minseok; Han, David; Ko, Hanseok; Overlapping Acoustic Event Detection via Perceptually Inspired the Holistic-based Representation Method [PDF]; LG Electronics, South Korea; SELVAS AI, South Korea; Army Research Laboratory, Adelphi, MD, USA; Korea University, Seoul, South Korea; Paper 10395; 2020. Available: https://aes2.org/publications/elibrary-page/?id=20932
@article{choi2020overlapping,
  author={Choi, Hyeonsik and Lee, Keunsang and Keum, Minseok and Han, David and Ko, Hanseok},
  journal={Journal of the Audio Engineering Society},
  title={Overlapping Acoustic Event Detection via Perceptually Inspired the Holistic-based Representation Method},
  year={2020},
  number={10395},
  month={october},
}