Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Automatic content classification technique is an essential tool in multimedia applications. Present research for audio-based classifiers look at short- and long-term analysis of signals, using both temporal and spectral features. In this paper we present a neural network to classify between the movie (cinematic, TV shows), music, and voice using metadata contained in either the audio/video stream. Towards this end, statistical models of the various metadata are created since a large metadata dataset is not available. Subsequently, synthetic metadata are generated from these statistical models, and the synthetic metadata is input to the ML classifier as feature vectors. The resulting classifier is then able to classify real-world content (e.g., YouTube) with an accuracy ˜ 90% with very low latency (viz., ˜ on an average 7 ms) based on real-world metadata.
Author (s): Bharitkar, Sunil G.
Affiliation:
HP Labs., Inc., San Francisco, CA, USA
(See document for exact affiliation information.)
AES Convention: 147
Paper Number:564
Publication Date:
2019-10-06
Import into BibTeX
Session subject:
Applications in Audio
Permalink: https://aes2.org/publications/elibrary-page/?id=20587
(1863KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Bharitkar, Sunil G.; 2019; Generative Modeling of Metadata for Machine Learning Based Audio Content Classification [PDF]; HP Labs., Inc., San Francisco, CA, USA; Paper 564; Available from: https://aes2.org/publications/elibrary-page/?id=20587
Bharitkar, Sunil G.; Generative Modeling of Metadata for Machine Learning Based Audio Content Classification [PDF]; HP Labs., Inc., San Francisco, CA, USA; Paper 564; 2019 Available: https://aes2.org/publications/elibrary-page/?id=20587