You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
In this work a software framework based on client-server architecture is implemented for real time intelligent audio coding. A speech/music discrimination scheme analyzes the input audio signal and takes a decision about the nature of the audio signal (speech or music) on a frame by frame basis. According to the decision of the speech/music discriminator, a suitable coder is selected at each frame. The designed software framework makes use of the speech and audio coders incorporated into the MPEG4 audio standard (HVXC or CELP for speech frames and TwinVQ or AAC for music frames) to evaluate the performance of an intelligent multi-mode audio coder. The framework supports several types of audio features (timbral texture features and rhythmic content features) and classifiers (classical Statistical Pattern Recognition (SPR) classifiers, Multilayer Perceptron Neural Networks (MLPNN), Support Vector Machines (SVM), Fuzzy Expert Systems (FES), Hidden Markov Models (HMM)). Comparison between a speech/music discrimination based-intelligent audio coder and MPEG4-AAC has been performed using audio signals representative of the two corresponding classes (speech and music). Subjective and objective tests have been accomplished aiming at assessing the behaviour of the intelligent audio coding scheme.
Author (s): Garcia Galan, Sebastian; Muñoz Exposito, Jose Enrique; Ruiz Reyes, Nicolas; Vera Candeas, Pedro
Affiliation:
University of Jaen
(See document for exact affiliation information.)
AES Convention: 122
Paper Number:7005
Publication Date:
2007-05-06
Import into BibTeX
Session subject:
Low Bit-Rate Audio Coding
Permalink: https://aes2.org/publications/elibrary-page/?id=13990
(759KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Garcia Galan, Sebastian; Muñoz Exposito, Jose Enrique; Ruiz Reyes, Nicolas; Vera Candeas, Pedro; 2007; Design and Implementation of a Web-Based Software Framework for Real Time Intelligent Audio Coding Based on Speech/Music Discrimination [PDF]; University of Jaen; Paper 7005; Available from: https://aes2.org/publications/elibrary-page/?id=13990
Garcia Galan, Sebastian; Muñoz Exposito, Jose Enrique; Ruiz Reyes, Nicolas; Vera Candeas, Pedro; Design and Implementation of a Web-Based Software Framework for Real Time Intelligent Audio Coding Based on Speech/Music Discrimination [PDF]; University of Jaen; Paper 7005; 2007 Available: https://aes2.org/publications/elibrary-page/?id=13990
@article{garcia2007design,
author={garcia galan sebastian and muñoz exposito jose enrique and ruiz reyes nicolas and vera candeas pedro},
journal={journal of the audio engineering society},
title={design and implementation of a web-based software framework for real time intelligent audio coding based on speech/music discrimination},
year={2007},
number={7005},
month={may},}
TY – paper
TI – Design and Implementation of a Web-Based Software Framework for Real Time Intelligent Audio Coding Based on Speech/Music Discrimination
AU – Garcia Galan, Sebastian
AU – Muñoz Exposito, Jose Enrique
AU – Ruiz Reyes, Nicolas
AU – Vera Candeas, Pedro
PY – 2007
JO – Journal of the Audio Engineering Society
VL – 7005
Y1 – May 2007