An audio processor that integrates intelligent classification and preprocessing algorithms is presented. Audio features in the time and frequency domains are extracted and processed prior to classification. Classification algorithms, including the nearest neighbor rule (NNR), artificial neural networks (ANN), fuzzy neural networks (FNN), and hidden Markov models (HMM), are used to classify and identify singers and musical instruments. A training phase is required to establish a feature space template, followed by a test phase in which the audio features of the test data are calculated and matched to the feature space template. In addition to audio classification, the proposed system provides several independent component analysis (ICA)-based preprocessing functions for blind source separation, voice removal, and noise reduction. The proposed techniques were applied to process various kinds of audio program materials. The test results reveal that the performance of the methods is satisfactory, but varies slightly with the algorithm and program materials used in the tests.
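The two-phase workflow described in the abstract (build a feature space template from training data, then match the features of test data against it) can be illustrated with the nearest neighbor rule. The following is only a minimal sketch under stated assumptions, not the authors' implementation: the two time-domain features (zero-crossing rate and RMS level), the synthetic sine-tone signals, and all function names are hypothetical choices made for illustration.

```python
import math

def zero_crossing_rate(signal):
    """Fraction of adjacent sample pairs whose signs differ (time-domain feature)."""
    crossings = sum(1 for a, b in zip(signal, signal[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(signal) - 1)

def rms(signal):
    """Root-mean-square level of the signal (time-domain feature)."""
    return math.sqrt(sum(x * x for x in signal) / len(signal))

def extract_features(signal):
    # Illustrative feature vector; the paper's actual feature set is not given here.
    return (zero_crossing_rate(signal), rms(signal))

def train(labelled_signals):
    """Training phase: store one feature vector per labelled example (the template)."""
    return [(extract_features(sig), label) for sig, label in labelled_signals]

def classify(template, signal):
    """Test phase: return the label of the closest stored feature vector."""
    feats = extract_features(signal)
    _, label = min(template, key=lambda entry: math.dist(entry[0], feats))
    return label

def tone(freq_hz, n=800, rate=8000):
    """Synthetic sine tone used as stand-in audio (assumption, not real program material)."""
    return [math.sin(2 * math.pi * freq_hz * i / rate) for i in range(n)]

template = train([(tone(100), "low"), (tone(1500), "high")])
print(classify(template, tone(120)))
print(classify(template, tone(1800)))
```

In this toy setup the unseen 120 Hz tone falls nearest the "low" template entry and the 1800 Hz tone nearest the "high" one, because their zero-crossing rates are closest to those of the respective training tones. A real system would use richer time- and frequency-domain features and many training examples per class.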
Author(s): Bai, Mingsian R.; Chen, Meng-chun
Affiliation:
Department of Mechanical Engineering, National Chiao-Tung University, Hsin-Chu, Taiwan
(See document for exact affiliation information.)
Publication Date:
2007-05-06
Permalink: https://aes2.org/publications/elibrary-page/?id=14164
Bai, Mingsian R.; Chen, Meng-chun; 2007; Intelligent Preprocessing and Classification of Audio Signals [PDF]; Department of Mechanical Engineering, National Chiao-Tung University, Hsin-Chu, Taiwan; Paper; Available from: https://aes2.org/publications/elibrary-page/?id=14164
Bai, Mingsian R.; Chen, Meng-chun; Intelligent Preprocessing and Classification of Audio Signals [PDF]; Department of Mechanical Engineering, National Chiao-Tung University, Hsin-Chu, Taiwan; Paper; 2007; Available: https://aes2.org/publications/elibrary-page/?id=14164