Semantic Analysis and Deep Learning

Home / Audio Topics / Semantic Analysis and Deep Learning

Semantic Analysis and Deep Learning

With the omnipresence of digital multimedia data, the processing, analysis, and understanding of such data by means of automated methods has become a central issue in engineering and computer science.

Semantic audio is concerned with:

- analysing audio signals in order to infer semantically meaningful information that can be understood by humans
- decomposing audio signals into semantic entities in order to enable facilitated handling, modification and interaction with these audio objects in an intuitive way
- enabling a machine to process audio signal as human experts could do (a least for the simple and boring tasks)

Such methods are relevant for the following applications:

analysing music for automated recommendation services
automatic transcription, score following and source separation for personalised sound and interactive music education
managing large amounts of data in audio editing and production
new consumer applications including DJ, karaoke, and dialog enhancement software

Deep Learning is also omnipresent. It is a branch of machine learning that in recent years gave rise to developments that outperformed their predecessors by large margins. This happened in computer vision and natural language processing and then also in digital speech and audio signal processing, e.g. in speech recognition, speech synthesis, speech enhancement, dereverberation and blind source separation.

Papers

Feature Articles

How I Think about Intelligent Production Tools, Bryan Pardo (Keynote lecture for the 2nd AES Workshop on Intelligent Music Production at the Centre for Digital Music at Queen Mary University of London on 13th September 2016.)

Papers

Feature Articles

How I Think about Intelligent Production Tools, Bryan Pardo (Keynote lecture for the 2nd AES Workshop on Intelligent Music Production at the Centre for Digital Music at Queen Mary University of London on 13th September 2016.)

Upcoming event on

Semantic Analysis and Deep Learning

Curators

Christian Uhle is chief scientist in the Audio division of the Fraunhofer Institute for Integrated Circuits IIS. He received the Dipl.-Ing. and PhD degrees from the Technical University of Ilmenau, Germany, in 1997 and 2008, respectively. His research activities comprise automotive sound reproduction, semantic audio processing, blind source separation, dialog enhancement, digital audio effects and natural language understanding with neural networks. He is a member of the AES and chairs the AES Technical Committee on Semantic Audio Analysis.

Semantic Analysis and Deep Learning

Technical Committee

Chair:

Christian Uhle

Chair:

Vice Chair:

Bryan Pardo

This committee provides a forum in the AES for researchers and innovators to come together and discuss and disseminate the newest developments related to Semantic Audio. We are interested in how to extract meaning from audio signals, how to represent it and how to use it. This includes developments in Music Information Retrieval, Intelligent Audio Editing, Semantic Audio Processing and Semantic Web for Music. We organize AES International Conferences on Semantic Audio and tutorials and workshops for AES Conventions. If you are interested in our work or if you would like to propose a workshop or tutorial at a future Convention, please you attend a meeting of the TC SAA at an AES Convention.

About AES

Code of Conduct

AES Conventions

AES Conferences

AES Training & Development

Gift Membership

AES Membership Benefits

Gift Membership

AES Membership Benefits

Become a Sustaining Member

AES Membership Benefits

AES Inside Track

Current Standards

Standards Blog

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity