Semantic Analysis and Deep Learning

With the omnipresence of digital multimedia data, the processing, analysis, and understanding of such data by means of automated methods has become a central issue in engineering and computer science.

Semantic audio is concerned with:

    • analysing audio signals in order to infer semantically meaningful information that can be understood by humans
    • decomposing audio signals into semantic entities in order to enable facilitated handling, modification and interaction with these audio objects in an intuitive way
    • enabling a machine to process audio signal as human experts could do (a least for the simple and boring tasks)

Such methods are relevant for the following applications:

  • analysing music for automated recommendation services
  • automatic transcription, score following and source separation for personalised sound and interactive music education
  • managing large amounts of data in audio editing and production
  • new consumer applications including DJ, karaoke, and dialog enhancement software

Deep Learning is also omnipresent. It is a branch of machine learning that in recent years gave rise to developments that outperformed their predecessors by large margins. This happened in computer vision and natural language processing and then also in digital speech and audio signal processing, e.g. in speech recognition, speech synthesis, speech enhancement, dereverberation and blind source separation.

Upcoming event on

Semantic Analysis and Deep Learning

Curators

Christian Uhle is chief scientist in the Audio division of the Fraunhofer Institute for Integrated Circuits IIS. He received the Dipl.-Ing. and PhD degrees from the Technical University of Ilmenau, Germany, in 1997 and 2008, respectively. His research activities comprise automotive sound reproduction, semantic audio processing, blind source separation, dialog enhancement, digital audio effects and natural language understanding with neural networks. He is a member of the AES and chairs the AES Technical Committee on Semantic Audio Analysis.
Semantic Analysis and Deep Learning
Technical Committee

Chair:

Christian Uhle

Chair:

Vice Chair: 

Bryan Pardo
This committee provides a forum in the AES for researchers and innovators to come together and discuss and disseminate the newest developments related to Semantic Audio. We are interested in how to extract meaning from audio signals, how to represent it and how to use it. This includes developments in Music Information Retrieval, Intelligent Audio Editing, Semantic Audio Processing and Semantic Web for Music. We organize AES International Conferences on Semantic Audio and tutorials and workshops for AES Conventions. If you are interested in our work or if you would like to propose a workshop or tutorial at a future Convention, please you attend a meeting of the TC SAA at an AES Convention.
Choose your country of residence from this list: