UnivIS
Informationssystem der Friedrich-Alexander-Universität Erlangen-Nürnberg © Config eG 
FAU Logo
  Sammlung/Stundenplan    Modulbelegung Home  |  Rechtliches  |  Kontakt  |  Hilfe    
Suche:      Semester:   
 
 Darstellung
 
Druckansicht

 
 
 Außerdem im UnivIS
 
Vorlesungs- und Modulverzeichnis nach Studiengängen

Vorlesungsverzeichnis

 
 
Veranstaltungskalender

Stellenangebote

Möbel-/Rechnerbörse

 
 

  Selected Topics in Deep Learning for Audio, Speech, and Music Processing (DLA)

Dozentinnen/Dozenten
Prof. Dr. ir. Emanuël A. P. Habets, Prof. Dr. Meinard Müller

Angaben
Vorlesung
Online
2 SWS, ECTS-Studium, ECTS-Credits: 2,5, Sprache Englisch
Zeit: Mo 16:00 - 18:00, Zoom-Meeting, 3R4.04

Studienfächer / Studienrichtungen
WF EEI-MA ab 1
WF INF-MA ab 1
WF CME-MA ab 1
WF ASC-MA ab 1
WF ICT-MA ab 1

Voraussetzungen / Organisatorisches
In this course, we require a good knowledge of deep learning techniques, machine learning, and pattern recognition as well as a strong mathematical background. Furthermore, we require a solid background in general digital signal processing and some experience with audio, image, or video processing.

It is recommended to finish the following modules (or having equivalent knowledge) before starting this module:

  • Lecture Deep Learning

  • Digitale Signalverarbeitung

  • Statistische Signalverarbeitung

  • Sprach- und Audiosignalverarbeitung

Inhalt
Many recent advances in audio, speech, and music processing have been driven by techniques based on deep learning (DL). For example, DL-based techniques have led to significant improvements in, for example, speaker separation, speech synthesis, acoustic scene analysis, audio retrieval, chord recognition, melody estimation, and beat tracking. Considering specific audio, speech, and music processing tasks, we study various DL-based approaches and their capability to extract complex features and make predictions based on hidden structures and relations. Rather than giving a comprehensive overview, we will study selected and generally applicable DL-based techniques. Furthermore, in the context of challenging application scenarios, we will critically review the potential and limitations of recent deep learning techniques. As one main general objective of the lecture, we want to discuss how you can integrate domain knowledge into neural network architectures to obtain explainable models that are less vulnerable to data biases and confounding factors.

The course consists of two overview-like lectures, where we introduce current research problems in audio, speech, and music processing. We will then continue with 6 to 8 lectures on selected audio processing topics and DL-based techniques. Being based on articles from the research literature, we will provide detailed explanations covered in mathematical depth; we may also try to attract some of the original authors to serve as guest lecturers. Finally, we round off the course by a concluding lecture covering practical aspects (e.g., hardware, software, version control, reproducibility, datasets) that are relevant when working with DL-based techniques.

ECTS-Informationen:
Title:
Selected Topics of Deep Learning for Audio, Speech, and Music Processing

Credits: 2,5

Prerequisites
In this course, we require a good knowledge of deep learning techniques, machine learning, and pattern recognition as well as a strong mathematical background. Furthermore, we require a solid background in general digital signal processing and some experience with audio, image, or video processing.

It is recommended to finish the following modules (or having equivalent knowledge) before starting this module:

  • Lecture Deep Learning

  • Digitale Signalverarbeitung

  • Statistische Signalverarbeitung

  • Sprach- und Audiosignalverarbeitung

Contents
Many recent advances in audio, speech, and music processing have been driven by techniques based on deep learning (DL). For example, DL-based techniques have led to significant improvements in, for example, speaker separation, speech synthesis, acoustic scene analysis, audio retrieval, chord recognition, melody estimation, and beat tracking. Considering specific audio, speech, and music processing tasks, we study various DL-based approaches and their capability to extract complex features and make predictions based on hidden structures and relations. Rather than giving a comprehensive overview, we will study selected and generally applicable DL-based techniques. Furthermore, in the context of challenging application scenarios, we will critically review the potential and limitations of recent deep learning techniques. As one main general objective of the lecture, we want to discuss how you can integrate domain knowledge into neural network architectures to obtain explainable models that are less vulnerable to data biases and confounding factors.

The course consists of two overview-like lectures, where we introduce current research problems in audio, speech, and music processing. We will then continue with 6 to 8 lectures on selected audio processing topics and DL-based techniques. Being based on articles from the research literature, we will provide detailed explanations covered in mathematical depth; we may also try to attract some of the original authors to serve as guest lecturers. Finally, we round off the course by a concluding lecture covering practical aspects (e.g., hardware, software, version control, reproducibility, datasets) that are relevant when working with DL-based techniques.

Zusätzliche Informationen
Schlagwörter: AudioLabs Deep Learning Audio Speech Music
Erwartete Teilnehmerzahl: 10
www: https://www.audiolabs-erlangen.de/fau/professor/mueller/teaching/2021s_dla
Für diese Lehrveranstaltung ist eine Anmeldung erforderlich.
Die Anmeldung erfolgt von Montag, 22.2.2021, 14:55 Uhr bis Montag, 19.4.2021, 15:00 Uhr über: StudOn.

Verwendung in folgenden UnivIS-Modulen
Startsemester SS 2021:
Selected Topics of Deep Learning for Audio, Speech, and Music Processing (DLA)

Institution: International Audio Laboratories Erlangen (AudioLabs)
UnivIS ist ein Produkt der Config eG, Buckenhof