Multi-Task Learning for Speech Enhancement and Phoneme Recognition

Simulate organization breadcrumb open Simulate organization breadcrumb close

Pattern Recognition Lab PRL

Type: MA thesis

Status: finished

Date: November 2, 2020 - April 30, 2021

Supervisors: Hendrik Schröter, Andreas Maier, Dr. Boppert

For speech intelligibility, consonants have a fundamental importance. Unfortunately, when reducing noise in speech, consonants are often also degraded while vocals are easier to preserve/enhance. To improve the detection and enhancement of consonants, we want to use multi-task learning to reduce the noise in the signal and furthermore detect phonemes (smallest acoustic unit in speech).