Speech Processing and Understanding
Projects
DysarTrain: Development of a digital therapy tool as an exercise supplement for speech disorders and facial paralysis
Dysarthrien sind neurologisch bedingte, erworbene Störungen des Sprechens. Dabei sind vor allem die Koordination und Ausführung der Sprechbewegungen, aber auch die Mimik betroffen. Besonders häufig tritt eine Dysarthrie nach einem Schlaganfall, Schädel-Hirn-Trauma oder bei neurologischen Erkrankungen wie Parkinson auf.
Ähnlich wie in allen Sprechtherapien erfordert auch die Behandlung der Dysarthrie ein intensives Training. Anhaltende Effekte der Dysarthrie-Therapie stellen sich deshalb nur …
Modelling the progression of neurological diseases
Develop speech technology that can allow unobtrusive monitoring of many kinds of neurological diseases. The state of a patient can degrade slowly between medical check-ups. We want to track the state of a patient unobtrusively without the feeling of constant supervision. At the same time the privacy of the patient has to be respected. We will concentrate on PD and thus on acoustic cues of changes. The algorithms should run on a smartphone, track acoustic changes during regular phone conversations…
TAPAS: Training Network on Automatic Processing of PAthological Speech
There are an increasing number of people across Europe with debilitating
speech pathologies (e.g., due to stroke, Parkinson's, etc). These
groups face communication problems that can lead to social exclusion.
They are now being further marginalised by a new wave of speech
technology that is increasingly woven into everyday life but which is
not robust to atypical speech. TAPAS is a Horizon 2020 Marie
Skłodowska-Curie Actions Innovative Training Network European Training
Network (MSCA-ITN-ETN) project that aims to transform the well being of
these people.
TAPAS adopts an inter-disciplinary and
multi-sectorial approach. The consortium includes clinical
practitioners, academic researchers and industrial partners, with
expertise spanning speech engineering, linguistics and clinical science.
All members have expertise in some element of pathological speech. This
rich network will train a new generation of 15 researchers, equipping
them with the skills and resources necessary for lasting success.
DeepAL: Deep Learning Applied to Animal Linguistics
Deep Learning applied to animal linguistics in particular the analysis of underwater audio recordings of marine animals (killer whales):
The project includes the automatic segmentation of killer whale signals in noise-heavy and large underwater bioacoustic archives as well as a subsequent call type identification/classification in order to derive linguistic elements/patterns. In combination with the recorded situational video footage those patterns should help to decode the killer whale language.
Deep Learning based Noise Reduction for Hearing Aids
Reduction of unwanted environmental noises is an important feature of today’s hearing aids, which is why noise reduction is nowadays included in almost every commercially available device. The majority of these algorithms, however, is restricted to the reduction of stationary noises. Due to the large number of different background noises in daily situations, it is hard to heuristically cover the complete solution space of noise reduction schemes. Deep learning-based algorithms pose a possible so…
Participating Scientists
Prof. Dr.-Ing. Elmar Nöth
- Phone number: +49 9131 85-27888
- Email: elmar.noeth@fau.de
- Website: https://lme.tf.fau.de/person/noeth/
Philipp Klumpp, M. Sc.
- Phone number: +4991318527137
- Email: philipp.klumpp@fau.de
Colloquium time table
Publications
Apkinson: A mobile solution for multimodal assessment of patients with Parkinson's disease
20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019 (Graz, September 15, 2019 - September 19, 2019)
In: Gernot Kubin, Thomas Hain, Bjorn Schuller, Dina El Zarka, Petra Hodl (ed.): Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019
DOI: 10.21437/Interspeech.2019-8003
URL: https://www.isca-speech.org/archive/Interspeech_2019/abstracts/8003.html
BibTeX: Download
, , , , , , , , , , , , , , :
Unobtrusive Monitoring of Speech Impairments of Parkinson's Disease Patients Through Mobile Devices
2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018
DOI: 10.1109/ICASSP.2018.8462332
BibTeX: Download
, , , , :
Apkinson — A Mobile Monitoring Solution for Parkinson's Disease
Interspeech 2017
In: Interspeech 2017 2017
URL: https://www.researchgate.net/profile/Juan_Vasquez12/publication/319185470_Apkinson_-_A_Mobile_Monitoring_Solution_for_Parkinson's_Disease/links/59b9fecfa6fdcc68723177dc/Apkinson-A-Mobile-Monitoring-Solution-for-Parkinsons-Disease.pdf
BibTeX: Download
, , , , , :
Phonological posteriors and GRU recurrent units to assess speech impairments of patients with Parkinson’s disease
In: Prof. Dr. Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Text, Speech, and Dialogue, Springer Interational, 2018, p. 453-461 (Lecture Notes in Computer Science, Vol.11107)
ISBN: 978-3-030-00794-2
DOI: 10.1007/978-3-030-00794-2_49
URL: https://link.springer.com/chapter/10.1007/978-3-030-00794-2_49
BibTeX: Download
, , , , :
Word accuracy and dynamic time warping to assess intelligibility deficits in patients with Parkinsons disease
21st Symposium on Signal Processing, Images and Artificial Vision, STSIVA 2016
DOI: 10.1109/STSIVA.2016.7743349
BibTeX: Download
, , :
Next Events:
Contact
Christian Bergler, M. Eng.
91058 Erlangen
- Phone number: +49 9131 85-27872
- Email: christian.bergler@fau.de
- Website: https://lme.tf.fau.de/person/bergler/