Dr.-Ing. Tomás Arias Vergara

Dr.-Ing. Tomás Arias Vergara, M. Sc.

Lehrstuhl für Informatik 5 (Mustererkennung)
Chair of Computer Science 5 (Pattern Recognition)

Room 10.134
Martensstr. 3
91058 Erlangen

Phone number: +49 9131 85-27872
Fax number: +49 9131 85-27270
Email: tomas.arias@fau.de
Website: https://lme.tf.fau.de/person/arias/

I received a B.S. in Electronics Engineering from the University of Antioquia (UdeA, Colombia) in 2014, a Master of Science degree from the same institution in 2017, and a Ph.D. in a joint program between the UdeA and the FAU in 2022. Since 2015, my research has focused on speech processing and machine learning methods for the analysis of pathological speech signals resulting from neurological (e.g., Parkinson’s disease), structural (e.g., children with cleft lip and palate), and perceptual (e.g., hearing loss) disorders. I have also investigated the effect of the natural aging process on speech, participated in developing Android-based applications for collecting and analyzing data from Parkinson’s disease patients and adults/children with hearing loss, and performed research on automatic methods for the analysis of high-speed videoendoscopy data of people with voice disorders.

Academic CV

https://tariasvergara.github.io/CV_Tomas_AriasVergara.pdf

Projects

2024

A multimodal approach for automatic generation of radiology reports using chest X-ray images, clinical free-text, and spoken commands.

(FAU Funds)

Term: January 15, 2024 - January 14, 2025

Abstract

Advancements in Artificial Intelligence (AI) methods have enabled the development of Large Language Models (LLMs) capable of generating information from user instructions and supporting various tasks in education, research, healthcare, and others. AI has also impacted the field of medical imaging with several deep learning models capable of achieving expert-level performance across different tasks, e.g., detection, segmentation, and assisted clinical diagnosis. In addition, open-source Automatic Speech Recognition (ASR) systems can be incorporated as modules in AI-based systems. This proposed funded project aims to combine LLMs, medical imaging, and speech recognition using AI methods to generate high-quality radiology reports from chest X-ray images.


2017

Training Network on Automatic Processing of PAthological Speech

(Third Party Funds Group – Overall project)

Term: November 1, 2017 - October 31, 2021
Funding source: Innovative Training Networks (ITN)
URL: https://www.tapas-etn-eu.org/

Abstract

There are an increasing number of people across Europe with debilitating speech pathologies (e.g., due to stroke, Parkinson's, etc.). These groups face communication problems that can lead to social exclusion. They are now being further marginalised by a new wave of speech technology that is increasingly woven into everyday life but which is not robust to atypical speech. TAPAS is a Horizon 2020 Marie Skłodowska-Curie Actions Innovative Training Network European Training Network (MSCA-ITN-ETN) project that aims to transform the well-being of these people.
The TAPAS work programme targets three key research problems:
(a) Detection: We will develop speech processing techniques for early detection of conditions that impact on speech production. The outcomes will be cheap and non-invasive diagnostic tools that provide early warning of the onset of progressive conditions such as Alzheimer's and Parkinson's.
(b) Therapy: We will use newly-emerging speech processing techniques to produce automated speech therapy tools. These tools will make therapy more accessible and more individually targeted. Better therapy can increase the chances of recovering intelligible speech after traumatic events such as a stroke or oral surgery.
(c) Assisted Living: We will re-design current speech technology so that it works well for people with speech impairments and also helps in making informed clinical choices. People with speech impairments often have other co-occurring conditions making them reliant on carers. Speech-driven tools for assisted-living are a way to allow such people to live more independently.
TAPAS adopts an inter-disciplinary and multi-sectorial approach. The consortium includes clinical practitioners, academic researchers and industrial partners, with expertise spanning speech engineering, linguistics and clinical science. All members have expertise in some element of pathological speech. This rich network will train a new generation of 15 researchers, equipping them with the skills and resources necessary for lasting success.


Publications

2024

Journal Articles

Schraut T., Schützenberger A., Arias Vergara T., Kunduk M., Echternach M., Döllinger M.:
Machine learning based estimation of hoarseness severity using sustained vowels
In: Journal of the Acoustical Society of America 155 (2024), p. 381-395
ISSN: 0001-4966
DOI: 10.1121/10.0024341
BibTeX: Download
Chacon AM., Nguyen DD., Holik J., Döllinger M., Arias Vergara T., Madill CJ.:
Vowel onset measures and their reliability, sensitivity and specificity: A systematic literature review
In: PLoS ONE 19 (2024), p. e0301786
ISSN: 1932-6203
DOI: 10.1371/journal.pone.0301786
BibTeX: Download

Conference Contributions

Vysotskaya N., Maul N., Fusco A., Hazra S., Harnisch J., Arias Vergara T., Maier A.:
Transforming Cardiovascular Health: a Transformer-Based Approach to Continuous, Non-Invasive Blood Pressure Estimation via Radar Sensing
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (Seoul, April 14, 2024 - April 19, 2024)
In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New York City: 2024
DOI: 10.1109/ICASSP48485.2024.10446988
BibTeX: Download

2023

Authored Books

Weise T., Maier A., Demir KC., Pérez Toro PA., Arias Vergara T., Heismann B., Nöth E., Schuster ME., Yang SH.:
Impact of Including Pathological Speech in Pre-training on Pathology Detection
Springer Science and Business Media Deutschland GmbH, 2023
ISBN: 9783031404979
DOI: 10.1007/978-3-031-40498-6_13
BibTeX: Download

Journal Articles

Arias Vergara T., Döllinger M., Schraut T., Mohd Khairuddin KA., Schützenberger A.:
Nyquist Plot Parametrization for Quantitative Analysis of Vibration of the Vocal Folds
In: Journal of Voice (2023)
ISSN: 0892-1997
DOI: 10.1016/j.jvoice.2023.01.014
BibTeX: Download

Conference Contributions

Weise T., Maier A., Demir KC., Pérez Toro PA., Arias Vergara T., Heismann B., Nöth E., Schuster M., Yang SH.:
Impact of Including Pathological Speech in Pre-training on Pathology Detection
TSD 2023: Text, Speech, and Dialogue (Pilsen, September 4, 2023 - September 6, 2023)
In: Kamil Ekštein, František Pártl, Miloslav Konopík (ed.): Text, Speech, and Dialogue, Cham: 2023
DOI: 10.1007/978-3-031-40498-6_13
BibTeX: Download
Pérez Toro PA., Arias Vergara T., Braun F., Hönig F., Tobón-Quintero CA., Aguillón D., Lopera F., Hincapié-Henao L., Schuster M., Riedhammer K., Maier A., Nöth E., Orozco Arroyave JR.:
Automatic Assessment of Alzheimer's across Three Languages Using Speech and Language Features
24th International Speech Communication Association, Interspeech 2023 (Dublin, IRL, August 20, 2023 - August 24, 2023)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023
DOI: 10.21437/Interspeech.2023-2079
BibTeX: Download
Hung H., Pérez Toro PA., Arias Vergara T., Maier A., Nöth E.:
Speaking Clearly, Understanding Better: Predicting the L2 Narrative Comprehension of Chinese Bilingual Kindergarten Children Based on Speech Intelligibility Using a Machine Learning Approach
Interspeech 2023 (Dublin, August 20, 2023 - August 24, 2023)
In: Interspeech 2023
DOI: 10.21437/Interspeech.2023-2057
BibTeX: Download
Pérez Toro PA., Rodriguez Salas D., Arias Vergara T., Bayerl SP., Klumpp P., Riedhammer KT., Schuster M., Nöth E., Maier A., Orozco Arroyave JR.:
Transferring Quantified Emotion Knowledge for the Detection of Depression in Alzheimer’s Disease Using Forestnets
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Rhodes Island, June 4, 2023 - June 10, 2023)
In: ICASSP 2023
DOI: 10.1109/ICASSP49357.2023.10095219
BibTeX: Download
Arias Vergara T., Londoño-Mora E., Pérez Toro PA., Schuster M., Nöth E., Orozco Arroyave JR., Maier A.:
Measuring Phonological Precision in Children with Cleft Lip and Palate
24th International Speech Communication Association, Interspeech 2023 (Dublin, August 20, 2023 - August 24, 2023)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023
DOI: 10.21437/Interspeech.2023-2099
BibTeX: Download

2022

Authored Books

Pérez Toro PA., Klumpp P., Vásquez-Correa JC., Schuster M., Nöth E., Orozco-Arroyave JR., Arias Vergara T.:
50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders
Springer Science and Business Media Deutschland GmbH, 2022
ISBN: 9783031162695
DOI: 10.1007/978-3-031-16270-1_29
BibTeX: Download
Arias Vergara T.:
Analysis of Pathological Speech Signals
Erlangen, Bayern, Germany: Logos Verlag Berlin GmbH, 2022
(Studien zur Mustererkennung, Vol.50)
ISBN: 978-3-8325-5561-0
URL: https://logos-verlag.eu/cgi-bin/engbuchmid?isbn=5561&lng=eng&id=
BibTeX: Download

Journal Articles

Pérez Toro PA., Rodriguez Salas D., Arias Vergara T., Klumpp P., Schuster M., Nöth E., Orozco Arroyave JR., Maier A.:
Interpreting acoustic features for the assessment of Alzheimer's disease using ForestNet
In: Smart Health 26 (2022), Article No.: 100347
ISSN: 2352-6483
DOI: 10.1016/j.smhl.2022.100347
BibTeX: Download
Schraut T., Schützenberger A., Arias Vergara T., Kunduk M., Echternach M., Döllinger M.:
Machine learning based estimation of hoarseness severity from sustained vowels
In: Journal of the Acoustical Society of America 152 (2022), p. A141-A141
ISSN: 0001-4966
DOI: 10.1121/10.0015825
BibTeX: Download
Arias Vergara T., Schraut T., Orozco-Arroyave JR., Döllinger M.:
Parameterization of voice onset for automatic assessment of Parkinson’s disease
In: Journal of the Acoustical Society of America 152 (2022), p. A140-A140
ISSN: 0001-4966
DOI: 10.1121/10.0015820
BibTeX: Download
Pérez Toro PA., Arias Vergara T., Klumpp P., Vásquez-Correa JC., Schuster M., Nöth E., Orozco Arroyave JR.:
Depression assessment in people with Parkinson's disease: The combination of acoustic features and natural language processing
In: Speech Communication 145 (2022), p. 10-20
ISSN: 0167-6393
DOI: 10.1016/j.specom.2022.09.001
BibTeX: Download

Conference Contributions

Dürr S., Schützenberger A., Kist A., Semmler M., Schraut T., Arias Vergara T., Döllinger M.:
High-speed video endoscopy to improve the diagnosis of voice disorders
DOI: 10.1055/s-0042-1746963
BibTeX: Download
Pérez-Toro PA., Klumpp P., Vasquez-Correa JC., Schuster M., Nöth E., Orozco-Arroyave JR., Arias Vergara T.:
50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders
25th International Conference on Text, Speech, and Dialogue, TSD 2022 (Brno, CZE, September 6, 2022 - September 9, 2022)
In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2022
DOI: 10.1007/978-3-031-16270-1_29
BibTeX: Download
Pérez Toro PA., Klumpp P., Hernandez A., Arias Vergara T., Lillo P., Slachevsky A., García AM., Schuster M., Maier A., Nöth E., Orozco Arroyave JR.:
Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
DOI: 10.21437/Interspeech.2022-10883
BibTeX: Download
Schäfer P., Pérez Toro PA., Klumpp P., Orozco-Arroyave JR., Nöth E., Maier A., Abad A., Schuster M., Arias Vergara T.:
CoachLea: an Android Application to Evaluate the Speech Production and Perception of Children with Hearing Loss
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, KOR, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
URL: https://www.scopus.com/record/display.uri?eid=2-s2.0-85140086069&origin=inward
BibTeX: Download
tom Dieck T., Pérez Toro PA., Arias Vergara T., Nöth E., Klumpp P.:
Wav2vec behind the Scenes: How end2end Models learn Phonetics
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
DOI: 10.21437/Interspeech.2022-10865
BibTeX: Download

2021

Journal Articles

Klumpp P., Arias Vergara T., Vásquez-Correa JC., Pérez Toro PA., Orozco-Arroyave JR., Batliner A., Nöth E.:
The Phonetic Footprint of Parkinson's Disease
In: Computer Speech and Language 72 (2021)
ISSN: 0885-2308
DOI: 10.1016/j.csl.2021.101321
URL: https://www.sciencedirect.com/science/article/abs/pii/S0885230821001169
BibTeX: Download
Vásquez-Correa JC., Rios-Urrego CD., Arias Vergara T., Schuster M., Rusz J., Nöth E., Orozco-Arroyave JR.:
Transfer learning helps to improve the accuracy to classify patients with different speech disorders in different languages
In: Pattern Recognition Letters (2021)
ISSN: 0167-8655
DOI: 10.1016/j.patrec.2021.04.011
BibTeX: Download

Conference Contributions

Pérez Toro PA., Vasquez Correa J., Arias Vergara T., Klumpp P., Sierra-Castrillón M., Roldán-López ME., Aguillón D., Hincapié-Henao L., Tóbon-Quintero CA., Bocklet T., Schuster M., Orozco-Arroyave JR., Nöth E.:
Acoustic and Linguistic Analyses to Assess Early-Onset and Genetic Alzheimer's Disease
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP39728.2021.9414009
URL: https://ieeexplore.ieee.org/abstract/document/9414009
BibTeX: Download
Pérez Toro PA., Vásquez-Correa JC., Arias Vergara T., Klumpp P., Schuster M., Nöth E., Orozco-Arroyave JR.:
Emotional State Modeling for the Assessment of Depression in Parkinson’s Disease
24th International Conference on Text, Speech, and Dialogue, TSD 2021 (Olomouc, CZE, September 6, 2021 - September 9, 2021)
In: Kamil Ekštein, František Pártl, Miloslav Konopík (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2021
DOI: 10.1007/978-3-030-83527-9_39
BibTeX: Download
Vasquez Correa J., Arias Vergara T., Klumpp P., Pérez Toro PA., Orozco-Arroyave JR., Nöth E.:
End-2-End Modeling of Speech and Gait from Patients with Parkinson’s Disease: Comparison Between High Quality Vs. Smartphone Data
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP39728.2021.9414729
URL: https://ieeexplore.ieee.org/abstract/document/9414729
BibTeX: Download
Pérez Toro PA., Bayerl S., Arias Vergara T., Vasquez Correa J., Klumpp P., Schuster M., Nöth E., Orozco-Arroyave JR., Riedhammer K.:
Influence of the Interviewer on the Automatic Assessment of Alzheimer’s Disease in the Context of the ADReSSo Challenge
22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021 (August 30, 2021 - September 3, 2021)
In: Proc. Interspeech 2021
DOI: 10.21437/Interspeech.2021-1589
BibTeX: Download
Klumpp P., Bocklet T., Arias Vergara T., Vasquez Correa J., Pérez Toro PA., Bayerl S., Orozco-Arroyave JR., Nöth E.:
The Phonetic Footprint of Covid-19
Interspeech 2021 (August 30, 2021 - September 3, 2021)
In: Proc. Interspeech 2021
DOI: 10.21437/Interspeech.2021-1488
BibTeX: Download

2020

Journal Articles

Arias Vergara T., Arguello-Velez P., Vasquez Correa J., Nöth E., Schuster M., González-Rátiva MC., Orozco Arroyave JR.:
Automatic detection of Voice Onset Time in voiceless plosives using gated recurrent units
In: Digital Signal Processing 104 (2020), Article No.: 102779
ISSN: 1051-2004
DOI: 10.1016/j.dsp.2020.102779
BibTeX: Download
Vasquez Correa J., Arias Vergara T., Schuster M., Orozco Arroyave JR., Nöth E.:
Parallel Representation Learning for the Classification of Pathological Speech: Studies on Parkinson's Disease and Cleft Lip and Palate
In: Speech Communication 122 (2020), p. 56-67
ISSN: 0167-6393
DOI: 10.1016/j.specom.2020.07.005
BibTeX: Download
Pérez Toro PA., Vasquez Correa J., Arias Vergara T., Nöth E., Orozco-Arroyave JR.:
Nonlinear dynamics and Poincaré sections to model gait impairments in different stages of Parkinson's disease
In: Nonlinear Dynamics 100 (2020), p. 3253-3276
ISSN: 0924-090X
DOI: 10.1007/s11071-020-05691-7
BibTeX: Download

Conference Contributions

Klumpp P., Arias Vergara T., Vasquez Correa J., Pérez Toro PA., Hönig FT., Nöth E., Orozco-Arroyave JR.:
Surgical mask detection with deep recurrent phonetic models
21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020
DOI: 10.21437/Interspeech.2020-1723
BibTeX: Download

2019

Book Contributions

Vasquez Correa J., Arias Vergara T., Rios-Urrego CD., Schuster M., Rusz J., Orozco Arroyave JR., Nöth E.:
Convolutional Neural Networks and a Transfer Learning Strategy to Classify Parkinson’s Disease from Speech in Three Different Languages
In: Ingela Nyström, Yanio Hernández Heredia, Vladimir Milián Núñez (ed.): Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2019, p. 697-706 (Image Processing, Computer Vision, Pattern Recognition, and Graphics, Vol.11896)
ISBN: 9783030339036
DOI: 10.1007/978-3-030-33904-3_66
BibTeX: Download
Arias Vergara T., Vasquez Correa J., Gollwitzer S., Orozco-Arroyave JR., Schuster M., Nöth E.:
Multi-channel Convolutional Neural Networks for Automatic Detection of Speech Deficits in Cochlear Implant Users
In: Ingela Nyström, Yanio Hernández Heredia, Vladimir Milián Núñez (ed.): Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2019, p. 679-687 (Image Processing, Computer Vision, Pattern Recognition, and Graphics, Vol.11896)
ISBN: 9783030339036
DOI: 10.1007/978-3-030-33904-3_64
BibTeX: Download

Conference Contributions

Arias Vergara T., Orozco Arroyave JR., Vasquez Correa J., Nöth E., Schuster M., Gollwitzer S., Högerle C.:
Speech differences between CI users with pre- and postlingual onset of deafness detected by speech processing methods on voiceless to voice transitions
90. Jahresversammlung der Deutschen Gesellschaft für Hals-Nasen-Ohren-Heilkunde, Kopf- und Hals-Chirurgie (Estrel Congress Center Berlin, May 29, 2019 - June 1, 2019)
In: Laryngo-Rhino-Otol 2019
DOI: 10.1055/s-0039-168632
BibTeX: Download
Arias Vergara T., Gollwitzer S., Orozco Arroyave JR., Schuster M., Nöth E.:
Consonant-to-Vowel/Vowel-to-Consonant Transitions to Analyze the Speech of Cochlear Implant Users.
Text, Speech, and Dialogue 2019 (Ljubljana, September 11, 2019 - September 13, 2019)
In: Kamil Ekštein (ed.): Lecture Notes in Computer Science 2019
DOI: 10.1007/978-3-030-27947-9_25
BibTeX: Download
Arias Vergara T., Vasquez Correa J., Gollwitzer S., Orozco Arroyave JR., Schuster M., Nöth E.:
Multi-channel Convolutional Neural Networks for Automatic Detection of Speech Deficits in Cochlear Implant Users
Iberoamerican Congress on Pattern Recognition
In: CIARP 2019: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications 2019
DOI: 10.1007/978-3-030-33904-3_64
BibTeX: Download
Arias Vergara T., Orozco-Arroyave JR., Cernak M., Gollwitzer S., Schuster M., Nöth E.:
Phone-attribute posteriors to evaluate the speech of cochlear implant users
20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019 (Graz, September 15, 2019 - September 19, 2019)
In: Gernot Kubin, Thomas Hain, Bjorn Schuller, Dina El Zarka, Petra Hodl (ed.): Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019
DOI: 10.21437/Interspeech.2019-2144
BibTeX: Download

2018

Journal Articles

Vasquez Correa J., Arias Vergara T., Rafael Orozco-Arroyave J., Eskofier B., Klucken J., Nöth E.:
Multimodal assessment of Parkinson's disease: a deep learning approach
In: IEEE Journal of Biomedical and Health Informatics (2018)
ISSN: 2168-2194
DOI: 10.1109/JBHI.2018.2866873
URL: https://ieeexplore.ieee.org/document/8444654
BibTeX: Download
Arias Vergara T., Vasquez Correa J., Rafael Orozco-Arroyave J., Nöth E.:
Speaker models for monitoring Parkinson’s disease progression considering different communication channels and acoustic conditions
In: Speech Communication 101 (2018), p. 11-25
ISSN: 0167-6393
DOI: 10.1016/j.specom.2018.05.007
URL: https://www.sciencedirect.com/science/article/abs/pii/S0167639317304454
BibTeX: Download

Conference Contributions

Vasquez Correa J., Arias Vergara T., Orozco-Arroyave JR., Nöth E.:
A Multitask Learning Approach to Assess the Dysarthria Severity in Patients with Parkinson's Disease
INTERSPEECH
In: Proceedings of INTERSPEECH 2018
DOI: 10.21437/Interspeech.2018-1988
URL: https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1988.html
BibTeX: Download
Arias Vergara T., Vasquez Correa J., Orozco Arroyave JR., Klumpp P., Nöth E.:
Unobtrusive Monitoring of Speech Impairments of Parkinson's Disease Patients Through Mobile Devices
43rd IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018
DOI: 10.1109/ICASSP.2018.8462332
URL: https://ieeexplore.ieee.org/abstract/document/8462332
BibTeX: Download

2017

Authored Books

Vasquez Correa J., Castrillon R., Arias Vergara T., Orozco Arroyave JR., Nöth E.:
Speaker model to monitor the neurological state and the dysarthria level of patients with Parkinson’s disease
Springer Verlag, 2017
ISBN: 9783319642055
DOI: 10.1007/978-3-319-64206-2_31
BibTeX: Download

Conference Contributions

Klumpp P., Janu T., Arias Vergara T., Vasquez Correa J., Rafael Orozco-Arroyave J., Nöth E.:
Apkinson — A Mobile Monitoring Solution for Parkinson's Disease
Interspeech 2017
In: Interspeech 2017
URL: https://www.researchgate.net/profile/Juan_Vasquez12/publication/319185470_Apkinson_-_A_Mobile_Monitoring_Solution_for_Parkinson's_Disease/links/59b9fecfa6fdcc68723177dc/Apkinson-A-Mobile-Monitoring-Solution-for-Parkinsons-Disease.pdf
BibTeX: Download
Arias Vergara T., Klumpp P., Vasquez Correa J., Orozco Arroyave JR., Nöth E.:
Parkinson’s disease progression assessment from speech using a mobile device-based application
20th International Conference on Text, Speech and Dialogue, TSD 2017
In: TSD 2017: Text, Speech, and Dialogue 2017
DOI: 10.1007/978-3-319-64206-2_42
BibTeX: Download

2016

Conference Contributions

Arias Vergara T., Vasquez Correa J., Orozco-Arroyave JR., Vargas-Bonilla JF., Haderlein T., Nöth E.:
Gender-dependent GMM-UBM for Tracking Parkinson's Disease Progression from Speech
12. ITG Fachtagung Sprachkommunikation (Paderborn)
In: Speech Communication - 12. ITG Fachtagung Sprachkommunikation, Berlin: 2016
URL: https://www5.informatik.uni-erlangen.de/Forschung/Publikationen/2016/Arias-Vergara16-GGF.pdf
BibTeX: Download

Thesis Supervision

Type	Title	Status
MA thesis	Text Generation in Alzheimer’s Disease	running
MA thesis	Improving Text Summarization through Guided Decoding of Language Models	running
MA thesis	Spoken Language Identification for Hearing Aids	running
MA thesis	Understanding Odor Descriptors through Advanced NLP Models and Semantic Scores	running
Project	Generation of Clinical Text Reports from Chest X-Ray Images	running
MA thesis	Cross-Dataset Phonological Speech Analysis of Children with Cleft Lip and Palate	finished
Project	Automatic recognition of Bavarian dialects	running
MA thesis	Large Language Model for Generation of Structured Medical Report from X-ray Transcriptions	finished
MA thesis	Natural Language Text Generation for Symbolic Descriptions Using Language Models	finished
MA thesis	Development of a deep learning approach to detect faulty axial bearing components after assembly using acoustic signals	finished
MA thesis	Edge-AI: Self-sensing backpressure estimation in piezoelectric micropumps using machine learning methods on a limited hardware	finished
BA thesis	CoachLea: An Android Application to evaluate the progress of speaking and hearing abilities of children with Cochlear Implant	finished
BA thesis	CITA: An Android-based Application to Evaluate the Speech of Cochlear Implant Users	finished