Dr.-Ing. Tomás Arias Vergara

Dr.-Ing. Tomás Arias Vergara, M. Sc.

Lehrstuhl für Informatik 5 (Mustererkennung)
Chair of Computer Science 5 (Pattern Recognition)

Room 10.134
Martensstr. 3
91058 Erlangen

Phone number: +49 9131 85-27872
Fax number: +49 9131 85-27270
Email: tomas.arias@fau.de
Website: https://lme.tf.fau.de/person/arias/

I received a B.S. in Electronics Engineering from the University of Antioquia (UdeA, Colombia) in 2014, a Master of Science degree at the same institution in 2017, and a Ph.D. in a joint program between the UdeA and the FAU in 2022. Since 2015, my research has focused on speech processing and machine learning methods for the analysis of pathological speech signals resulting from neurological (e.g., Parkinson’s disease), structural (e.g., children with cleft lip and palate), and perceptual (e.g., hearing loss) disorders. I have also investigated the effect of the natural aging process on speech, participated in developing Android-based applications for collecting and analyzing data from Parkinson’s disease patients and adults/children with hearing loss, and performed research on automatic methods for the analysis of high-speed videoendoscopy data of people with voice disorders.

Academic CV

https://tariasvergara.github.io/CV_Tomas_AriasVergara.pdf

Projects

2024

A multimodal approach for automatic generation of radiology reports using chest X-ray images, clinical free-text, and spoken commands.

(FAU Funds)

Term: January 15, 2024 - January 14, 2025

Abstract

Advancements in Artificial Intelligence (AI) methods have enabled thedevelopment of Large Language Models (LLMs) capable of generating informationfrom user instructions and supporting various tasks in education, research,healthcare, and others. AI has also impacted the field of medical imaging withseveral deep learning models capable of achieving expert-level performanceacross different tasks, e.g., detection, segmentation, and assisted clinicaldiagnosis. In addition, open-source Automatic Speech Recognition (ASR) systemscan be incorporated as modules in AI-based systems. This proposed fundedproject aims to combine LLMs, medical imaging, and speech recognition using AImethods to generate high-quality radiology reports from chest X-ray images.

→More information
Coordinated grid protection based on machine learning methods

(Third Party Funds Single)

Term: July 1, 2024 - June 30, 2027
Funding source: DFG-Einzelförderung / Sachbeihilfe (EIN-SBH)

→More information

2017

Training Network on Automatic Processing of PAthological Speech

(Third Party Funds Group – Overall project)

Term: November 1, 2017 - October 31, 2021
Funding source: Innovative Training Networks (ITN)
URL: https://www.tapas-etn-eu.org/

Abstract

There are an increasing number of people across Europe with debilitating speech pathologies (e.g., due to stroke, Parkinson's, etc). These groups face communication problems that can lead to social exclusion. They are now being further marginalised by a new wave of speech technology that is increasingly woven into everyday life but which is not robust to atypical speech. TAPAS is a Horizon 2020 Marie Skłodowska-Curie Actions Innovative Training Network European Training Network (MSCA-ITN-ETN) project that aims to transform the well being of these people.
The TAPAS work programme targets three key research problems:
(a) Detection: We will develop speech processing techniques for early detection of conditions that impact on speech production. The outcomes will be cheap and non-invasive diagnostic tools that provide early warning of the onset of progressive conditions such as Alzheimer's and Parkinson's.
(b) Therapy: We will use newly-emerging speech processing techniques to produce automated speech therapy tools. These tools will make therapy more accessible and more individually targeted. Better therapy can increase the chances of recovering intelligible speech after traumatic events such a stroke or oral surgery.
(c) Assisted Living: We will re-design current speech technology so that it works well for people with speech impairments and also helps in making informed clinical choices. People with speech impairments often have other co-occurring conditions making them reliant on carers. Speech-driven tools for assisted-living are a way to allow such people to live more independently.
TAPAS adopts an inter-disciplinary and multi-sectorial approach. The consortium includes clinical practitioners, academic researchers and industrial partners, with expertise spanning speech engineering, linguistics and clinical science. All members have expertise in some element of pathological speech. This rich network will train a new generation of 15 researchers, equipping them with the skills and resources necessary for lasting success.

→More information

Publications

2025

Conference Contributions

Ramachandran A., Arias Vergara T., Maier A., Bayer S.:
Water Demand Forecasting of District Metered Areas through Learned Consumer Representations
The 33rd European Signal Processing Conference (EUSIPCO 2025) (Palermo, Italy, September 8, 2025 - September 12, 2025)
In: Water Demand Forecasting of District Metered Areas through Learned Consumer Representations 2025
BibTeX: Download
Oelhaf J., Kordowich G., Perez Toro PA., Arias Vergara T., Maier A., Jäger J., Bayer S.:
A Systematic Evaluation of Machine Learning Methods for Fault Detection and Line Identification in Electrical Power Grids
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Hyderabad, April 6, 2025 - April 11, 2025)
In: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New York City: 2025
DOI: 10.1109/ICASSP49660.2025.10890544
BibTeX: Download
Schwarz A., Dickmann J., Hofmann C., Szkitsak J., Bert C., Maier A., Arias Vergara T.:
Cross-Modality Image Quality Prediction for Time-Resolved CT from Breathing Signals
Workshop on Longitudinal Disease Tracking and Modeling with Medical Images and Data, LDTM 2024, 5th International Workshop on Multiscale Multimodal Medical Imaging, MMMI 2024, 1st Workshop on Machine Learning for Multimodal/-sensor Healthcare Data, ML4MHD2024 and Workshop on Multimodal Learning and Fusion Across Scales for Clinical Decision Support, ML-CDS 2024 held in conjunction with the 27th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2024 (Marrakesh, October 6, 2024 - October 10, 2024)
In: Anna Schroder, Xiang Li, Tanveer Syeda-Mahmood, Neil P. Oxtoby, Alexandra Young, Alessa Hering, Tejas S. Mathai, Pritam Mukherjee, Sven Kuckertz, Tiantian He, Isaac Llorente-Saguer, Andreas Maier, Satyananda Kashyap, Hayit Greenspan, Anant Madabhushi (ed.): Lecture Notes in Computer Science 2025
DOI: 10.1007/978-3-031-84525-3_13
BibTeX: Download

2024

Journal Articles

Bi M(., Nguyen DD., Arias Vergara T., Döllinger M., Holik J., Madill C.:
Effects of Instructed Laryngeal Manipulation on Vocal Rise Time
In: Journal of Voice (2024)
ISSN: 0892-1997
DOI: 10.1016/j.jvoice.2024.10.009
BibTeX: Download
Arias Vergara T., Madill C., Nguyen D., Holik J., Döllinger M.:
VOAT: Voice Onset Analysis Tool
In: SoftwareX 27 (2024), Article No.: 101802
ISSN: 2352-7110
DOI: 10.1016/j.softx.2024.101802
BibTeX: Download
Schraut T., Schützenberger A., Arias Vergara T., Kunduk M., Echternach M., Döllinger M.:
Machine learning based estimation of hoarseness severity using sustained vowelsa)
In: Journal of the Acoustical Society of America 155 (2024), p. 381-395
ISSN: 0001-4966
DOI: 10.1121/10.0024341
BibTeX: Download
Chacon AM., Nguyen DD., Holik J., Döllinger M., Arias Vergara T., Arias-Vergara T., Madill CJ.:
Vowel onset measures and their reliability, sensitivity and specificity: A systematic literature review
In: PLoS ONE 19 (2024), p. e0301786-
ISSN: 1932-6203
DOI: 10.1371/journal.pone.0301786
BibTeX: Download

Conference Contributions

Kulyabin M., Sokolov G., Galaida A., Maier A., Arias Vergara T.:
SNOBERT: A Benchmark for clinical notes entity linking in the SNOMED CT clinical terminology
27th International Conference on Pattern Recognition (Kolkata, India, December 1, 2024 - December 5, 2024)
In: Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal (ed.): Proceedings of the 27th International Conference on Pattern Recognition 2024 2024
DOI: 10.1007/978-3-031-78119-3_11
URL: https://arxiv.org/abs/2405.16115
BibTeX: Download
Vysotskaya N., Maul N., Fusco A., Hazra S., Harnisch J., Arias Vergara T., Maier A.:
Transforming Cardiovascular Health: a Transformer-Based Approach to Continuous, Non-Invasive Blood Pressure Estimation via Radar Sensing
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (Seoul, April 14, 2024 - April 19, 2024)
In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New York City: 2024
DOI: 10.1109/ICASSP48485.2024.10446988
BibTeX: Download
Arias Vergara T., Perez Toro PA., Liu X., Xing F., Stone M., Zhuo J., Prince JL., Schuster M., Nöth E., Woo J., Maier A.:
Contrastive Learning Approach for Assessment of Phonological Precision in Patients with Tongue Cancer Using MRI Data
25th Interspeech Conferece 2024 (Kos Island, September 1, 2024 - September 5, 2024)
In: Interspeech 2024 2024
DOI: 10.21437/Interspeech.2024-2236
BibTeX: Download
Perez Toro PA., Arias Vergara T., Klumpp P., Weise T., Schuster M., Nöth E., Orozco Arroyave JR., Maier A.:
Multilingual Speech and Language Analysis for the Assessment of Mild Cognitive Impairment: Outcomes from the Taukadial Challenge
25th Interspeech Conferece 2024 (Kos Island, September 1, 2024 - September 5, 2024)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2024
DOI: 10.21437/Interspeech.2024-2115
BibTeX: Download
Liu X., Xing F., Bian Z., Arias Vergara T., Perez Toro PA., Maier A., Stone M., Zhuo J., Prince JL., Woo J.:
Tagged-to-Cine MRI Sequence Synthesis via Light Spatial-Temporal Transformer
27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024 (Marrakesh, October 6, 2024 - October 10, 2024)
In: Medical Image Computing and Computer Assisted Intervention – MICCAI 2024, Cham: 2024
DOI: 10.1007/978-3-031-72104-5_67
BibTeX: Download

2023

Authored Books

Weise T., Maier A., Demir K., Perez Toro PA., Arias Vergara T., Heismann B., Nöth E., Schuster ME., Yang SH.:
Impact of Including Pathological Speech in Pre-training on Pathology Detection
Springer Science and Business Media Deutschland GmbH, 2023
ISBN: 9783031404979
DOI: 10.1007/978-3-031-40498-6_13
BibTeX: Download

Journal Articles

Arias Vergara T., Döllinger M., Schraut T., Mohd Khairuddin KA., Schützenberger A.:
Nyquist Plot Parametrization for Quantitative Analysis of Vibration of the Vocal Folds
In: Journal of Voice (2023)
ISSN: 0892-1997
DOI: 10.1016/j.jvoice.2023.01.014
BibTeX: Download

Conference Contributions

Weise T., Maier A., Demir K., Perez Toro PA., Arias Vergara T., Heismann B., Nöth E., Schuster M., Yang SH.:
Impact of Including Pathological Speech in Pre-training on Pathology Detection
TSD 2023: Text, Speech, and Dialogue (Pilsen, September 4, 2023 - September 6, 2023)
In: Kamil Ekštein, František Pártl, Miloslav Konopík (ed.): Text, Speech, and Dialogue, Cham: 2023
DOI: 10.1007/978-3-031-40498-6_13
BibTeX: Download
Escobar-Grisales D., Arias-Vergara T., Rios-Urrego CD., Nöth E., García AM., Orozco-Arroyave JR.:
An Automatic Multimodal Approach to Analyze Linguistic and Acoustic Cues on Parkinson's Disease Patients
24th International Speech Communication Association, Interspeech 2023 (Dublin, IRL, August 20, 2023 - August 24, 2023)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023
DOI: 10.21437/Interspeech.2023-2287
BibTeX: Download
Perez Toro PA., Arias Vergara T., Braun F., Hönig F., Tobón-Quintero CA., Aguillón D., Lopera F., Hincapié-Henao L., Schuster M., Riedhammer K., Maier A., Nöth E., Orozco Arroyave JR.:
Automatic Assessment of Alzheimer's across Three Languages Using Speech and Language Features
24th International Speech Communication Association, Interspeech 2023 (Dublin, IRL, August 20, 2023 - August 24, 2023)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023
DOI: 10.21437/Interspeech.2023-2079
BibTeX: Download
Hung H., Perez Toro PA., Arias Vergara T., Maier A., Nöth E.:
Speaking Clearly, Understanding Better: Predicting the L2 Narrative Comprehension of Chinese Bilingual Kindergarten Children Based on Speech Intelligibility Using a Machine Learning Approach
Interspeech 2023 (Dublin, August 20, 2023 - August 24, 2023)
In: Interspeech 2023 2023
DOI: 10.21437/Interspeech.2023-2057
BibTeX: Download
Perez Toro PA., Rodriguez Salas D., Arias Vergara T., Bayerl SP., Klumpp P., Riedhammer KT., Schuster M., Nöth E., Maier A., Orozco Arroyave JR.:
Transferring Quantified Emotion Knowledge for the Detection of Depression in Alzheimer’s Disease Using Forestnets
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Rhodes Island, June 4, 2023 - June 10, 2023)
In: ICASSP 2023 2023
DOI: 10.1109/ICASSP49357.2023.10095219
BibTeX: Download
Arias Vergara T., Londoño-Mora E., Perez Toro PA., Schuster M., Nöth E., Orozco Arroyave JR., Maier A.:
Measuring Phonological Precision in Children with Cleft Lip and Palate
24th International Speech Communication Association, Interspeech 2023 (Dublin, August 20, 2023 - August 24, 2023)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023
DOI: 10.21437/Interspeech.2023-2099
BibTeX: Download

2022

Authored Books

Perez Toro PA., Klumpp P., Vásquez-Correa JC., Schuster M., Nöth E., Orozco-Arroyave JR., Arias Vergara T.:
50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders
Springer Science and Business Media Deutschland GmbH, 2022
ISBN: 9783031162695
DOI: 10.1007/978-3-031-16270-1_29
BibTeX: Download
Arias Vergara T.:
Analysis of Pathological Speech Signals
Erlangen, Bayern, Germany: Logos Verlag Berlin GmbH, 2022
(Studien zur Mustererkennung, Vol.50)
ISBN: 978-3-8325-5561-0
URL: https://logos-verlag.eu/cgi-bin/engbuchmid?isbn=5561&lng=eng&id=
BibTeX: Download

Journal Articles

Arias-Vergara T., Batliner A., Rader T., Polterauer D., Högerle C., Müller J., Orozco-Arroyave JR., Nöth E., Schuster M.:
Adult Cochlear Implant Users Versus Typical Hearing Persons: An Automatic Analysis of Acoustic–Prosodic Parameters
In: Journal of Speech Language and Hearing Research 65 (2022), p. 4623-4636
ISSN: 1092-4388
DOI: 10.1044/2022_JSLHR-21-00116
BibTeX: Download
Perez Toro PA., Rodriguez Salas D., Arias Vergara T., Klumpp P., Schuster M., Nöth E., Orozco Arroyave JR., Maier A.:
Interpreting acoustic features for the assessment of Alzheimer's disease using ForestNet
In: Smart Health 26 (2022), Article No.: 100347
ISSN: 2352-6483
DOI: 10.1016/j.smhl.2022.100347
BibTeX: Download
Schraut T., Schützenberger A., Arias Vergara T., Kunduk M., Echternach M., Döllinger M.:
Machine learning based estimation of hoarseness severity from sustained vowels
In: Journal of the Acoustical Society of America 152 (2022), p. A141-A141
ISSN: 0001-4966
DOI: 10.1121/10.0015825
BibTeX: Download
Arias Vergara T., Schraut T., Orozco-Arroyave JR., Döllinger M.:
Parameterization of voice onset for automatic assessment of Parkinson’s disease
In: Journal of the Acoustical Society of America 152 (2022), p. A140-A140
ISSN: 0001-4966
DOI: 10.1121/10.0015820
BibTeX: Download
Arias Vergara T., Schraut T., Orozco-Arroyave JR., Döllinger M.:
Parameterization of voice onset for automatic assessment of Parkinson’s disease
In: Journal of the Acoustical Society of America 152 (2022), p. A140-A140
ISSN: 0001-4966
DOI: 10.1121/10.0015820
BibTeX: Download
Perez Toro PA., Arias Vergara T., Klumpp P., Vásquez-Correa JC., Schuster M., Nöth E., Orozco Arroyave JR.:
Depression assessment in people with Parkinson's disease: The combination of acoustic features and natural language processing
In: Speech Communication 145 (2022), p. 10-20
ISSN: 0167-6393
DOI: 10.1016/j.specom.2022.09.001
BibTeX: Download
Perez Toro PA., Arias Vergara T., Klumpp P., Vásquez-Correa JC., Schuster M., Nöth E., Orozco-Arroyave JR.:
Depression assessment in people with Parkinson's disease: The combination of acoustic features and natural language processing
In: Speech Communication 145 (2022), p. 10-20
ISSN: 0167-6393
DOI: 10.1016/j.specom.2022.09.001
BibTeX: Download
Perez Toro PA., Rodriguez Salas D., Arias Vergara T., Klumpp P., Schuster M., Nöth E., Orozco-Arroyave JR., Maier A.:
Interpreting acoustic features for the assessment of Alzheimer's disease using ForestNet
In: Smart Health 26 (2022), Article No.: 100347
ISSN: 2352-6483
DOI: 10.1016/j.smhl.2022.100347
BibTeX: Download

Conference Contributions

Dürr S., Schützenberger A., Kist A., Semmler M., Schraut T., Arias Vergara T., Döllinger M.:
High-speed video endoscopy to improve the diagnosis of voice disorders
DOI: 10.1055/s-0042-1746963
BibTeX: Download
Pérez-Toro PA., Klumpp P., Vasquez-Correa JC., Schuster M., Nöth E., Orozco-Arroyave JR., Arias Vergara T.:
50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders
25th International Conference on Text, Speech, and Dialogue, TSD 2022 (Brno, CZE, September 6, 2022 - September 9, 2022)
In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2022
DOI: 10.1007/978-3-031-16270-1_29
BibTeX: Download
Perez Toro PA., Klumpp P., Hernandez A., Arias Vergara T., Lillo P., Slachevsky A., García AM., Schuster M., Maier A., Nöth E., Orozco Arroyave JR.:
Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
DOI: 10.21437/Interspeech.2022-10883
BibTeX: Download
Schäfer P., Perez Toro PA., Klumpp P., Orozco-Arroyave JR., Nöth E., Maier A., Abad A., Schuster M., Arias Vergara T.:
CoachLea: an Android Application to Evaluate the Speech Production and Perception of Children with Hearing Loss
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
URL: https://www.scopus.com/record/display.uri?eid=2-s2.0-85140086069∨igin=inward
BibTeX: Download
Schäfer P., Perez Toro PA., Klumpp P., Orozco Arroyave JR., Nöth E., Maier A., Abad A., Schuster M., Arias Vergara T.:
CoachLea: an Android Application to Evaluate the Speech Production and Perception of Children with Hearing Loss
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, KOR, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
BibTeX: Download
tom Dieck T., Perez Toro PA., Arias Vergara T., Nöth E., Klumpp P.:
Wav2vec behind the Scenes: How end2end Models learn Phonetics
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
DOI: 10.21437/Interspeech.2022-10865
BibTeX: Download
tom Dieck T., Perez Toro PA., Arias Vergara T., Nöth E., Klumpp P.:
Wav2vec behind the Scenes: How end2end Models learn Phonetics
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
DOI: 10.21437/Interspeech.2022-10865
BibTeX: Download

2021

Journal Articles

Garcia AM., Arias-Vergara T., C. Vasquez-Correa J., Nöth E., Schuster M., Welch AE., Bocanegra Y., Baena A., Orozco-Arroyave JR.:
Cognitive Determinants of Dysarthria in Parkinson's Disease: An Automated Machine Learning Approach
In: Movement Disorders (2021)
ISSN: 0885-3185
DOI: 10.1002/mds.28751
BibTeX: Download
Klumpp P., Arias Vergara T., Vásquez-Correa JC., Perez Toro PA., Orozco-Arroyave JR., Batliner A., Nöth E.:
The Phonetic Footprint of Parkinson's Disease
In: Computer Speech and Language 72 (2021)
ISSN: 0885-2308
DOI: 10.1016/j.csl.2021.101321
URL: https://www.sciencedirect.com/science/article/abs/pii/S0885230821001169
BibTeX: Download
Vásquez-Correa JC., Rios-Urrego CD., Arias Vergara T., Schuster M., Rusz J., Nöth E., Orozco-Arroyave JR.:
Transfer learning helps to improve the accuracy to classify patients with different speech disorders in different languages
In: Pattern Recognition Letters (2021)
ISSN: 0167-8655
DOI: 10.1016/j.patrec.2021.04.011
BibTeX: Download

Conference Contributions

Perez Toro PA., Vasquez Correa J., Arias Vergara T., Klumpp P., Sierra-Castrillón M., Roldán-López ME., Aguillón D., Hincapié-Henao L., Tóbon-Quintero CA., Bocklet T., Schuster M., Orozco-Arroyave JR., Nöth E.:
Acoustic and Linguistic Analyses to Assess Early-Onset and Genetic Alzheimer's Disease
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP39728.2021.9414009
URL: https://ieeexplore.ieee.org/abstract/document/9414009
BibTeX: Download
Perez Toro PA., Vásquez-Correa JC., Arias Vergara T., Klumpp P., Schuster M., Nöth E., Orozco-Arroyave JR.:
Emotional State Modeling for the Assessment of Depression in Parkinson’s Disease
24th International Conference on Text, Speech, and Dialogue, TSD 2021 (Olomouc, CZE, September 6, 2021 - September 9, 2021)
In: Kamil Ekštein, František Pártl, Miloslav Konopík (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2021
DOI: 10.1007/978-3-030-83527-9_39
BibTeX: Download
Vasquez Correa J., Arias Vergara T., Klumpp P., Perez Toro PA., Orozco-Arroyave JR., Nöth E.:
End-2-End Modeling of Speech and Gait from Patients with Parkinson’s Disease: Comparison Between High Quality Vs. Smartphone Data
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP39728.2021.9414729
URL: https://ieeexplore.ieee.org/abstract/document/9414729
BibTeX: Download
Perez Toro PA., Bayerl S., Arias Vergara T., Vasquez Correa J., Klumpp P., Schuster M., Nöth E., Orozco-Arroyave JR., Riedhammer K.:
Influence of the Interviewer on the Automatic Assessment of Alzheimer’s Disease in the Context of the ADReSSo Challenge
22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021 (, August 30, 2021 - September 3, 2021)
In: Proc. Interspeech 2021 2021
DOI: 10.21437/Interspeech.2021-1589
BibTeX: Download
Parra-Gallego LF., Arias-Vergara T., Arroyave JRO.:
Robust Automatic Speech Recognition for Call Center Applications
8th Workshop on Engineering Applications, WEA 2021 (Virtual, Online, October 6, 2021 - October 8, 2021)
In: Juan Carlos Figueroa-García, Yesid Díaz-Gutierrez, Elvis Eduardo Gaona-García, Alvaro David Orjuela-Cañón (ed.): Communications in Computer and Information Science 2021
DOI: 10.1007/978-3-030-86702-7_7
BibTeX: Download
Klumpp P., Bocklet T., Arias Vergara T., Vasquez Correa J., Perez Toro PA., Bayerl S., Orozco-Arroyave JR., Nöth E.:
The Phonetic Footprint of Covid-19
Interspeech 2021 (, August 30, 2021 - September 3, 2021)
In: Proc. Interspeech 2021 2021
DOI: 10.21437/Interspeech.2021-1488
BibTeX: Download

2020

Journal Articles

Arias Vergara T., Arguello-Velez P., Vasquez Correa J., Nöth E., Schuster M., González-Rátiva MC., Orozco Arroyave JR.:
Automatic detection of Voice Onset Time in voiceless plosives using gated recurrent units
In: Digital Signal Processing 104 (2020), Article No.: 102779
ISSN: 1051-2004
DOI: 10.1016/j.dsp.2020.102779
BibTeX: Download
Vasquez Correa J., Arias Vergara T., Schuster M., Orozco Arroyave JR., Nöth E.:
Parallel Representation Learning for the Classification of Pathological Speech: Studies on Parkinson's Disease and Cleft Lip and Palate
In: Speech Communication 122 (2020), p. 56-67
ISSN: 0167-6393
DOI: 10.1016/j.specom.2020.07.005
BibTeX: Download
Orozco-Arroyave JR., Vasquez Correa J., Klumpp P., Perez Toro PA., Escobar-Grisales D., Roth N., Ríos-Urrego CD., Strauß M., Carvajal-Castaño HA., Bayerl S., Castrillón-Osorio LR., Arias-Vergara T., Küderle A., López-Pabón FO., Parra-Gallego LF., Eskofier B., Gómez-Gómez LF., Schuster M., Nöth E.:
Apkinson: the smartphone application for telemonitoring Parkinson's patients through speech, gait and hands movement
In: Neurodegenerative Disease Management (2020)
ISSN: 1758-2024
DOI: 10.2217/nmt-2019-0037
BibTeX: Download
Perez Toro PA., Vasquez Correa J., Arias Vergara T., Nöth E., Orozco-Arroyave JR.:
Nonlinear dynamics and Poincare sections to model gait impairments in different stages of Parkinson's disease
In: Nonlinear Dynamics 100 (2020), p. 3253-3276
ISSN: 0924-090X
DOI: 10.1007/s11071-020-05691-7
BibTeX: Download
Pérez-Toro PA., Vasquez Correa J., Arias Vergara T., Nöth E., Orozco Arroyave JR.:
Nonlinear dynamics and Poincaré sections to model gait impairments in different stages of Parkinson’s disease
In: Nonlinear Dynamics (2020)
ISSN: 0924-090X
DOI: 10.1007/s11071-020-05691-7
BibTeX: Download

Conference Contributions

Argüello-Vélez P., Arias-Vergara T., González-Rátiva MC., Orozco-Arroyave JR., Nöth E., Schuster ME.:
Acoustic characteristics of vot in plosive consonants produced by parkinson’s patients
23rd International Conference on Text, Speech, and Dialogue, TSD 2020 (Brno, September 8, 2020 - September 11, 2020)
In: Petr Sojka, Ivan Kopecek, Karel Pala, Aleš Horák (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2020
DOI: 10.1007/978-3-030-58323-1_33
BibTeX: Download
Klumpp P., Arias Vergara T., Vásquez-Correa JC., Pérez-Toro PA., Hönig FT., Nöth E., Orozco-Arroyave JR.:
Surgical mask detection with deep recurrent phonetic models
Interspeech 2020
In: Interspeech 2020 2020
BibTeX: Download
Klumpp P., Arias Vergara T., Vasquez Correa J., Perez Toro PA., Hönig FT., Nöth E., Orozco-Arroyave JR.:
Surgical mask detection with deep recurrent phonetic models
21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020
DOI: 10.21437/Interspeech.2020-1723
BibTeX: Download

2019

Book Contributions

Vasquez Correa J., Arias Vergara T., Rios-Urrego CD., Schuster M., Rusz J., Orozco Arroyave JR., Nöth E.:
Convolutional Neural Networks and a Transfer Learning Strategy to Classify Parkinson’s Disease from Speech in Three Different Languages
In: Ingela Nyström, Yanio Hernández Heredia, Vladimir Milián Núñez (ed.): Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2019, p. 697-706 (Image Processing, Computer Vision, Pattern Recognition, and Graphics, Vol.11896)
ISBN: 9783030339036
DOI: 10.1007/978-3-030-33904-3_66
BibTeX: Download
Arias Vergara T., Vasquez Correa J., Gollwitzer S., Orozco-Arroyave JR., Schuster M., Nöth E.:
Multi-channel Convolutional Neural Networks for Automatic Detection of Speech Deficits in Cochlear Implant Users
In: Ingela Nyström, Yanio Hernández Heredia, Vladimir Milián Núñez (ed.): Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2019, p. 679-687 (Image Processing, Computer Vision, Pattern Recognition, and Graphics, Vol.11896)
ISBN: 9783030339036
DOI: 10.1007/978-3-030-33904-3_64
BibTeX: Download

Conference Contributions

Arias Vergara T., Orozco Arroyave JR., Vasquez Correa J., Nöth E., Schuster M., Gollwitzer S., Högerle C.:
Speech differences between CI users with pre- and postlingual onset of deafness detected by speech processing methods on voiceless to voice transitions
90. Jahresversammlung der Deutschen Gesellschaft für Hals-Nasen-Ohren-Heilkunde, Kopf- und Hals-Chirurgie (Estrel Congress Center Berlin, May 29, 2019 - June 1, 2019)
In: Laryngo-Rhino-Otol 2019 2019
DOI: 10.1055/s-0039-168632
BibTeX: Download
Arias Vergara T., Gollwitzer S., Orozco Arroyave JR., Schuster M., Nöth E.:
Consonant-to-Vowel/Vowel-to-Consonant Transitions to Analyze the Speech of Cochlear Implant Users.
Text, Speech, and Dialogue 2019 (Ljubljana, September 11, 2019 - September 13, 2019)
In: Kamil Ekštein (ed.): Lecture Notes in Computer Science 2019
DOI: 10.1007/978-3-030-27947-9_25
BibTeX: Download
Arias Vergara T., Vasquez Correa J., Gollwitzer S., Orozco Arroyave JR., Schuster M., Nöth E.:
Multi-channel Convolutional Neural Networks for Automatic Detection of Speech Deficits in Cochlear Implant Users
Iberoamerican Congress on Pattern Recognition
In: CIARP 2019: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications 2019
DOI: 10.1007/978-3-030-33904-3_64
BibTeX: Download
Arias Vergara T., Orozco-Arroyave JR., Cernak M., Gollwitzer S., Schuster M., Nöth E.:
Phone-attribute posteriors to evaluate the speech of cochlear implant users
20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019 (Graz, September 15, 2019 - September 19, 2019)
In: Gernot Kubin, Thomas Hain, Bjorn Schuller, Dina El Zarka, Petra Hodl (ed.): Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019
DOI: 10.21437/Interspeech.2019-2144
BibTeX: Download

2018

Journal Articles

Vasquez Correa J., Arias Vergara T., Orozco-Arroyave JR., Eskofier B., Klucken J., Nöth E.:
Multimodal assessment of Parkinson's disease: a deep learning approach
In: IEEE Journal of Biomedical and Health Informatics (2018)
ISSN: 2168-2194
DOI: 10.1109/JBHI.2018.2866873
URL: https://ieeexplore.ieee.org/document/8444654
BibTeX: Download
Arias Vergara T., Vasquez Correa J., Rafael Orozco-Arroyave J., Nöth E.:
Speaker models for monitoring Parkinson’s disease progression considering different communication channels and acoustic conditions
In: Speech Communication 101 (2018), p. 11-25
ISSN: 0167-6393
DOI: 10.1016/j.specom.2018.05.007
URL: https://www.sciencedirect.com/science/article/abs/pii/S0167639317304454
BibTeX: Download

Conference Contributions

Vasquez Correa J., Arias Vergara T., Orozco-Arroyave JR., Nöth E.:
A Multitask Learning Approach to Assess the Dysarthria Severity in Patients with Parkinson's Disease
INTERSPEECH
In: Proceedings of INTERSPEECH 2018
DOI: 10.21437/Interspeech.2018-1988
URL: https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1988.html
BibTeX: Download
Perez Toro PA., Camilo Vasquez-Correa J., Arias-Vergara T., Garcia-Ospina N., Rafael Orozco-Arroyave J., Nöth E.:
A Non-linear Dynamics Approach to Classify Gait Signals of Patients with Parkinson's Disease
5th Workshop on Engineering Applications (WEA) (Medellin, COLOMBIA, October 17, 2018 - October 19, 2018)
In: APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2018, PT II, BERLIN: 2018
DOI: 10.1007/978-3-030-00353-1_24
BibTeX: Download
Felipe Parra-Gallego L., Arias-Vergara T., Camilo Vasquez-Correa J., Garcia-Ospina N., Rafael Orozco-Arroyave J., Nöth E.:
Automatic Intelligibility Assessment of Parkinson's Disease with Diadochokinetic Exercises
5th Workshop on Engineering Applications (WEA) (Medellin, October 17, 2018 - October 19, 2018)
In: APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2018, PT II, BERLIN: 2018
DOI: 10.1007/978-3-030-00353-1_20
BibTeX: Download
Arias Vergara T., Vasquez Correa J., Orozco Arroyave JR., Klumpp P., Nöth E.:
Unobtrusive Monitoring of Speech Impairments of Parkinson's Disease Patients Through Mobile Devices
2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018
DOI: 10.1109/ICASSP.2018.8462332
BibTeX: Download
Arias Vergara T., Vasquez Correa J., Rafael Orozco-Arroyave J., Klumpp P., Nöth E.:
Unobtrusive Monitoring of Speech Impairments of Parkinson'S Disease Patients Through Mobile Devices
43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP.2018.8462332
URL: https://ieeexplore.ieee.org/abstract/document/8462332
BibTeX: Download

2017

Authored Books

Vasquez Correa J., Castrillon R., Arias Vergara T., Orozco Arroyave JR., Nöth E.:
Speaker model to monitor the neurological state and the dysarthria level of patients with parkinson’s disease
Springer Verlag, 2017
ISBN: 9783319642055
DOI: 10.1007/978-3-319-64206-2_31
BibTeX: Download

Conference Contributions

Klumpp P., Janu T., Arias Vergara T., Vasquez Correa J., Rafael Orozco-Arroyave J., Nöth E.:
Apkinson — A Mobile Monitoring Solution for Parkinson's Disease
Interspeech 2017
In: Interspeech 2017 2017
URL: https://www.researchgate.net/profile/Juan_Vasquez12/publication/319185470_Apkinson_-_A_Mobile_Monitoring_Solution_for_Parkinson's_Disease/links/59b9fecfa6fdcc68723177dc/Apkinson-A-Mobile-Monitoring-Solution-for-Parkinsons-Disease.pdf
BibTeX: Download
Arias Vergara T., Klumpp P., Vasquez Correa J., Orozco Arroyave JR., Nöth E.:
Parkinson’s disease progression assessment from speech using a mobile device-based application
20th International Conference on Text, Speech and Dialogue, TSD 2017
In: TSD 2017: Text, Speech, and Dialogue 2017
DOI: 10.1007/978-3-319-64206-2_42
BibTeX: Download

2016

Conference Contributions

Arias Vergara T., Vasquez Correa J., Orozco-Arroyave JR., Vargas-Bonilla JF., Haderlein T., Nöth E.:
Gender-dependent GMM-UBM for Tracking Parkinson's Disease Progression from Speech
12. ITG Fachtagung Sprachkommunikation (Paderborn)
In: Speech Communication - 12. ITG Fachtagung Sprachkommunikation, Berlin: 2016
URL: https://www5.informatik.uni-erlangen.de/Forschung/Publikationen/2016/Arias-Vergara16-GGF.pdf
BibTeX: Download

Thesis Supervision

Type	Title	Status
MA thesis	Multi-Task Deep Learning for Parkinson’s Disease: Classification and Severity Estimation via Smartwatch Data	running
MA thesis	Analysis of Speech Production Assessment of Cochlear Implant Users	running
MA thesis	PaiChat: A Visual – Language Assistant for Histopathology	running
MA thesis	Pathological Voice Analysis with Selective State Space Models	running
Project	Interpretable Vision Transformers with Attention Maps for Phonological Precision Assessment from MRI	finished
MA thesis	Fast heart sound detection using audio fingerprint	running
MA thesis	Automatic Assessment of Parkinson’s Disease Using Audio and Text Analyses	running
MA thesis	Removing age bias in the context of pathological speech	running
MA thesis	Influence of Demographic Parameters in Radar-Based Blood Pressure Estimation	finished
Project	Deep Learning-Based Classification of Skin Diseases: A Comparative Analysis of CNN and Transformer Architectures	finished
Project	Influence of Age in Neural Embeddings to Analyze Voice Disorders of Parkinson’s Disease Patients	finished
MA thesis	Generative Modeling for Glottal Signals Synthesis	running
MA thesis	Enhancing Lithium-Ion Battery Safety	finished
MA thesis	Generation of Region-guided Clinical Text Reports from Chest X-Ray Images Using LLMs	finished
MA thesis	Stammering Identification using Large Language Models	finished
MA thesis	Annotation by Speech in Radiology	running
MA thesis	Investigating Liquidity Forecasting with Point-Based and Probabilistic Models to Enhance Financial Business Operations	finished
MA thesis	Enhancing SBOM Creation with Large Language Models	finished
MA thesis	Signal-Specific Fault Detection in Controller Area Network using Deep Learning	finished
MA thesis	Knowledge Distillation of Large Language Models for Automotive HMI Applications	finished
BA thesis	Automatic Speech Recognition at Phoneme and Word-Level To Analyze Parkinson’s Disease	finished
MA thesis	Speech-Based Classification of Parkinson’s Disease Under Acoustic Variability	finished
MA thesis	Large Language Models for Knowledge Management in Engineering Projects	running
MA thesis	Identification of failure detection patterns in log files of Computer Tomography systems	finished
Project	TSI Challenge Summer 2024: Heat & Water Demand Forecasting	finished
MA thesis	Text Generation in Alzheimer’s Disease	finished
MA thesis	Improving Text Summarization through Guided Decoding of Language Models	finished
MA thesis	Spoken Language Identification for Hearing Aids	finished
MA thesis	Understanding Odor Descriptors through Advanced NLP Models and Semantic Scores	finished
Project	Generation of Clinical Text Reports from Chest X-Ray Images	finished
MA thesis	Cross-Dataset Phonological Speech Analysis of Children with Cleft Lip and Palate	finished
Project	Automatic recognition of bavarian dialects	finished
MA thesis	Large Language Model for Generation of Structured Medical Report from X-ray Transcriptions	finished
MA thesis	Natural Language Text Generation for Symbolic Descriptions Using Language Models	finished
MA thesis	Development of a deep learning approach to detect faulty axial bearing components after assembly using acoustic signals	finished
MA thesis	Edge-AI: Self-sensing backpressure estimation in piezoelectric micropumps using machine learning methods on a limited hardware	finished
BA thesis	CoachLea: An Android Application to evaluate the progress of speaking and hearing abilities of children with Cochlear Implant	finished
BA thesis	CITA: An Android-based Application to Evaluate the Speech of Cochlear Implant Users	finished