Dr.-Ing. Tomás Arias Vergara

Lehrstuhl für Informatik 5 (Mustererkennung)

Research associates

Address

Martensstraße 3
91058 Erlangen
10.134 10

Contact

tomas.arias@fau.de
+49 9131 85-27872
lme.tf.fau.de/person/arias/

https://tariasvergara.github.io/CV_Tomas_AriasVergara.pdf

2025

Israel ISF-DFG: The effect of social interactions on (dis)honest communication

(Third Party Funds Single)

Project leader: Tomás Arias Vergara
Term: since September 1, 2025
Funding source: DFG-Einzelförderung / Sachbeihilfe (EIN-SBH)

Abstract

Animals make decisions based on cues that they receive from conspecifics. Acoustic cues, encoded in vocal frequencies, tempo, and syntax, convey honest information on, for example, the age and social status of the singer. Yet numerous studies in birds, anurans, and humans show that in social interactions (e.g., duets and counter-singing), acoustic features shift compared to solo singing. This raises intriguing questions: How do social interactions affect honesty? Do they prompt a dishonest display of traits or a correction to display honest ones? Here, we propose to elucidate whether and how these social-context-affected changes reflect the honest depiction of individual traits, a topic that remains underexplored. For over 25 years, we have been studying the complex songs of male wild rock hyraxes (Procavia capensis), focusing on solo songs (performed spontaneously) and counter-singing (e.g., induced by other male songs). Male hyrax songs include a challenging sound called the "snort," a harsh sound developed with age, which varies with weight, social status, and hormone levels. While prior research shows that snorts differ between solo and counter-singing, their honesty in reflecting the singer’s traits has not been assessed. This study aims to evaluate whether counter-singing represents the responder's traits better than solo singing and to understand how social factors like weight and status affect these acoustic differences. To this end, we will adapt/implement novel methods originally designed for monitoring orcas (Orcinus orca), including denoising, segmentation, encoding, and generating artificial calls. Hyrax songs will be analyzed using deep learning methods to cluster the songs, thus enabling automatic identification of distinct singers’ vocalizations (including ones we fail to observe in the field). Furthermore, new songs will be synthesized using generative AI to examine counter-singing behavior in the field.
The effect of social context on the honest transmission of individual traits will be measured using acoustic analysis by contrasting the responses with the singer's solo songs. We hypothesize that vocal behavior will change according to both the initiating singer’s (or playback’s) and responder’s individual characteristics. This study is expected to elucidate whether social constraints generate honesty and expand our understanding of the contribution of sociality to the evolution of vocal communication in nature.

2024

A multimodal approach for automatic generation of radiology reports using chest X-ray images, clinical free-text, and spoken commands.

(FAU Funds)

Project leader: Tomás Arias Vergara
Term: January 15, 2024 - January 14, 2025

Abstract

Advancements in Artificial Intelligence (AI) methods have enabled the development of Large Language Models (LLMs) capable of generating information from user instructions and supporting various tasks in education, research, healthcare, and others. AI has also impacted the field of medical imaging with several deep learning models capable of achieving expert-level performance across different tasks, e.g., detection, segmentation, and assisted clinical diagnosis. In addition, open-source Automatic Speech Recognition (ASR) systems can be incorporated as modules in AI-based systems. This proposed funded project aims to combine LLMs, medical imaging, and speech recognition using AI methods to generate high-quality radiology reports from chest X-ray images.

Coordinated grid protection based on machine learning methods

(Third Party Funds Single)

Project leader: Andreas Maier, Johann Jäger, Siming Bayer
Term: July 1, 2024 - June 30, 2027
Acronym: Netzschutz-KI
Funding source: DFG-Einzelförderung / Sachbeihilfe (EIN-SBH)

2017

Training Network on Automatic Processing of PAthological Speech

(Third Party Funds Group – Overall project)

Term: November 1, 2017 - October 31, 2021
Acronym: TAPAS
Funding source: Innovative Training Networks (ITN)
URL: https://www.tapas-etn-eu.org/

Abstract

There is an increasing number of people across Europe with debilitating speech pathologies (e.g., due to stroke, Parkinson's, etc.). These groups face communication problems that can lead to social exclusion. They are now being further marginalised by a new wave of speech technology that is increasingly woven into everyday life but which is not robust to atypical speech. TAPAS is a Horizon 2020 Marie Skłodowska-Curie Actions Innovative Training Network European Training Network (MSCA-ITN-ETN) project that aims to transform the well-being of these people.
The TAPAS work programme targets three key research problems:
(a) Detection: We will develop speech processing techniques for early detection of conditions that impact on speech production. The outcomes will be cheap and non-invasive diagnostic tools that provide early warning of the onset of progressive conditions such as Alzheimer's and Parkinson's.
(b) Therapy: We will use newly-emerging speech processing techniques to produce automated speech therapy tools. These tools will make therapy more accessible and more individually targeted. Better therapy can increase the chances of recovering intelligible speech after traumatic events such as a stroke or oral surgery.
(c) Assisted Living: We will re-design current speech technology so that it works well for people with speech impairments and also helps in making informed clinical choices. People with speech impairments often have other co-occurring conditions making them reliant on carers. Speech-driven tools for assisted-living are a way to allow such people to live more independently.
TAPAS adopts an inter-disciplinary and multi-sectorial approach. The consortium includes clinical practitioners, academic researchers and industrial partners, with expertise spanning speech engineering, linguistics and clinical science. All members have expertise in some element of pathological speech. This rich network will train a new generation of 15 researchers, equipping them with the skills and resources necessary for lasting success.

2026

Conference Contributions

Hernandez A., Arias Vergara T., Maier A., Perez Toro PA.:
Confidence-guided error correction for disordered speech recognition
In: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain: 2026
DOI: 10.1109/ICASSP55912.2026.11463078
Perez Toro PA., Arias Vergara T., Bueß L., Hutter J., Woo J., Maier A.:
Cross-modal diffusion for region-aligned vocal tract MRI synthesis
DOI: 10.1117/12.3087986
URL: https://www.spiedigitallibrary.org/conference-proceedings-of-spie/13930/3087986/Crossmodal-diffusion-for-regionaligned-vocal-tract-MRI-synthesis/10.1117/12.3087986.short

Unpublished Publications

Islam Bhuiyan MR., Bhat S., Qahqaie M., Nguyen TT., Perez-Toro PA., Arias-Vergara T., Maier A.:
LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation (Conference contribution, submitted)
URL: https://arxiv.org/abs/2603.17576

2025

Journal Articles

Hung H., Piske T., Perez Toro PA., Arias Vergara T., Maier A.:
kidsNARRATE: a versatile corpus for studying Chinese-English bilingual L2 narrative skills in preschoolers
In: Language Resources and Evaluation 59 (2025), p. 3117-3138
ISSN: 1574-020X
DOI: 10.1007/s10579-025-09851-2
Schraut T., Schützenberger A., Arias Vergara T., Kunduk M., Echternach M., Dürr S., Werz J., Döllinger M.:
Machine learning based assessment of hoarseness severity: a multi-sensor approach centered on high-speed videoendoscopy
In: Frontiers in Artificial Intelligence 8 (2025), Article No.: 1601716
ISSN: 2624-8212
DOI: 10.3389/frai.2025.1601716
Weise T., Demir K., Perez Toro PA., Arias Vergara T., Maier A., Nöth E., Schuster M., Heismann B., Yang SH.:
Towards End-to-End Speech Articulation and Spoken Language Analysis Using Deep Learning
In: Human-Centric Intelligent Systems 5 (2025), p. 103-122
ISSN: 2667-1336
DOI: 10.1007/s44230-025-00094-6

Conference Contributions

Ramachandran A., Neergaard TFB., Arias Vergara T., Maier A., Bayer S.:
Water Demand Forecasting of District Metered Areas through Learned Consumer Representations
The 33rd European Signal Processing Conference (EUSIPCO 2025) (Palermo, Italy, September 8, 2025 - September 12, 2025)
In: Water Demand Forecasting of District Metered Areas through Learned Consumer Representations 2025
URL: https://eusipco2025.org/wp-content/uploads/pdfs/0001842.pdf
Oelhaf J., Kordowich G., Perez Toro PA., Arias Vergara T., Maier A., Jäger J., Bayer S.:
A Systematic Evaluation of Machine Learning Methods for Fault Detection and Line Identification in Electrical Power Grids
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Hyderabad, April 6, 2025 - April 11, 2025)
In: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New York City: 2025
DOI: 10.1109/ICASSP49660.2025.10890544
Schwarz A., Dickmann J., Hofmann C., Szkitsak J., Bert C., Maier A., Arias Vergara T.:
Cross-Modality Image Quality Prediction for Time-Resolved CT from Breathing Signals
Workshop on Longitudinal Disease Tracking and Modeling with Medical Images and Data, LDTM 2024, 5th International Workshop on Multiscale Multimodal Medical Imaging, MMMI 2024, 1st Workshop on Machine Learning for Multimodal/-sensor Healthcare Data, ML4MHD2024 and Workshop on Multimodal Learning and Fusion Across Scales for Clinical Decision Support, ML-CDS 2024 held in conjunction with the 27th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2024 (Marrakesh, October 6, 2024 - October 10, 2024)
In: Anna Schroder, Xiang Li, Tanveer Syeda-Mahmood, Neil P. Oxtoby, Alexandra Young, Alessa Hering, Tejas S. Mathai, Pritam Mukherjee, Sven Kuckertz, Tiantian He, Isaac Llorente-Saguer, Andreas Maier, Satyananda Kashyap, Hayit Greenspan, Anant Madabhushi (ed.): Lecture Notes in Computer Science 2025
DOI: 10.1007/978-3-031-84525-3_13

2024

Journal Articles

Bi M., Nguyen DD., Arias Vergara T., Döllinger M., Holik J., Madill C.:
Effects of Instructed Laryngeal Manipulation on Vocal Rise Time
In: Journal of Voice (2024)
ISSN: 0892-1997
DOI: 10.1016/j.jvoice.2024.10.009
Arias Vergara T., Madill C., Nguyen D., Holik J., Döllinger M.:
VOAT: Voice Onset Analysis Tool
In: SoftwareX 27 (2024), Article No.: 101802
ISSN: 2352-7110
DOI: 10.1016/j.softx.2024.101802
Schraut T., Schützenberger A., Arias Vergara T., Kunduk M., Echternach M., Döllinger M.:
Machine learning based estimation of hoarseness severity using sustained vowels
In: Journal of the Acoustical Society of America 155 (2024), p. 381-395
ISSN: 0001-4966
DOI: 10.1121/10.0024341
Lesyk E., Arias Vergara T., Nöth E., Maier A., Orozco-Arroyave JR., Perez Toro PA.:
Empathetic Deep Learning: Transferring Adult Speech Emotion Models to Children With Gender-Specific Adaptations Using Neural Embeddings
In: Human-Centric Intelligent Systems 4 (2024), p. 633-642
ISSN: 2667-1336
DOI: 10.1007/s44230-024-00088-w
Chacon AM., Nguyen DD., Holik J., Döllinger M., Arias Vergara T., Madill CJ.:
Vowel onset measures and their reliability, sensitivity and specificity: A systematic literature review
In: PLoS ONE 19 (2024), Article No.: e0301786
ISSN: 1932-6203
DOI: 10.1371/journal.pone.0301786

Conference Contributions

Kulyabin M., Sokolov G., Galaida A., Maier A., Arias Vergara T.:
SNOBERT: A Benchmark for clinical notes entity linking in the SNOMED CT clinical terminology
27th International Conference on Pattern Recognition (Kolkata, India, December 1, 2024 - December 5, 2024)
In: Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal (ed.): Proceedings of the 27th International Conference on Pattern Recognition 2024 2024
DOI: 10.1007/978-3-031-78119-3_11
URL: https://arxiv.org/abs/2405.16115
Vysotskaya N., Maul N., Fusco A., Hazra S., Harnisch J., Arias Vergara T., Maier A.:
Transforming Cardiovascular Health: a Transformer-Based Approach to Continuous, Non-Invasive Blood Pressure Estimation via Radar Sensing
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (Seoul, April 14, 2024 - April 19, 2024)
In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New York City: 2024
DOI: 10.1109/ICASSP48485.2024.10446988
Arias Vergara T., Perez Toro PA., Liu X., Xing F., Stone M., Zhuo J., Prince JL., Schuster M., Nöth E., Woo J., Maier A.:
Contrastive Learning Approach for Assessment of Phonological Precision in Patients with Tongue Cancer Using MRI Data
25th Interspeech Conference 2024 (Kos Island, September 1, 2024 - September 5, 2024)
In: Interspeech 2024 2024
DOI: 10.21437/Interspeech.2024-2236
Perez Toro PA., Arias Vergara T., Klumpp P., Weise T., Schuster M., Nöth E., Orozco Arroyave JR., Maier A.:
Multilingual Speech and Language Analysis for the Assessment of Mild Cognitive Impairment: Outcomes from the Taukadial Challenge
25th Interspeech Conference 2024 (Kos Island, September 1, 2024 - September 5, 2024)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2024
DOI: 10.21437/Interspeech.2024-2115
Liu X., Xing F., Bian Z., Arias Vergara T., Perez Toro PA., Maier A., Stone M., Zhuo J., Prince JL., Woo J.:
Tagged-to-Cine MRI Sequence Synthesis via Light Spatial-Temporal Transformer
27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024 (Marrakesh, October 6, 2024 - October 10, 2024)
In: Medical Image Computing and Computer Assisted Intervention – MICCAI 2024, Cham: 2024
DOI: 10.1007/978-3-031-72104-5_67

2023

Authored Books

Weise T., Maier A., Demir K., Perez Toro PA., Arias Vergara T., Heismann B., Nöth E., Schuster ME., Yang SH.:
Impact of Including Pathological Speech in Pre-training on Pathology Detection
Springer Science and Business Media Deutschland GmbH, 2023
ISBN: 9783031404979
DOI: 10.1007/978-3-031-40498-6_13

Journal Articles

Arias Vergara T., Döllinger M., Schraut T., Mohd Khairuddin KA., Schützenberger A.:
Nyquist Plot Parametrization for Quantitative Analysis of Vibration of the Vocal Folds
In: Journal of Voice (2023)
ISSN: 0892-1997
DOI: 10.1016/j.jvoice.2023.01.014

Conference Contributions

Weise T., Maier A., Demir K., Perez Toro PA., Arias Vergara T., Heismann B., Nöth E., Schuster M., Yang SH.:
Impact of Including Pathological Speech in Pre-training on Pathology Detection
TSD 2023: Text, Speech, and Dialogue (Pilsen, September 4, 2023 - September 6, 2023)
In: Kamil Ekštein, František Pártl, Miloslav Konopík (ed.): Text, Speech, and Dialogue, Cham: 2023
DOI: 10.1007/978-3-031-40498-6_13
Escobar-Grisales D., Arias-Vergara T., Rios-Urrego CD., Nöth E., García AM., Orozco-Arroyave JR.:
An Automatic Multimodal Approach to Analyze Linguistic and Acoustic Cues on Parkinson's Disease Patients
24th International Speech Communication Association, Interspeech 2023 (Dublin, IRL, August 20, 2023 - August 24, 2023)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023
DOI: 10.21437/Interspeech.2023-2287
Perez Toro PA., Arias Vergara T., Braun F., Hönig F., Tobón-Quintero CA., Aguillón D., Lopera F., Hincapié-Henao L., Schuster M., Riedhammer K., Maier A., Nöth E., Orozco Arroyave JR.:
Automatic Assessment of Alzheimer's across Three Languages Using Speech and Language Features
24th International Speech Communication Association, Interspeech 2023 (Dublin, IRL, August 20, 2023 - August 24, 2023)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023
DOI: 10.21437/Interspeech.2023-2079
Hung H., Perez Toro PA., Arias Vergara T., Maier A., Nöth E.:
Speaking Clearly, Understanding Better: Predicting the L2 Narrative Comprehension of Chinese Bilingual Kindergarten Children Based on Speech Intelligibility Using a Machine Learning Approach
Interspeech 2023 (Dublin, August 20, 2023 - August 24, 2023)
In: Interspeech 2023 2023
DOI: 10.21437/Interspeech.2023-2057
Perez Toro PA., Rodriguez Salas D., Arias Vergara T., Bayerl SP., Klumpp P., Riedhammer KT., Schuster M., Nöth E., Maier A., Orozco Arroyave JR.:
Transferring Quantified Emotion Knowledge for the Detection of Depression in Alzheimer’s Disease Using Forestnets
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Rhodes Island, June 4, 2023 - June 10, 2023)
In: ICASSP 2023 2023
DOI: 10.1109/ICASSP49357.2023.10095219
Arias Vergara T., Londoño-Mora E., Perez Toro PA., Schuster M., Nöth E., Orozco Arroyave JR., Maier A.:
Measuring Phonological Precision in Children with Cleft Lip and Palate
24th International Speech Communication Association, Interspeech 2023 (Dublin, August 20, 2023 - August 24, 2023)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023
DOI: 10.21437/Interspeech.2023-2099

2022

Authored Books

Perez Toro PA., Klumpp P., Vásquez-Correa JC., Schuster M., Nöth E., Orozco-Arroyave JR., Arias Vergara T.:
50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders
Springer Science and Business Media Deutschland GmbH, 2022
ISBN: 9783031162695
DOI: 10.1007/978-3-031-16270-1_29
Arias Vergara T.:
Analysis of Pathological Speech Signals
Erlangen, Bayern, Germany: Logos Verlag Berlin GmbH, 2022
(Studien zur Mustererkennung, Vol.50)
ISBN: 978-3-8325-5561-0
URL: https://logos-verlag.eu/cgi-bin/engbuchmid?isbn=5561&lng=eng&id=

Journal Articles

Arias-Vergara T., Batliner A., Rader T., Polterauer D., Högerle C., Müller J., Orozco-Arroyave JR., Nöth E., Schuster M.:
Adult Cochlear Implant Users Versus Typical Hearing Persons: An Automatic Analysis of Acoustic–Prosodic Parameters
In: Journal of Speech Language and Hearing Research 65 (2022), p. 4623-4636
ISSN: 1092-4388
DOI: 10.1044/2022_JSLHR-21-00116
Perez Toro PA., Rodriguez Salas D., Arias Vergara T., Klumpp P., Schuster M., Nöth E., Orozco Arroyave JR., Maier A.:
Interpreting acoustic features for the assessment of Alzheimer's disease using ForestNet
In: Smart Health 26 (2022), Article No.: 100347
ISSN: 2352-6483
DOI: 10.1016/j.smhl.2022.100347
Schraut T., Schützenberger A., Arias Vergara T., Kunduk M., Echternach M., Döllinger M.:
Machine learning based estimation of hoarseness severity from sustained vowels
In: Journal of the Acoustical Society of America 152 (2022), p. A141-A141
ISSN: 0001-4966
DOI: 10.1121/10.0015825
Arias Vergara T., Schraut T., Orozco-Arroyave JR., Döllinger M.:
Parameterization of voice onset for automatic assessment of Parkinson’s disease
In: Journal of the Acoustical Society of America 152 (2022), p. A140-A140
ISSN: 0001-4966
DOI: 10.1121/10.0015820
Perez Toro PA., Arias Vergara T., Klumpp P., Vásquez-Correa JC., Schuster M., Nöth E., Orozco Arroyave JR.:
Depression assessment in people with Parkinson's disease: The combination of acoustic features and natural language processing
In: Speech Communication 145 (2022), p. 10-20
ISSN: 0167-6393
DOI: 10.1016/j.specom.2022.09.001

Conference Contributions

Dürr S., Schützenberger A., Kist A., Semmler M., Schraut T., Arias Vergara T., Döllinger M.:
High-speed video endoscopy to improve the diagnosis of voice disorders
DOI: 10.1055/s-0042-1746963
Pérez-Toro PA., Klumpp P., Vasquez-Correa JC., Schuster M., Nöth E., Orozco-Arroyave JR., Arias Vergara T.:
50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders
25th International Conference on Text, Speech, and Dialogue, TSD 2022 (Brno, CZE, September 6, 2022 - September 9, 2022)
In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2022
DOI: 10.1007/978-3-031-16270-1_29
Perez Toro PA., Klumpp P., Hernandez A., Arias Vergara T., Lillo P., Slachevsky A., García AM., Schuster M., Maier A., Nöth E., Orozco Arroyave JR.:
Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
DOI: 10.21437/Interspeech.2022-10883
Schäfer P., Perez Toro PA., Klumpp P., Orozco-Arroyave JR., Nöth E., Maier A., Abad A., Schuster M., Arias Vergara T.:
CoachLea: an Android Application to Evaluate the Speech Production and Perception of Children with Hearing Loss
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, KOR, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
URL: https://www.scopus.com/record/display.uri?eid=2-s2.0-85140086069&origin=inward
tom Dieck T., Perez Toro PA., Arias Vergara T., Nöth E., Klumpp P.:
Wav2vec behind the Scenes: How end2end Models learn Phonetics
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
DOI: 10.21437/Interspeech.2022-10865

2021

Journal Articles

Garcia AM., Arias-Vergara T., Vasquez-Correa JC., Nöth E., Schuster M., Welch AE., Bocanegra Y., Baena A., Orozco-Arroyave JR.:
Cognitive Determinants of Dysarthria in Parkinson's Disease: An Automated Machine Learning Approach
In: Movement Disorders (2021)
ISSN: 0885-3185
DOI: 10.1002/mds.28751
Klumpp P., Arias Vergara T., Vásquez-Correa JC., Perez Toro PA., Orozco-Arroyave JR., Batliner A., Nöth E.:
The Phonetic Footprint of Parkinson's Disease
In: Computer Speech and Language 72 (2021)
ISSN: 0885-2308
DOI: 10.1016/j.csl.2021.101321
URL: https://www.sciencedirect.com/science/article/abs/pii/S0885230821001169
Vásquez-Correa JC., Rios-Urrego CD., Arias Vergara T., Schuster M., Rusz J., Nöth E., Orozco-Arroyave JR.:
Transfer learning helps to improve the accuracy to classify patients with different speech disorders in different languages
In: Pattern Recognition Letters (2021)
ISSN: 0167-8655
DOI: 10.1016/j.patrec.2021.04.011

Conference Contributions

Perez Toro PA., Vasquez Correa J., Arias Vergara T., Klumpp P., Sierra-Castrillón M., Roldán-López ME., Aguillón D., Hincapié-Henao L., Tobón-Quintero CA., Bocklet T., Schuster M., Orozco-Arroyave JR., Nöth E.:
Acoustic and Linguistic Analyses to Assess Early-Onset and Genetic Alzheimer's Disease
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP39728.2021.9414009
URL: https://ieeexplore.ieee.org/abstract/document/9414009
Perez Toro PA., Vásquez-Correa JC., Arias Vergara T., Klumpp P., Schuster M., Nöth E., Orozco-Arroyave JR.:
Emotional State Modeling for the Assessment of Depression in Parkinson’s Disease
24th International Conference on Text, Speech, and Dialogue, TSD 2021 (Olomouc, CZE, September 6, 2021 - September 9, 2021)
In: Kamil Ekštein, František Pártl, Miloslav Konopík (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2021
DOI: 10.1007/978-3-030-83527-9_39
Vasquez Correa J., Arias Vergara T., Klumpp P., Perez Toro PA., Orozco-Arroyave JR., Nöth E.:
End-2-End Modeling of Speech and Gait from Patients with Parkinson’s Disease: Comparison Between High Quality Vs. Smartphone Data
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP39728.2021.9414729
URL: https://ieeexplore.ieee.org/abstract/document/9414729
Perez Toro PA., Bayerl S., Arias Vergara T., Vasquez Correa J., Klumpp P., Schuster M., Nöth E., Orozco-Arroyave JR., Riedhammer K.:
Influence of the Interviewer on the Automatic Assessment of Alzheimer’s Disease in the Context of the ADReSSo Challenge
22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021 (August 30, 2021 - September 3, 2021)
In: Proc. Interspeech 2021 2021
DOI: 10.21437/Interspeech.2021-1589
Parra-Gallego LF., Arias-Vergara T., Orozco-Arroyave JR.:
Robust Automatic Speech Recognition for Call Center Applications
8th Workshop on Engineering Applications, WEA 2021 (Virtual, Online, October 6, 2021 - October 8, 2021)
In: Juan Carlos Figueroa-García, Yesid Díaz-Gutierrez, Elvis Eduardo Gaona-García, Alvaro David Orjuela-Cañón (ed.): Communications in Computer and Information Science 2021
DOI: 10.1007/978-3-030-86702-7_7
Klumpp P., Bocklet T., Arias Vergara T., Vasquez Correa J., Perez Toro PA., Bayerl S., Orozco-Arroyave JR., Nöth E.:
The Phonetic Footprint of Covid-19
Interspeech 2021 (August 30, 2021 - September 3, 2021)
In: Proc. Interspeech 2021 2021
DOI: 10.21437/Interspeech.2021-1488

2020

Journal Articles

Arias Vergara T., Arguello-Velez P., Vasquez Correa J., Nöth E., Schuster M., González-Rátiva MC., Orozco Arroyave JR.:
Automatic detection of Voice Onset Time in voiceless plosives using gated recurrent units
In: Digital Signal Processing 104 (2020), Article No.: 102779
ISSN: 1051-2004
DOI: 10.1016/j.dsp.2020.102779
Vasquez Correa J., Arias Vergara T., Schuster M., Orozco Arroyave JR., Nöth E.:
Parallel Representation Learning for the Classification of Pathological Speech: Studies on Parkinson's Disease and Cleft Lip and Palate
In: Speech Communication 122 (2020), p. 56-67
ISSN: 0167-6393
DOI: 10.1016/j.specom.2020.07.005
Orozco-Arroyave JR., Vasquez Correa J., Klumpp P., Perez Toro PA., Escobar-Grisales D., Roth N., Ríos-Urrego CD., Strauß M., Carvajal-Castaño HA., Bayerl S., Castrillón-Osorio LR., Arias-Vergara T., Küderle A., López-Pabón FO., Parra-Gallego LF., Eskofier B., Gómez-Gómez LF., Schuster M., Nöth E.:
Apkinson: the smartphone application for telemonitoring Parkinson's patients through speech, gait and hands movement
In: Neurodegenerative Disease Management (2020)
ISSN: 1758-2024
DOI: 10.2217/nmt-2019-0037
Perez Toro PA., Vasquez Correa J., Arias Vergara T., Nöth E., Orozco-Arroyave JR.:
Nonlinear dynamics and Poincaré sections to model gait impairments in different stages of Parkinson's disease
In: Nonlinear Dynamics 100 (2020), p. 3253-3276
ISSN: 0924-090X
DOI: 10.1007/s11071-020-05691-7

Conference Contributions

Argüello-Vélez P., Arias-Vergara T., González-Rátiva MC., Orozco-Arroyave JR., Nöth E., Schuster ME.:
Acoustic characteristics of VOT in plosive consonants produced by Parkinson’s patients
23rd International Conference on Text, Speech, and Dialogue, TSD 2020 (Brno, September 8, 2020 - September 11, 2020)
In: Petr Sojka, Ivan Kopecek, Karel Pala, Aleš Horák (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2020
DOI: 10.1007/978-3-030-58323-1_33
Klumpp P., Arias Vergara T., Vasquez Correa J., Perez Toro PA., Hönig FT., Nöth E., Orozco-Arroyave JR.:
Surgical mask detection with deep recurrent phonetic models
21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020
DOI: 10.21437/Interspeech.2020-1723

2019

Book Contributions

Vasquez Correa J., Arias Vergara T., Rios-Urrego CD., Schuster M., Rusz J., Orozco Arroyave JR., Nöth E.:
Convolutional Neural Networks and a Transfer Learning Strategy to Classify Parkinson’s Disease from Speech in Three Different Languages
In: Ingela Nyström, Yanio Hernández Heredia, Vladimir Milián Núñez (ed.): Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2019, p. 697-706 (Image Processing, Computer Vision, Pattern Recognition, and Graphics, Vol.11896)
ISBN: 9783030339036
DOI: 10.1007/978-3-030-33904-3_66
Arias Vergara T., Vasquez Correa J., Gollwitzer S., Orozco-Arroyave JR., Schuster M., Nöth E.:
Multi-channel Convolutional Neural Networks for Automatic Detection of Speech Deficits in Cochlear Implant Users
In: Ingela Nyström, Yanio Hernández Heredia, Vladimir Milián Núñez (ed.): Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2019, p. 679-687 (Image Processing, Computer Vision, Pattern Recognition, and Graphics, Vol.11896)
ISBN: 9783030339036
DOI: 10.1007/978-3-030-33904-3_64

Conference Contributions

Arias Vergara T., Orozco Arroyave JR., Vasquez Correa J., Nöth E., Schuster M., Gollwitzer S., Högerle C.:
Speech differences between CI users with pre- and postlingual onset of deafness detected by speech processing methods on voiceless to voice transitions
90. Jahresversammlung der Deutschen Gesellschaft für Hals-Nasen-Ohren-Heilkunde, Kopf- und Hals-Chirurgie (Estrel Congress Center Berlin, May 29, 2019 - June 1, 2019)
In: Laryngo-Rhino-Otol 2019
DOI: 10.1055/s-0039-168632
Arias Vergara T., Gollwitzer S., Orozco Arroyave JR., Schuster M., Nöth E.:
Consonant-to-Vowel/Vowel-to-Consonant Transitions to Analyze the Speech of Cochlear Implant Users.
Text, Speech, and Dialogue 2019 (Ljubljana, September 11, 2019 - September 13, 2019)
In: Kamil Ekštein (ed.): Lecture Notes in Computer Science 2019
DOI: 10.1007/978-3-030-27947-9_25
Arias Vergara T., Vasquez Correa J., Gollwitzer S., Orozco Arroyave JR., Schuster M., Nöth E.:
Multi-channel Convolutional Neural Networks for Automatic Detection of Speech Deficits in Cochlear Implant Users
Iberoamerican Congress on Pattern Recognition
In: CIARP 2019: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications 2019
DOI: 10.1007/978-3-030-33904-3_64
Arias Vergara T., Orozco-Arroyave JR., Cernak M., Gollwitzer S., Schuster M., Nöth E.:
Phone-attribute posteriors to evaluate the speech of cochlear implant users
20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019 (Graz, September 15, 2019 - September 19, 2019)
In: Gernot Kubin, Thomas Hain, Bjorn Schuller, Dina El Zarka, Petra Hodl (ed.): Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019
DOI: 10.21437/Interspeech.2019-2144

2018

Journal Articles

Vasquez Correa J., Arias Vergara T., Orozco-Arroyave JR., Eskofier B., Klucken J., Nöth E.:
Multimodal assessment of Parkinson's disease: a deep learning approach
In: IEEE Journal of Biomedical and Health Informatics (2018)
ISSN: 2168-2194
DOI: 10.1109/JBHI.2018.2866873
URL: https://ieeexplore.ieee.org/document/8444654
Arias Vergara T., Vasquez Correa J., Orozco-Arroyave JR., Nöth E.:
Speaker models for monitoring Parkinson’s disease progression considering different communication channels and acoustic conditions
In: Speech Communication 101 (2018), p. 11-25
ISSN: 0167-6393
DOI: 10.1016/j.specom.2018.05.007
URL: https://www.sciencedirect.com/science/article/abs/pii/S0167639317304454

Conference Contributions

Vasquez Correa J., Arias Vergara T., Orozco-Arroyave JR., Nöth E.:
A Multitask Learning Approach to Assess the Dysarthria Severity in Patients with Parkinson's Disease
INTERSPEECH
In: Proceedings of INTERSPEECH 2018
DOI: 10.21437/Interspeech.2018-1988
URL: https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1988.html
Perez Toro PA., Vasquez-Correa JC., Arias-Vergara T., Garcia-Ospina N., Orozco-Arroyave JR., Nöth E.:
A Non-linear Dynamics Approach to Classify Gait Signals of Patients with Parkinson's Disease
5th Workshop on Engineering Applications (WEA) (Medellin, Colombia, October 17, 2018 - October 19, 2018)
In: Applied Computer Sciences in Engineering, WEA 2018, Pt. II, Berlin: 2018
DOI: 10.1007/978-3-030-00353-1_24
Parra-Gallego LF., Arias-Vergara T., Vasquez-Correa JC., Garcia-Ospina N., Orozco-Arroyave JR., Nöth E.:
Automatic Intelligibility Assessment of Parkinson's Disease with Diadochokinetic Exercises
5th Workshop on Engineering Applications (WEA) (Medellin, October 17, 2018 - October 19, 2018)
In: Applied Computer Sciences in Engineering, WEA 2018, Pt. II, Berlin: 2018
DOI: 10.1007/978-3-030-00353-1_20
Arias Vergara T., Vasquez Correa J., Orozco-Arroyave JR., Klumpp P., Nöth E.:
Unobtrusive Monitoring of Speech Impairments of Parkinson's Disease Patients Through Mobile Devices
43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP.2018.8462332
URL: https://ieeexplore.ieee.org/abstract/document/8462332

2017

Authored Books

Vasquez Correa J., Castrillon R., Arias Vergara T., Orozco Arroyave JR., Nöth E.:
Speaker model to monitor the neurological state and the dysarthria level of patients with Parkinson’s disease
Springer Verlag, 2017
ISBN: 9783319642055
DOI: 10.1007/978-3-319-64206-2_31

Conference Contributions

Klumpp P., Janu T., Arias Vergara T., Vasquez Correa J., Orozco-Arroyave JR., Nöth E.:
Apkinson — A Mobile Monitoring Solution for Parkinson's Disease
Interspeech 2017
In: Interspeech 2017
URL: https://www.researchgate.net/profile/Juan_Vasquez12/publication/319185470_Apkinson_-_A_Mobile_Monitoring_Solution_for_Parkinson's_Disease/links/59b9fecfa6fdcc68723177dc/Apkinson-A-Mobile-Monitoring-Solution-for-Parkinsons-Disease.pdf
Arias Vergara T., Klumpp P., Vasquez Correa J., Orozco Arroyave JR., Nöth E.:
Parkinson’s disease progression assessment from speech using a mobile device-based application
20th International Conference on Text, Speech and Dialogue, TSD 2017
In: TSD 2017: Text, Speech, and Dialogue 2017
DOI: 10.1007/978-3-319-64206-2_42

2016

Conference Contributions

Arias Vergara T., Vasquez Correa J., Orozco-Arroyave JR., Vargas-Bonilla JF., Haderlein T., Nöth E.:
Gender-dependent GMM-UBM for Tracking Parkinson's Disease Progression from Speech
12. ITG Fachtagung Sprachkommunikation (Paderborn)
In: Speech Communication - 12. ITG Fachtagung Sprachkommunikation, Berlin: 2016
URL: https://www5.informatik.uni-erlangen.de/Forschung/Publikationen/2016/Arias-Vergara16-GGF.pdf

Awards

2022

Tomás Arias Vergara: Junior researcher status (Ministry of Science, Technology, and Innovation (Colombia)) – 2022
Tomás Arias Vergara: Summa cum laude (Doctoral thesis) (Friedrich-Alexander Universität Erlangen-Nürnberg (Germany) & Universidad de Antioquia (Colombia)) – 2022
Tomás Arias Vergara: GI Dissertation Prize nominee (Friedrich-Alexander Universität Erlangen-Nürnberg) – 2022

2018

Tomás Arias Vergara: Early Stage Researcher under Marie Skłodowska-Curie grant (European Union’s Horizon 2020) – 2018

2017

Tomás Arias Vergara: National PhD scholarship program (Colciencias (Colombia)) – 2017
Tomás Arias Vergara: Distinction to Master thesis (Universidad de Antioquia (Colombia)) – 2017

2015

Tomás Arias Vergara: Young researchers and innovators scholarship (Colciencias (Colombia)) – 2015

Current Theses & Projects

Title	Type	Student	Period	Status
Text-based Cross-Lingual Emotion Recognition using Natural Language Processing Methods	BA thesis	Dila S. Celikkol	Apr 2026 – Oct 2026	running
Adaptive Hybrid Deep Learning Model for Personalized Electric Vehicle Energy Consumption Prediction with Continuous Learning	MA thesis	Neel Thakkar	Mar 2026	running
Large Language Models for Surgical Workflow Monitoring and Summarization	MA thesis	Daoqi Jin		running
Transfer Learning-Based Forecasting of Heat Pump Energy Consumption Across Multiple Time Horizons	MA thesis	Dhruvil Kalubhai Kalathiya	Aug 2025	running

Completed Theses & Projects

Title	Type	Student	Period	Status
Automatic Prediction of German Regional Accents	MA thesis	Veronika Stengl		finished
Investigating the Influence of Different Motion Sensors for Detecting Parkinson’s Disease	MA thesis	Mohammad Hamza	Sep 2025	finished
Reduction of die trials via machine learning approaches	MA thesis	Tai Hoang Nguyen	Sep 2025 – Mar 2026	finished
Large Language Models for Modified Frenchay Dysarthria Assessment Reports from Parkinson’s Speech: Model Choice and Prompting Effects	MA thesis	Zixuan Chai	Sep 2025 – Mar 2026	finished
Sequence-Based Deep Learning for Endovascular Device Segmentation in Interventional X-ray Imaging	MA thesis	Sleiman Sharara	Sep 2025 – Mar 2026	finished
Parkinson’s Disease Classification from Smartwatch Inertial Measurement Unit (IMU) Signals Across Structured Motor Tasks	MA thesis	Emin Mammadov	Jul 2025	finished
Analysis of Speech Production Assessment of Cochlear Implant Users	MA thesis	Tejashree Dhawle	Jul 2025	finished
PaiChat: A Visual – Language Assistant for Histopathology	MA thesis	Bhavanikbhai Kanani	Mar 2025 – Dec 2025	finished
Pathological Voice Analysis with Selective State Space Models	MA thesis	Lucca Baumgärtner	Jun 2025 – Dec 2025	finished
Interpretable Vision Transformers with Attention Maps for Phonological Precision Assessment from MRI	Project			finished
Heart sound detection using audio fingerprint	MA thesis	Shayan Alvandnyia	May 2025 – Dec 2025	finished
Automatic Assessment of Parkinson’s Disease Using Audio and Text Analyses	MA thesis	Zhipeng Peng	Mar 2025 – Sep 2025	finished
Removing age bias in the context of pathological speech	MA thesis	Yuhan Gao	Mar 2025 – Sep 2025	finished
Influence of Demographic Parameters in Radar-Based Blood Pressure Estimation	MA thesis	Felix Tobias Büppelmann	Dec 2024 – May 2025	finished
Deep Learning-Based Classification of Skin Diseases: A Comparative Analysis of CNN and Transformer Architectures	Project	Sleiman Sharara		finished
Influence of Age in Neural Embeddings to Analyze Voice Disorders of Parkinson’s Disease Patients	Project	Zixuan Chai		finished
Generative Modeling for Glottal Signals Synthesis	MA thesis	Su Wu	Jan 2025 – Jul 2025	finished
Enhancing Lithium-Ion Battery Safety	MA thesis	Youssef Bouraha	Dec 2024 – Jun 2025	finished
Generation of Region-guided Clinical Text Reports from Chest X-Ray Images Using LLMs	MA thesis	Mohammad Hasan	Dec 2024 – Jun 2025	finished
Stammering Identification using Large Language Models	MA thesis	Aagam Sunilbhai Shah	Nov 2024 – Apr 2025	finished
Annotation by Speech in Radiology	MA thesis	Jan Geier		finished
Investigating Liquidity Forecasting with Point-Based and Probabilistic Models to Enhance Financial Business Operations	MA thesis	Ram Saran Kakumanu	Oct 2024 – Mar 2025	finished
Enhancing SBOM Creation with Large Language Models	MA thesis	Gaurav Bhalala	Nov 2024 – May 2025	finished
Signal-Specific Fault Detection in Controller Area Network using Deep Learning	MA thesis	Vamsi Krishna Chalampalem	Nov 2024 – May 2025	finished
Knowledge Distillation of Large Language Models for Automotive HMI Applications	MA thesis	Aravind Ryali	Nov 2024 – May 2025	finished
Automatic Speech Recognition at Phoneme and Word-Level To Analyze Parkinson’s Disease	BA thesis	Malena Grimm Piquer	Nov 2024 – Apr 2025	finished
Speech-Based Classification of Parkinson’s Disease Under Acoustic Variability	MA thesis	Anisha Bhandare	Aug 2024 – Feb 2025	finished
Large Language Models for Knowledge Management in Engineering Projects	MA thesis	Xinyuan Tu	Oct 2024 – Apr 2025	finished
Identification of failure detection patterns in log files of Computer Tomography systems	MA thesis	Aishwarya Tandel	Oct 2024 – Apr 2025	finished
TSI Challenge Summer 2024: Heat & Water Demand Forecasting	Project		Apr 2024 – Aug 2024	finished
Text Generation in Alzheimer’s Disease	MA thesis	Mahmoud Alimizel	Jul 2024 – Jan 2025	finished
Improving Text Summarization through Guided Decoding of Language Models	MA thesis	Jannick Gluch	Jul 2024 – Jan 2025	finished
Spoken Language Identification for Hearing Aids	MA thesis	Mahmoud G. A. Sanad	Feb 2024 – Aug 2024	finished
Understanding Odor Descriptors through Advanced NLP Models and Semantic Scores	MA thesis	Fatma Mami	Feb 2024 – Aug 2024	finished
Generation of Clinical Text Reports from Chest X-Ray Images	Project	Md Hasan	Feb 2024 – Aug 2024	finished
Cross-Dataset Phonological Speech Analysis of Children with Cleft Lip and Palate	MA thesis	Marta López-Brea García	Dec 2023 – Jun 2024	finished
Automatic recognition of Bavarian dialects	Project	Veronika Stengl	Nov 2023 – Apr 2024	finished
Large Language Model for Generation of Structured Medical Report from X-ray Transcriptions	MA thesis	Uttam Asodariya	Sep 2023 – Mar 2024	finished
Natural Language Text Generation for Symbolic Descriptions Using Language Models	MA thesis	Deepak Parappagoudar	Aug 2023 – Dec 2023	finished
Development of a deep learning approach to detect faulty axial bearing components after assembly using acoustic signals	MA thesis	Gracia Apfelthaler	Aug 2023 – Feb 2024	finished
Edge-AI: Self-sensing backpressure estimation in piezoelectric micropumps using machine learning methods on a limited hardware	MA thesis	Mohammadhossien Sheikhsarraf	May 2023 – Nov 2023	finished
CoachLea: An Android Application to evaluate the progress of speaking and hearing abilities of children with Cochlear Implant	BA thesis	Paula Schäfer	Jun 2021 – Nov 2021	finished
CITA: An Android-based Application to Evaluate the Speech of Cochlear Implant Users	BA thesis	Christoph Popp	Jul 2020 – Dec 2020	finished
