Dr.-Ing. Tomás Arias Vergara, M. Sc.
I received a B.S. in Electronics Engineering from the University of Antioquia (UdeA, Colombia) in 2014, a Master of Science degree from the same institution in 2017, and a Ph.D. in a joint program between the UdeA and the FAU in 2022. Since 2015, my research has focused on speech processing and machine learning methods for the analysis of pathological speech signals resulting from neurological (e.g., Parkinson’s disease), structural (e.g., cleft lip and palate in children), and perceptual (e.g., hearing loss) disorders. I have also investigated the effect of natural aging on speech, helped develop Android-based applications for collecting and analyzing data from Parkinson’s disease patients and from adults and children with hearing loss, and researched automatic methods for the analysis of high-speed videoendoscopy data of people with voice disorders.
Projects
2017
Training Network on Automatic Processing of PAthological Speech
(Third Party Funds Group – Overall project)
Term: November 1, 2017 - October 31, 2021
Funding source: Innovative Training Networks (ITN)
URL: https://www.tapas-etn-eu.org/
There is an increasing number of people across Europe with debilitating speech pathologies (e.g., due to stroke or Parkinson's disease). These groups face communication problems that can lead to social exclusion. They are now being further marginalised by a new wave of speech technology that is increasingly woven into everyday life but is not robust to atypical speech. TAPAS is a Horizon 2020 Marie Skłodowska-Curie Actions Innovative Training Network European Training Network (MSCA-ITN-ETN) project that aims to transform the well-being of these people.
The TAPAS work programme targets three key research problems:
(a) Detection: We will develop speech processing techniques for early detection of conditions that impact on speech production. The outcomes will be cheap and non-invasive diagnostic tools that provide early warning of the onset of progressive conditions such as Alzheimer's and Parkinson's.
(b) Therapy: We will use newly emerging speech processing techniques to produce automated speech therapy tools. These tools will make therapy more accessible and more individually targeted. Better therapy can increase the chances of recovering intelligible speech after traumatic events such as a stroke or oral surgery.
(c) Assisted Living: We will re-design current speech technology so that it works well for people with speech impairments and also helps in making informed clinical choices. People with speech impairments often have other co-occurring conditions making them reliant on carers. Speech-driven tools for assisted-living are a way to allow such people to live more independently.
TAPAS adopts an inter-disciplinary and multi-sectorial approach. The consortium includes clinical practitioners, academic researchers and industrial partners, with expertise spanning speech engineering, linguistics and clinical science. All members have expertise in some element of pathological speech. This rich network will train a new generation of 15 researchers, equipping them with the skills and resources necessary for lasting success.
Publications
2023
Journal Articles
Nyquist Plot Parametrization for Quantitative Analysis of Vibration of the Vocal Folds
In: Journal of Voice (2023)
ISSN: 0892-1997
DOI: 10.1016/j.jvoice.2023.01.014
Conference Contributions
Transferring Quantified Emotion Knowledge for the Detection of Depression in Alzheimer’s Disease Using Forestnets
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Rhodes Island, June 4, 2023 - June 10, 2023)
In: ICASSP 2023
DOI: 10.1109/ICASSP49357.2023.10095219
Measuring Phonological Precision in Children with Cleft Lip and Palate
24th International Speech Communication Association, Interspeech 2023 (Dublin, August 20, 2023 - August 24, 2023)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023
DOI: 10.21437/Interspeech.2023-2099
2022
Authored Books
50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders
Springer Science and Business Media Deutschland GmbH, 2022
ISBN: 9783031162695
DOI: 10.1007/978-3-031-16270-1_29
Analysis of Pathological Speech Signals
Erlangen, Bayern, Germany: Logos Verlag Berlin GmbH, 2022
(Studien zur Mustererkennung, Vol.50)
ISBN: 978-3-8325-5561-0
URL: https://logos-verlag.eu/cgi-bin/engbuchmid?isbn=5561&lng=eng&id=
Journal Articles
Interpreting acoustic features for the assessment of Alzheimer's disease using ForestNet
In: Smart Health 26 (2022), Article No.: 100347
ISSN: 2352-6483
DOI: 10.1016/j.smhl.2022.100347
Machine learning based estimation of hoarseness severity from sustained vowels
In: Journal of the Acoustical Society of America 152 (2022), p. A141-A141
ISSN: 0001-4966
DOI: 10.1121/10.0015825
Parameterization of voice onset for automatic assessment of Parkinson’s disease
In: Journal of the Acoustical Society of America 152 (2022), p. A140-A140
ISSN: 0001-4966
DOI: 10.1121/10.0015820
Depression assessment in people with Parkinson's disease: The combination of acoustic features and natural language processing
In: Speech Communication 145 (2022), p. 10-20
ISSN: 0167-6393
DOI: 10.1016/j.specom.2022.09.001
Conference Contributions
High-speed video endoscopy to improve the diagnosis of voice disorders
DOI: 10.1055/s-0042-1746963
50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders
25th International Conference on Text, Speech, and Dialogue, TSD 2022 (Brno, CZE, September 6, 2022 - September 9, 2022)
In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2022
DOI: 10.1007/978-3-031-16270-1_29
Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
DOI: 10.21437/Interspeech.2022-10883
CoachLea: an Android Application to Evaluate the Speech Production and Perception of Children with Hearing Loss
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, KOR, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
URL: https://www.scopus.com/record/display.uri?eid=2-s2.0-85140086069&origin=inward
Wav2vec behind the Scenes: How end2end Models learn Phonetics
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 (Incheon, September 18, 2022 - September 22, 2022)
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
DOI: 10.21437/Interspeech.2022-10865
2021
Journal Articles
The Phonetic Footprint of Parkinson's Disease
In: Computer Speech and Language 72 (2021)
ISSN: 0885-2308
DOI: 10.1016/j.csl.2021.101321
URL: https://www.sciencedirect.com/science/article/abs/pii/S0885230821001169
Transfer learning helps to improve the accuracy to classify patients with different speech disorders in different languages
In: Pattern Recognition Letters (2021)
ISSN: 0167-8655
DOI: 10.1016/j.patrec.2021.04.011
Conference Contributions
Acoustic and Linguistic Analyses to Assess Early-Onset and Genetic Alzheimer's Disease
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP39728.2021.9414009
URL: https://ieeexplore.ieee.org/abstract/document/9414009
Emotional State Modeling for the Assessment of Depression in Parkinson’s Disease
24th International Conference on Text, Speech, and Dialogue, TSD 2021 (Olomouc, CZE, September 6, 2021 - September 9, 2021)
In: Kamil Ekštein, František Pártl, Miloslav Konopík (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2021
DOI: 10.1007/978-3-030-83527-9_39
End-2-End Modeling of Speech and Gait from Patients with Parkinson’s Disease: Comparison Between High Quality Vs. Smartphone Data
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/ICASSP39728.2021.9414729
URL: https://ieeexplore.ieee.org/abstract/document/9414729
Influence of the Interviewer on the Automatic Assessment of Alzheimer’s Disease in the Context of the ADReSSo Challenge
22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021 (August 30, 2021 - September 3, 2021)
In: Proc. Interspeech 2021
DOI: 10.21437/Interspeech.2021-1589
The Phonetic Footprint of Covid-19
Interspeech 2021 (August 30, 2021 - September 3, 2021)
In: Proc. Interspeech 2021
DOI: 10.21437/Interspeech.2021-1488
2020
Journal Articles
Automatic detection of Voice Onset Time in voiceless plosives using gated recurrent units
In: Digital Signal Processing 104 (2020), Article No.: 102779
ISSN: 1051-2004
DOI: 10.1016/j.dsp.2020.102779
Parallel Representation Learning for the Classification of Pathological Speech: Studies on Parkinson's Disease and Cleft Lip and Palate
In: Speech Communication 122 (2020), p. 56-67
ISSN: 0167-6393
DOI: 10.1016/j.specom.2020.07.005
Nonlinear dynamics and Poincaré sections to model gait impairments in different stages of Parkinson's disease
In: Nonlinear Dynamics 100 (2020), p. 3253-3276
ISSN: 0924-090X
DOI: 10.1007/s11071-020-05691-7
Conference Contributions
Surgical mask detection with deep recurrent phonetic models
21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020
DOI: 10.21437/Interspeech.2020-1723
2019
Book Contributions
Convolutional Neural Networks and a Transfer Learning Strategy to Classify Parkinson’s Disease from Speech in Three Different Languages
In: Ingela Nyström, Yanio Hernández Heredia, Vladimir Milián Núñez (ed.): Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2019, p. 697-706 (Image Processing, Computer Vision, Pattern Recognition, and Graphics, Vol.11896)
ISBN: 9783030339036
DOI: 10.1007/978-3-030-33904-3_66
Multi-channel Convolutional Neural Networks for Automatic Detection of Speech Deficits in Cochlear Implant Users
In: Ingela Nyström, Yanio Hernández Heredia, Vladimir Milián Núñez (ed.): Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2019, p. 679-687 (Image Processing, Computer Vision, Pattern Recognition, and Graphics, Vol.11896)
ISBN: 9783030339036
DOI: 10.1007/978-3-030-33904-3_64
Conference Contributions
Speech differences between CI users with pre- and postlingual onset of deafness detected by speech processing methods on voiceless to voice transitions
90. Jahresversammlung der Deutschen Gesellschaft für Hals-Nasen-Ohren-Heilkunde, Kopf- und Hals-Chirurgie (Estrel Congress Center Berlin, May 29, 2019 - June 1, 2019)
In: Laryngo-Rhino-Otol 2019
DOI: 10.1055/s-0039-168632
Consonant-to-Vowel/Vowel-to-Consonant Transitions to Analyze the Speech of Cochlear Implant Users.
Text, Speech, and Dialogue 2019 (Ljubljana, September 11, 2019 - September 13, 2019)
In: Kamil Ekštein (ed.): Lecture Notes in Computer Science 2019
DOI: 10.1007/978-3-030-27947-9_25
Phone-attribute posteriors to evaluate the speech of cochlear implant users
20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019 (Graz, September 15, 2019 - September 19, 2019)
In: Gernot Kubin, Thomas Hain, Bjorn Schuller, Dina El Zarka, Petra Hodl (ed.): Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019
DOI: 10.21437/Interspeech.2019-2144
Multi-channel Convolutional Neural Networks for Automatic Detection of Speech Deficits in Cochlear Implant Users
Iberoamerican Congress on Pattern Recognition
In: CIARP 2019: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
DOI: 10.1007/978-3-030-33904-3_64
2018
Journal Articles
Multimodal assessment of Parkinson's disease: a deep learning approach
In: IEEE Journal of Biomedical and Health Informatics (2018)
ISSN: 2168-2194
DOI: 10.1109/JBHI.2018.2866873
URL: https://ieeexplore.ieee.org/document/8444654
Speaker models for monitoring Parkinson’s disease progression considering different communication channels and acoustic conditions
In: Speech Communication 101 (2018), p. 11-25
ISSN: 0167-6393
DOI: 10.1016/j.specom.2018.05.007
URL: https://www.sciencedirect.com/science/article/abs/pii/S0167639317304454
Conference Contributions
A Multitask Learning Approach to Assess the Dysarthria Severity in Patients with Parkinson's Disease
INTERSPEECH
In: Proceedings of INTERSPEECH 2018
DOI: 10.21437/Interspeech.2018-1988
URL: https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1988.html
Unobtrusive Monitoring of Speech Impairments of Parkinson's Disease Patients Through Mobile Devices
2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018
DOI: 10.1109/ICASSP.2018.8462332
URL: https://ieeexplore.ieee.org/abstract/document/8462332
2017
Authored Books
Speaker model to monitor the neurological state and the dysarthria level of patients with parkinson’s disease
Springer Verlag, 2017
ISBN: 9783319642055
DOI: 10.1007/978-3-319-64206-2_31
Conference Contributions
Apkinson — A Mobile Monitoring Solution for Parkinson's Disease
Interspeech 2017
In: Interspeech 2017
URL: https://www.researchgate.net/profile/Juan_Vasquez12/publication/319185470_Apkinson_-_A_Mobile_Monitoring_Solution_for_Parkinson's_Disease/links/59b9fecfa6fdcc68723177dc/Apkinson-A-Mobile-Monitoring-Solution-for-Parkinsons-Disease.pdf
Parkinson’s disease progression assessment from speech using a mobile device-based application
20th International Conference on Text, Speech and Dialogue, TSD 2017
In: TSD 2017: Text, Speech, and Dialogue
DOI: 10.1007/978-3-319-64206-2_42
2016
Conference Contributions
Gender-dependent GMM-UBM for Tracking Parkinson's Disease Progression from Speech
12. ITG Fachtagung Sprachkommunikation (Paderborn)
In: Speech Communication - 12. ITG Fachtagung Sprachkommunikation, Berlin: 2016
URL: https://www5.informatik.uni-erlangen.de/Forschung/Publikationen/2016/Arias-Vergara16-GGF.pdf