Literature documents the impact of Parkinson’s Disease (PD) on speech but no study has analyzed in detail the importance of the distinct phonemic groups for the automatic identification of the disease. This study presents new approaches that are evaluated in three different corpora containing speakers suffering from PD with two main objectives: to investigate the influence of the different phonemic groups in the detection of PD and to propose more accurate detection schemes employing speech. The proposed methodology uses GMM-UBM classifiers combined with a technique introduced in this paper called phonemic grouping, that permits observation of the differences in accuracy depending on the manner of articulation. Cross-validation results reach accuracies between 85% and 94% with AUC ranging from 0.91 to 0.98, while cross-corpora trials yield accuracies between 75% and 82% with AUC between 0.84 and 0.95, depending on the corpus. This is the first work analyzing the generalization properties of the proposed approaches employing cross-corpora trials and reaching high accuracies. Among the different phonemic groups, results suggest that plosives, vowels and fricatives are the most relevant acoustic segments for the detection of PD with the proposed schemes. In addition, the use of text-dependent utterances leads to more consistent and accurate models.
Acknowledgements
Authors want to thank to Jesus F. Vargas-Bonilla, Julian D. Arias-Londoño and Rafael Orozco-Arroyave for sharing the GITA corpus. Also thanks to Jan Rusz for sharing the CzechPD corpus. This work is part of the doctoral thesis of the first author. The work was supported by a grant of the Government of Spain (DPI2017-83405-R), a MISTI Global Seed Funds Award, and by Universidad Politecnica de Madrid.