REAL

A Siamese-based Approach to Improve Parkinson’s Disease Detection and Severity Prediction from Speech Using X-Vector Embedding

Jenei, Attila Zoltán and Ágoston, Réka and Valálik, István (2025) A Siamese-based Approach to Improve Parkinson’s Disease Detection and Severity Prediction from Speech Using X-Vector Embedding. INFOCOMMUNICATIONS JOURNAL, 17 (1). pp. 76-81. ISSN 2061-2079

[img]
Preview
Text
InfocomJournal_2025_1_9_.pdf - Published Version

Download (972kB) | Preview

Abstract

Parkinson’s disease is incurable and is considered one of the most common neurological diseases. It is a progressive disease, which highlights the importance of early detection. Machine learning-based diagnostic support is desirable since the diagnosis is based on history, visual inspection, and drug tests. Speech is presumed to be one of the promising biomarkers that can predict the state of the disease. Combining speech data with deep learning feature extraction in Siamese-based architecture may improve the detection compared with direct regression with acoustic and prosodic features. Read text-based speech samples were acquired from 98 patients with Parkinson’s disease and 107 healthy participants. Feature vectors were extracted with pre- trained x-vector embedding and were used directly with a support vector regressor in a nested cross-validation setup (baseline approach). Furthermore, pairs were allocated, and difference vectors were calculated. These difference vectors were then used to train support vector regressor models in nested crossvalidation (Siamese-based approach). Severity predictions and classification were performed with the outcomes. The Siamesebased setup outperformed the baseline approach both in regression and classification metrics. The relative improvement in root mean square error is 14.4%, and the Pearson correlation is 12.5% at best. After the classification, the relative improvement is 6.0% in sensitivity, 3.0% in specificity, and 4.5% in accuracy. Furthermore, comparing the test sample to not only one but multiple others decreases the average standard deviation of the predicted severity by 16.5% in relative value. Changing only the architecture of the traditional examination setup to a Siamesebased approach may increase the performance of the models.

Item Type: Article
Uncontrolled Keywords: Classification, Deep-learning, Parkinson’s Disease, Siamese Network
Subjects: Q Science / természettudomány > QA Mathematics / matematika > QA76.527 Network technologies / Internetworking / hálózati technológiák, hálózatosodás
R Medicine / orvostudomány > R1 Medicine (General) / orvostudomány általában
SWORD Depositor: MTMT SWORD
Depositing User: MTMT SWORD
Date Deposited: 03 Apr 2025 11:37
Last Modified: 03 Apr 2025 11:37
URI: https://real.mtak.hu/id/eprint/217525

Actions (login required)

Edit Item Edit Item