
Extracting Phonetic Posterior-Based Features for Detecting Multiple Sclerosis From Speech

Gosztolya, Gábor and Svindt, Veronika and Bóna, Judit and Hoffmann, Ildikó (2023) Extracting Phonetic Posterior-Based Features for Detecting Multiple Sclerosis From Speech. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 31. pp. 3234-3244. ISSN 1534-4320

2023-tnsre-sm.pdf - Published Version
Available under License Creative Commons Attribution.


Abstract

Multiple sclerosis (MS) is a chronic inflammatory disease of the central nervous system which, in addition to affecting motor and cognitive functions, may also lead to specific changes in the speech of patients. Speech production, comprehension, repetition and naming tasks, as well as structural and content changes in narratives, might indicate a limitation of executive functions. In this study we present a speech-based machine learning technique to distinguish between speakers with relapsing-remitting subtype MS and healthy controls (HC). We exploit the fact that MS might cause a motor speech disorder similar to dysarthria, which, according to our hypothesis, might affect the phonetic posterior estimates supplied by a Deep Neural Network acoustic model. According to our experimental results, the proposed posteriorgram-based feature extraction approach is useful for detecting MS: depending on the actual speech task, we obtained Equal Error Rate values as low as 13.3%, and AUC scores up to 0.891, indicating a competitive and more consistent classification performance compared to both the x-vector and the openSMILE ‘ComParE functionals’ attributes. Besides this discrimination performance, the interpretable nature of the phonetic posterior features might also make our method suitable for automatic MS screening or monitoring the progression of the disease. Furthermore, examining which specific phonetic groups are the most useful for this feature extraction process could also make the proposed phonetic features useful in the speech therapy of MS patients.
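
The abstract reports results as Equal Error Rate (EER) and AUC. As a minimal sketch of how these two metrics are commonly computed from per-speaker classifier scores, the following may help; it is not the authors' evaluation code, and the function names `auc_score` and `equal_error_rate` as well as the toy labels and scores are illustrative assumptions.

```python
import numpy as np


def auc_score(labels, scores):
    """Area under the ROC curve via the pairwise (Mann-Whitney) formulation.

    labels: 1 for the positive class (e.g. MS), 0 for the negative class (e.g. HC).
    scores: higher values mean "more likely positive".
    """
    labels = np.asarray(labels, dtype=bool)
    scores = np.asarray(scores, dtype=float)
    pos = scores[labels]
    neg = scores[~labels]
    # Compare every positive score with every negative score; ties count as 0.5.
    diff = pos[:, None] - neg[None, :]
    return float((np.sum(diff > 0) + 0.5 * np.sum(diff == 0)) / diff.size)


def equal_error_rate(labels, scores):
    """Error rate at the threshold where false acceptance and false rejection
    rates are closest (their average is returned if they do not match exactly)."""
    labels = np.asarray(labels, dtype=bool)
    scores = np.asarray(scores, dtype=float)
    pos = scores[labels]
    neg = scores[~labels]
    # Candidate thresholds: every observed score plus the two extremes.
    thresholds = np.concatenate(([-np.inf], np.unique(scores), [np.inf]))
    # A sample is predicted positive when its score is >= the threshold.
    far = np.array([np.mean(neg >= t) for t in thresholds])  # false acceptance
    frr = np.array([np.mean(pos < t) for t in thresholds])   # false rejection
    i = int(np.argmin(np.abs(far - frr)))
    return float((far[i] + frr[i]) / 2)


if __name__ == "__main__":
    # Toy data (hypothetical): three positive and three negative speakers.
    toy_labels = [1, 1, 1, 0, 0, 0]
    toy_scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]
    print("AUC:", auc_score(toy_labels, toy_scores))
    print("EER:", equal_error_rate(toy_labels, toy_scores))
```

A lower EER and a higher AUC both indicate better separation of the two groups; the EER is tied to one operating point, while the AUC summarizes ranking quality over all thresholds.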

Item Type: Article
Uncontrolled Keywords: Multiple sclerosis, deep neural networks, DNN acoustic models, phonetic posteriors
Subjects: Q Science / természettudomány > QA Mathematics / matematika > QA75 Electronic computers. Computer science / számítástechnika, számítógéptudomány
Q Science / természettudomány > QA Mathematics / matematika > QA76.76 Software Design and Development / Szoftvertervezés és -fejlesztés
R Medicine / orvostudomány > R1 Medicine (General) / orvostudomány általában
R Medicine / orvostudomány > RC Internal medicine / belgyógyászat > RC0321 Neuroscience. Biological psychiatry. Neuropsychiatry / idegkórtan, neurológia, pszichiátria
SWORD Depositor: MTMT SWORD
Depositing User: MTMT SWORD
Date Deposited: 22 Aug 2024 08:27
Last Modified: 22 Aug 2024 08:27
URI: https://real.mtak.hu/id/eprint/203110
