REAL

Wav2vec 2.0 Embeddings Are No Swiss Army Knife : A Case Study for Multiple Sclerosis

Gosztolya, Gábor and Kiss-Vetráb, Mercedes and Svindt, Veronika and Bóna, Judit and Hoffmann, Ildikó (2024) Wav2vec 2.0 Embeddings Are No Swiss Army Knife : A Case Study for Multiple Sclerosis. In: 25th Interspeech Conference (Interspeech 2024). Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH . International Speech Communication Association (ISCA), Dublin, pp. 2499-2503.

[img]
Preview
Text
2024-interspeech-wav2vec2-vs-others.pdf - Published Version

Download (234kB) | Preview

Abstract

In the past few years, self-supervised learning has revolutionalized automatic speech recognition. Self-supervised models such as wav2vec2, due to their generalization ability on huge unannotated audio corpora, were claimed to be state-ofthe-art feature extractors in paralinguistic and pathological applications as well. In this study we test embeddings extracted from a wav2vec 2.0 model fine-tuned on the target language as features on a multiple sclerosis audio corpus, using three speech tasks. After comparing the resulting classification performances with traditional features such as ComParE functionals, ECAPATDNN and activations of a HMM/DNN hybrid acoustic model, we found that wav2vec2-based models, surprisingly, only produced a mediocre classification performance. In contrast, the decade-old ComParE functionals feature set consistently led to high scores. Our results also indicate that the number of features correlates surprisingly well with classification performance.

Item Type: Book Section
Uncontrolled Keywords: wav2vec 2.0, feature extraction, pathological speech processing, multiple sclerosis
Subjects: P Language and Literature / nyelvészet és irodalom > P0 Philology. Linguistics / filológia, nyelvészet
R Medicine / orvostudomány > RC Internal medicine / belgyógyászat > RC0321 Neuroscience. Biological psychiatry. Neuropsychiatry / idegkórtan, neurológia, pszichiátria
SWORD Depositor: MTMT SWORD
Depositing User: MTMT SWORD
Date Deposited: 09 Apr 2025 06:53
Last Modified: 09 Apr 2025 06:53
URI: https://real.mtak.hu/id/eprint/217638

Actions (login required)

Edit Item Edit Item