REAL

Navigating the Statistical Minefield of Model Selection and Clustering in Neuroscience

Király, Bálint and Hangya, Balázs (2022) Navigating the Statistical Minefield of Model Selection and Clustering in Neuroscience. ENEURO, 9 (4). ISSN 2373-2822

[img]
Preview
Text
NavigatingtheStatistical....pdf
Available under License Creative Commons Attribution.

Download (1MB) | Preview

Abstract

Model selection is often implicit: when performing an ANOVA, one assumes that the normal distribution is a good model of the data; fitting a tuning curve implies that an additive and a multiplicative scaler describes the behavior of the neuron; even calculating an average implicitly assumes that the data were sampled from a distribution that has a finite first statistical moment: the mean. Model selection may be explicit, when the aim is to test whether one model provides a better description of the data than a competing one. As a special case, clustering algorithms identify groups with similar properties within the data. They are widely used from spike sorting to cell type identification to gene expression analysis. We discuss model selection and clustering techniques from a statistician's point of view, revealing the assumptions behind, and the logic that governs the various approaches. We also showcase important neuroscience applications and provide suggestions how neuroscientists could put model selection algorithms to best use as well as what mistakes should be avoided.

Item Type: Article
Uncontrolled Keywords: clustering, bootstrap, information criterion, cross-validation, resampling
Subjects: Q Science / természettudomány > QH Natural history / természetrajz > QH301 Biology / biológia > QH3020 Biophysics / biofizika
SWORD Depositor: MTMT SWORD
Depositing User: MTMT SWORD
Date Deposited: 20 Jul 2022 08:48
Last Modified: 20 Jul 2022 08:48
URI: http://real.mtak.hu/id/eprint/144970

Actions (login required)

Edit Item Edit Item