REAL

Prevalence dependence in model goodness measures with special emphasis on true skill statistics

Somodi, Imelda and Lepesi, Nikolett and Botta-Dukát, Zoltán (2017) Prevalence dependence in model goodness measures with special emphasis on true skill statistics. Ecology and Evolution, 7 (3). pp. 863-872. ISSN 2045-7758, ESSN: 2045-7758

[img]
Preview
Text
SomodiI_etal_Ecology_and_Evolution_OA.pdf

Download (544kB) | Preview

Abstract

It has long been a concern that performance measures of species distribution models react to attributes of the modeled entity arising from the input data structure rather than to model performance. Thus, the study of Allouche et al. (Journal of Applied Ecology, 43, 1223, 2006) identifying the true skill statistics (TSS) as being independent of prevalence had a great impact. However, empirical experience questioned the validity of the statement. We searched for technical reasons behind these observations. We explored possible sources of prevalence dependence in TSS including sampling constraints and species characteristics, which influence the calculation of TSS. We also examined whether the widespread solution of using the maximum of TSS for comparison among species introduces a prevalence effect. We found that the design of Allouche et al. (Journal of Applied Ecology, 43, 1223, 2006) was flawed, but TSS is indeed independent of prevalence if model predictions are binary and under the strict set of assumptions methodological studies usually apply. However, if we take realistic sources of prevalence dependence, effects appear even in binary calculations. Furthermore, in the widespread approach of using maximum TSS for continuous predictions, the use of the maximum alone induces prevalence dependence for small, but realistic samples. Thus, prevalence differences need to be taken into account when model comparisons are carried out based on discrimination capacity. The sources we identified can serve as a checklist to safely control comparisons, so that true discrimination capacity is compared as opposed to artefacts arising from data structure, species characteristics, or the calculation of the comparison measure (here TSS). © 2017 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd.

Item Type: Article
Uncontrolled Keywords: Species distribution models; Sample Size; Predictive models; Model performance; Cohen's kappa
Subjects: Q Science / természettudomány > QH Natural history / természetrajz > QH540 Ecology / ökológia
SWORD Depositor: MTMT SWORD
Depositing User: MTMT SWORD
Date Deposited: 15 Feb 2017 10:48
Last Modified: 15 Feb 2017 10:48
URI: http://real.mtak.hu/id/eprint/48982

Actions (login required)

Edit Item Edit Item