. 2018

What does a zero mean? Understanding false, random and structural zeros in ecology

Dipòsit Digital de Documents de la UAB
oai:ddd.uab.cat:194390
Dipòsit Digital de Documents de la UAB
  • Blasco-Moreno, Anabel|||0000-0001-7574-9911
  • Pérez-Casany, Marta
  • Puig, Pedro|||0000-0002-6607-9642
  • Morante, Maria
  • Castells, Eva|||0000-0001-7423-2742
Data Description
 
DOI: https://ddd.uab.cat/record/194390, https://dx.doi.org/10.5565/ddd.uab.cat/194390
Dipòsit Digital de Documents de la UAB
oai:ddd.uab.cat:194390

HANDLE: https://ddd.uab.cat/record/194390, https://dx.doi.org/10.5565/ddd.uab.cat/194390
Dipòsit Digital de Documents de la UAB
oai:ddd.uab.cat:194390
 
Ver en: https://ddd.uab.cat/record/194390, https://dx.doi.org/10.5565/ddd.uab.cat/194390
Dipòsit Digital de Documents de la UAB
oai:ddd.uab.cat:194390

UPCommons. Portal del coneixement obert de la UPC
oai:upcommons.upc.edu:2117/169416
Artículo científico (article). 2019

WHAT DOES A ZERO MEAN? UNDERSTANDING FALSE, RANDOM AND STRUCTURAL ZEROS IN ECOLOGY

UPCommons. Portal del coneixement obert de la UPC
  • Blasco Moreno, Anabel
  • Pérez Casany, Marta|||0000-0003-3675-6902
  • Puig, Pedro
  • Morante, Maria
  • Castells, Eva
1. Zeros (i.e. events that do not happen) are the source of two common phenomena in count data: overdispersion and zero-inflation. Zeros have multiple origins in a dataset: false zeros occur due to errors in the experimental design or the observer; structural zeros are related to the ecological or evolutionary restrictions of the system under study; and random zeros are the result of the sampling variability. Identifying the type of zeros and their relation with overdispersion and/or zero inflation is key to select the most appropriate statistical model. 2. We first introduce the dispersion and zero inflation indeces and review the theory of the Zero-inflated (ZI) models and the use of the score tests to assess overdispersion and zero inflation over a model. Then, we present the following protocol to assist with the analysis of count data: Step 1) classify the zeros and minimize the presence of false zeros; Step 2) identify suitable covariates; Step 3) test the data for overdispersion and zero-inflation; and Step 4) choose the most adequate model based on the results of step 3 and use score tests to determine whether more complex models should be implemented. 3. We applied the recommended protocol on a real dataset on plant-herbivore interactions to evaluate the suitability of six different models (Poisson, NB and their zero-inflated versions –ZIP, ZINB–) in the light of overdispersion and zero inflation. Finally, we discuss the consequences of adjusting suboptimal models. 4. Our data was overdispersed and zero-inflated, and the ZINB was the model with the best fit, as predicted. Ignoring overdispersion and/or zero inflation during data analyses caused biased estimates of the statistical parameters and serious errors in the interpretation of the results. Our results are a clear example on how the conclusions of an ecological hypothesis can change depending on the model applied. Understanding of how zeros arise in count data, for example identifying the potential sources of structural zeros, is essential to select the best statistical design. A good model not only fits the data correctly but also takes into account the idiosyncrasies of the biological system under study., Peer Reviewed





Dipòsit Digital de Documents de la UAB
oai:ddd.uab.cat:194390
. 2018

WHAT DOES A ZERO MEAN? UNDERSTANDING FALSE, RANDOM AND STRUCTURAL ZEROS IN ECOLOGY

Dipòsit Digital de Documents de la UAB
  • Blasco-Moreno, Anabel|||0000-0001-7574-9911
  • Pérez-Casany, Marta
  • Puig, Pedro|||0000-0002-6607-9642
  • Morante, Maria
  • Castells, Eva|||0000-0001-7423-2742
Data Description




Recercat. Dipósit de la Recerca de Catalunya
oai:recercat.cat:2072/362831
Artículo científico (article).

WHAT DOES A ZERO MEAN? UNDERSTANDING FALSE, RANDOM AND STRUCTURAL ZEROS IN ECOLOGY

Recercat. Dipósit de la Recerca de Catalunya
  • Blasco Moreno, Anabel
  • Pérez Casany, Marta
  • Puig, Pedro
  • Morante, Maria
  • Castells, Eva
1. Zeros (i.e. events that do not happen) are the source of two common phenomena in count data: overdispersion and zero-inflation. Zeros have multiple origins in a dataset: false zeros occur due to errors in the experimental design or the observer; structural zeros are related to the ecological or evolutionary restrictions of the system under study; and random zeros are the result of the sampling variability. Identifying the type of zeros and their relation with overdispersion and/or zero inflation is key to select the most appropriate statistical model. 2. We first introduce the dispersion and zero inflation indeces and review the theory of the Zero-inflated (ZI) models and the use of the score tests to assess overdispersion and zero inflation over a model. Then, we present the following protocol to assist with the analysis of count data: Step 1) classify the zeros and minimize the presence of false zeros; Step 2) identify suitable covariates; Step 3) test the data for overdispersion and zero-inflation; and Step 4) choose the most adequate model based on the results of step 3 and use score tests to determine whether more complex models should be implemented. 3. We applied the recommended protocol on a real dataset on plant-herbivore interactions to evaluate the suitability of six different models (Poisson, NB and their zero-inflated versions –ZIP, ZINB–) in the light of overdispersion and zero inflation. Finally, we discuss the consequences of adjusting suboptimal models. 4. Our data was overdispersed and zero-inflated, and the ZINB was the model with the best fit, as predicted. Ignoring overdispersion and/or zero inflation during data analyses caused biased estimates of the statistical parameters and serious errors in the interpretation of the results. Our results are a clear example on how the conclusions of an ecological hypothesis can change depending on the model applied. Understanding of how zeros arise in count data, for example identifying the potential sources of structural zeros, is essential to select the best statistical design. A good model not only fits the data correctly but also takes into account the idiosyncrasies of the biological system under study., Peer Reviewed