Estimation of Species Diversity: HCDT Entropy, Hill Numbers, and Chao Estimator | Summaries Biology

Hal 01212435 v2

Practical Estimation of Diversity from Abundance

Data

Eric Marcon1*

Abstract

Measuring biodiversity requires empirical techniques to effectively estimate it from real data. The well-known underestimation of

the number of species applies to low orders of diversity in general. I test nine estimators including three new ones on geometric

and lognormal distributions that represent realistic, hyper-diverse communities. The best two estimators allow a good estimation

of diversity of orders over 0.5, even when the sampling effort is low. I provide criteria to choose the estimator and the necessary

code in the R package entropart.

Keywords

Biodiversity, HCDT entropy, Phylodiversity

AgroParisTech, UMR EcoFoG, CNRS, Cirad, INRA, Universite´ des Antilles, Universite´ de Guyane, Campus agronomique, BP 316, F-97310

Kourou, French Guiana.

Contents

Introduction 1

1 Methods 2

1.1 Sample coverage . . . . . . . . . . . . . . . . . . . . . . . . . 2

1.2 Estimators of entropy . . . . . . . . . . . . . . . . . . . . . . 3

1.3 Confidence intervals . . . . . . . . . . . . . . . . . . . . . . . 4

1.4 From entropy to diversity . . . . . . . . . . . . . . . . . . . 4

1.5 Typical distributions . . . . . . . . . . . . . . . . . . . . . . . 4

1.6 Evaluation of the performance of estimators . . . . . . . 5

2 Results 5

2.1 Sample coverage . . . . . . . . . . . . . . . . . . . . . . . . . 5

2.2 Entropy and diversity . . . . . . . . . . . . . . . . . . . . . . 5

3 Discussion 7

3.1

The sample coverage is not always the good indicator of

the quality of estimation . . . . . . . . . . . . . . . . . . . . 7

3.2

Comparing the diversity of real communities with different

distributions remains untractable . . . . . . . . . . . . . . 7

3.3 Estimating the number of species is the critical step . . 8

3.4

Better, but probably not much better, estimators may be

derived ...............................8

4 Application to real data 8

5 Conclusion 9

Introduction

Measuring biodiversity requires both a robust theoretical

framework (Patil and Taillie, 1982) and empirical tech-

niques to effectively estimate the theoretical variables

with real data (Beck and Schwanghart, 2010). In this

paper I focus on species-neutral measures of diversity

based on HCDT entropy (Havrda and Charv´at, 1967;

Dar´oczy, 1970; Tsallis, 1988) that fulfill the first require-

ment. Entropy measures the average surprise brought

by observing individuals of a community. Surprise is a

decreasing function of probability dropping to 0 when

probability is 1. HCDT entropy uses a parameterized

surprise function that is the deformed logarithm of order

of the reciprocal of probability(Marcon et al., 2014a).

Traditional measures of diversity, namely the number of

species as well as Shannon’s and Simpson’s indices, are

special cases of the HCDT entropy for values of

equal

to 0, 1 and 2. HCDT entropy should be transformed into

Hill numbers (Hill, 1973) for better interpretation of the

value of diversity as an effective number of species (Jost,

2006). Hill numbers are simply the deformed exponential

of HCDT entropy (Marcon et al., 2014a). Rather than

focusing on a single value of

, a profile of diversity, i.e.

a plot of diversity against

, can be built (Tothmeresz,

1995). Low values of

(starting from 0) give much im-

portance to rare species, whilst higher values (usually up

to 2) focus on abundant species. Negative values of

are not used because of poor mathematical properties of

their entropy (Beck, 2009), and values over 2 generally

bring little more information. Ordering communities in

terms of diversity requires that their profile do not cross

(Tothmeresz, 1995); else, declaring a community more

diverse than another only holds for a range of values

reflecting the importance given to rare or frequent

species (Lande et al., 2000).

To plot those profiles, diversity must be estimated

from the data. Estimation bias (I follow the terminology

of Dauby and Hardy, 2012) is a well-known issue (Marcon

et al., 2014a). Real data are almost always samples

of larger communities, so some species may have been

missed. The induced bias on the Simpson entropy is

Estimation of Species Diversity: HCDT Entropy, Hill Numbers, and Chao Estimator, Summaries of Biology

Partial preview of the text

Download Estimation of Species Diversity: HCDT Entropy, Hill Numbers, and Chao Estimator and more Summaries Biology in PDF only on Docsity!

Practical Estimation of Diversity from Abundance

Data

Eric Marcon^1 *

Contents

Introduction

1. Methods

[

]

C^ ˆ = 1 −

]

2. Results

3. Discussion

4. Application to real data

Appendix 1: Sample coverage estimation

Appendix 2: Estimated diversity profiles