Question 1

Should I report Shannon, Simpson, or both?

Accepted Answer

Both, when space allows. They emphasise different things: Shannon is more sensitive to species richness and to rare species, Simpson is more sensitive to evenness among the dominant species. A community where Shannon and Simpson disagree (one says "very diverse", the other says "moderate") is usually one with a long tail of rare species — and which answer is "right" depends on what your scientific question is. For ecology papers, reporting both alongside richness (S) is normal and lets readers interpret your data their own way. For microbiome work, the convention has converged on reporting at least Shannon and inverse Simpson, often alongside Faith's phylogenetic diversity (which this calculator doesn't compute — for that you need a tree). If you have to pick one for a single-number summary, inverse Simpson is the easiest to interpret because it has units of "effective species" — a number a non-specialist can immediately reason about.

Question 2

Why does deeper sequencing make my Shannon look higher?

Accepted Answer

Because deeper sampling discovers more rare species, and rare species push richness up — Shannon includes a richness term, so it rises mechanically with read count. The same community sequenced to 1,000 vs 50,000 reads can show meaningfully different Shannon values purely due to sampling depth, not biology. Two standard fixes. (1) Rarefy: subsample every sample down to the lowest read depth in your dataset before computing diversity. Loses real data but makes samples directly comparable. (2) Use coverage-based or model-based estimators (e.g. Hill numbers via the iNEXT framework, or Chao1 for richness alone) that explicitly account for sampling effort. For a single one-off calculation on a single sample, the raw Shannon is fine to report alongside the read count; for cross-sample comparisons, never compare raw Shannon between samples sequenced to different depths.

Question 3

What input formats does the calculator accept?

Accepted Answer

Anything that contains a list of numeric counts. The parser extracts every numeric token from the pasted text and treats each as one species count, so you can paste a single column of numbers from a spreadsheet, a comma-separated list, a tab-separated table with species names in one column and counts in another, or even a sentence like "Species A had 12 individuals, Species B had 7". Species names are ignored — only the counts matter for the indices. Zero counts are dropped (a species with zero observations isn't in the sample). Negative numbers are silently ignored as input errors. If you have a spreadsheet with multiple samples and want diversity for each, run them one at a time; the calculator computes a single sample's diversity per submission, not a matrix.

Question 4

My Pielou evenness is 1.00 — is that right?

Accepted Answer

Yes — J′ = 1 means the community is perfectly even, i.e. every species has the same count. Mathematically that's when Shannon hits its theoretical maximum of ln(S), and J′ = H′/ln(S) = 1. It's rare to see in real ecological or microbiome data because real communities almost always have at least some unevenness; if you're seeing exactly 1.00, double-check that your input wasn't accidentally a list of identical numbers (e.g. you pasted relative frequencies after rounding to the same value, or you have a constant column from a spreadsheet). At the other extreme, J′ approaches 0 as one species comes to dominate completely; J′ = 0 would mean a single-species "community", which the calculator flags as not-applicable for evenness because there's no theoretical maximum to compare against (S = 1 makes ln(S) = 0).

Diversity Index Calculator

How this works

The formula

Example calculation

Frequently asked questions

Related calculators