Abstract
This deliverable report identifies and substantiates criteria for determining the reliability of species status and trend estimates derived from aggregated data cubes from the Global Biodiversity Information Facility (GBIF). To achieve this, the report adopts a comparative approach, contrasting unstructured GBIF cube data with structured monitoring data from bird surveys in Flanders (Belgium) and the Western Cape (South Africa).
Key findings from these case studies demonstrate that extensive quality control, data filtering, and validation are essential to producing robust results from unstructured data. Specific technical barriers identified include coordinate uncertainty, where large uncertainty radii can distort spatial signals; taxonomic inconsistencies, where unlinked accepted names artificially inflate species richness; and publication delays, which can create misleading temporal trends. Furthermore, the analysis reveals that data cubes are often dominated by a small number of highly influential component datasets, making indicators sensitive to the presence or absence of specific sources.
To address these biases, the report implements diagnostic frameworks to quantify survey effort and completeness. It introduces a survey-effort score, capturing record volume, temporal replication, and taxonomic coverage, and utilizes probabilistic estimators to assess survey completeness. Additionally, the report implementation examines species detectability, implementing survey-based detection probability metrics to distinguish between genuine ecological signals and reporting biases driven by technology or observer behaviour.
Finally, the report operationalizes these assessments through specialized software tools developed within the B3 project, specifically the gcube R package for simulating occurrence cubes and the dubicube R package for quality checks and quantifying indicator uncertainty. These insights are synthesized into a set of operational guidelines for reliable indicator and trend calculations, ensuring that biodiversity reporting based on aggregated occurrence data is transparent, reproducible, and robust.
Key findings from these case studies demonstrate that extensive quality control, data filtering, and validation are essential to producing robust results from unstructured data. Specific technical barriers identified include coordinate uncertainty, where large uncertainty radii can distort spatial signals; taxonomic inconsistencies, where unlinked accepted names artificially inflate species richness; and publication delays, which can create misleading temporal trends. Furthermore, the analysis reveals that data cubes are often dominated by a small number of highly influential component datasets, making indicators sensitive to the presence or absence of specific sources.
To address these biases, the report implements diagnostic frameworks to quantify survey effort and completeness. It introduces a survey-effort score, capturing record volume, temporal replication, and taxonomic coverage, and utilizes probabilistic estimators to assess survey completeness. Additionally, the report implementation examines species detectability, implementing survey-based detection probability metrics to distinguish between genuine ecological signals and reporting biases driven by technology or observer behaviour.
Finally, the report operationalizes these assessments through specialized software tools developed within the B3 project, specifically the gcube R package for simulating occurrence cubes and the dubicube R package for quality checks and quantifying indicator uncertainty. These insights are synthesized into a set of operational guidelines for reliable indicator and trend calculations, ensuring that biodiversity reporting based on aggregated occurrence data is transparent, reproducible, and robust.
| Original language | English |
|---|
| Publisher | Biodiversity building blocks for policy |
|---|---|
| Number of pages | 86 |
| Publication status | Published - 27-Feb-2026 |
Thematic List 2020
- Data & infrastructure
Fingerprint
Dive into the research topics of 'D4.3 Report on the criteria for data quality and species characteristics for estimating species status and trends'. Together they form a unique fingerprint.Projects
- 1 Active
-
B-Cubed - Biodiversity Building Blocks for policy
Desmet, P. (Project leader), Adriaens, T. (Cooperator), Adriaenssens, V. (Cooperator), Cartuyvels, E. (Cooperator), Delva, S. (Cooperator), Govaert, S. (Cooperator), Hillaert, J. (Cooperator), Huybrechts, P. (Cooperator), Langeraert, W. (Cooperator), Oldoni, D. (Cooperator), Onkelinx, T. (Cooperator), Reyserhove, L. (Cooperator), Strubbe, D. (Cooperator), Van Calster, H. (Cooperator), Van Daele, T. (Cooperator), Van den Broeck, F. (Cooperator) & Vanderhaeghe, F. (Cooperator)
1/03/23 → 31/08/26
Project: EVINBO - Europees
Research output
- 2 Report not published by INBO
-
MS19 Preliminary criteria for data quality and species characteristics for estimating species status and trends.
Cartuyvels, E., Faulkner, K., Langeraert, W. & Van Daele, T., 30-Apr-2025, Biodiversity building blocks for policy. 37 p.Research output: Book/Report › Report not published by INBO
Open AccessFile -
MS 18 Selection of the monitoring and inventory projects: selection of species (groups), spatial and temporal extent
Langeraert, W., Faulkner, K., Zengeya, T., Martini, M., Rocchini, D., Breugelmans, L., Cortès, R. & Van Daele, T., 30-Nov-2023, Biodiversity building blocks for policy. 21 p.Research output: Book/Report › Report not published by INBO
Open AccessFile
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver