ZORA (Zurich Open Repository and Archive)

Imputing missing data in plant traits: A guide to improve gap‐filling

Joswig, Julia S; Kattge, Jens; Kraemer, Guido; Mahecha, Miguel D; Rüger, Nadja; Schaepman, Michael E; Schrodt, Franziska; Schuman, Meredith Christine (2023). Imputing missing data in plant traits: A guide to improve gap‐filling. Global Ecology and Biogeography, 32(8):1395-1408.

Abstract

Aim: Globally distributed plant trait data are increasingly used to understand relationships between biodiversity and ecosystem processes. However, global trait databases are sparse because they are compiled from many, mostly small databases. This sparsity in both trait space completeness and geographical distribution limits the potential for both multivariate and global analyses. Thus, ‘gap-filling’ approaches are often used to impute missing trait data. Recent methods, like Bayesian hierarchical probabilistic matrix factorization (BHPMF), can impute large and sparse data sets using side information. We investigate whether BHPMF imputation leads to biases in trait space and identify aspects influencing bias to provide guidance for its usage.
Innovation: We use a fully observed trait data set from which entries are randomly removed, along with extensive but sparse additional data. We use BHPMF for imputation and evaluate bias by: (1) accuracy (residuals, RMSE, trait means), (2) correlations (bi- and multivariate) and (3) taxonomic and functional clustering (valuewise, uni- and multivariate). BHPMF preserves general patterns of trait distributions but induces taxonomic clustering. Data set–external trait data had little effect on induced taxonomic clustering and stabilized trait–trait correlations.
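The evaluation idea described above (remove entries from a fully observed data set, impute them, then compare against the known truth) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the trait matrix is synthetic, the function names are hypothetical, and the imputer is a simple column-mean fill standing in for BHPMF, which is not implemented here. It shows only two of the criteria named in the abstract, RMSE on the removed entries and the shift in trait–trait correlations.

```python
import numpy as np


def mask_at_random(X, frac, rng):
    """Return a copy of X with roughly `frac` of its entries set to NaN, plus the mask."""
    mask = rng.random(X.shape) < frac
    X_gappy = X.copy()
    X_gappy[mask] = np.nan
    return X_gappy, mask


def impute_column_mean(X_gappy):
    """Stand-in imputer (NOT BHPMF): fill each gap with the observed mean of its trait."""
    col_means = np.nanmean(X_gappy, axis=0)
    return np.where(np.isnan(X_gappy), col_means, X_gappy)


def rmse_on_gaps(X_true, X_imputed, mask):
    """Root mean squared error, computed only over the entries that were removed."""
    return float(np.sqrt(np.mean((X_true[mask] - X_imputed[mask]) ** 2)))


def correlation_shift(X_true, X_imputed):
    """Largest absolute change in any pairwise trait-trait Pearson correlation."""
    c_true = np.corrcoef(X_true, rowvar=False)
    c_imputed = np.corrcoef(X_imputed, rowvar=False)
    return float(np.max(np.abs(c_true - c_imputed)))


# Synthetic "fully observed" data: 500 hypothetical individuals, 3 correlated traits.
rng = np.random.default_rng(0)
cov = np.array([[1.0, 0.8, 0.3],
                [0.8, 1.0, 0.5],
                [0.3, 0.5, 1.0]])
X_true = rng.multivariate_normal(np.zeros(3), cov, size=500)

# Remove about 30% of entries at random, impute, and evaluate against the truth.
X_gappy, mask = mask_at_random(X_true, 0.3, rng)
X_imputed = impute_column_mean(X_gappy)

print("RMSE on removed entries:", rmse_on_gaps(X_true, X_imputed, mask))
print("Max shift in trait-trait correlation:", correlation_shift(X_true, X_imputed))
```

Reporting a correlation shift next to RMSE reflects the abstract's point that an imputation can be judged on more than pointwise accuracy: a method may score acceptably on RMSE while still distorting the relationships between traits.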
Main Conclusions: Our study extends the criteria for the evaluation of gap-filling beyond RMSE, providing insight into statistical data structure and allowing better informed use of imputed trait data, with improved practice for imputation. We expect our findings to be valuable beyond applications in plant ecology, for any study using hierarchical side information for imputation.

Additional indexing

Item Type: Journal Article, refereed, original work
Communities & Collections: 07 Faculty of Science > Department of Chemistry; 07 Faculty of Science > Institute of Geography; 08 Research Priority Programs > Global Change and Biodiversity
Dewey Decimal Classification: 910 Geography & travel
Scopus Subject Areas: Physical Sciences > Global and Planetary Change; Life Sciences > Ecology, Evolution, Behavior and Systematics; Physical Sciences > Ecology
Uncontrolled Keywords: Ecology; Ecology, Evolution, Behavior and Systematics; Global and Planetary Change
Language: English
Date: 1 August 2023
Deposited On: 08 Jun 2023 11:10
Last Modified: 29 Dec 2024 02:38
Publisher: Wiley-Blackwell Publishing, Inc.
ISSN: 1466-822X
OA Status: Hybrid
Publisher DOI: https://doi.org/10.1111/geb.13695
Full text (PDF):
  • Content: Published Version
  • Language: English
  • Licence: Creative Commons: Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)
