Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Identifying Informative Predictor Variables With Random Forests

Rothacher, Yannick; Strobl, Carolin (2024). Identifying Informative Predictor Variables With Random Forests. Journal of Educational and Behavioral Statistics, 49(4):595-629.

Abstract

Random forests are a nonparametric machine learning method, which is currently gaining popularity in the behavioral sciences. Despite random forests’ potential advantages over more conventional statistical methods, a remaining question is how reliably informative predictor variables can be identified by means of random forests. The present study aims at giving a comprehensible introduction to the topic of variable selection with random forests and providing an overview of the currently proposed selection methods. Using simulation studies, the variable selection methods are examined regarding their statistical properties, and comparisons between their performances and the performance of a conventional linear model are drawn. Advantages and disadvantages of the examined methods are discussed, and practical recommendations for the use of random forests for variable selection are given.

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Psychology
Dewey Decimal Classification:150 Psychology
Scopus Subject Areas:Social Sciences & Humanities > Education
Social Sciences & Humanities > Social Sciences (miscellaneous)
Uncontrolled Keywords:random forest; variable importance; interpretable machine learning; recursive partitioning; variable selection
Language:English
Date:1 August 2024
Deposited On:16 Jan 2024 11:33
Last Modified:28 Apr 2025 01:38
Publisher:Sage Publications
ISSN:1076-9986
OA Status:Hybrid
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:https://doi.org/10.3102/10769986231193327
Download PDF  'Identifying Informative Predictor Variables With Random Forests'.
Preview
  • Content: Published Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Citations

Dimensions.ai Metrics
11 citations in Web of Science®
11 citations in Scopus®
Google Scholar™

Altmetrics

Downloads

19 downloads since deposited on 16 Jan 2024
19 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications