Publication:

On the use of random forest for two-sample testing

Date

Date

Date
2022
Journal Article
Published version

Citations

Citation copied

Hediger, S., Michel, L., & Näf, J. (2022). On the use of random forest for two-sample testing. Computational Statistics & Data Analysis, 170, 107435. https://doi.org/10.1016/j.csda.2022.107435

Abstract

Abstract

Abstract

Following the line of classification-based two-sample testing, tests based on the Random Forest classifier are proposed. The developed tests are easy to use, require almost no tuning, and are applicable for any distribution on R^d. Furthermore, the built-in variable importance measure of the Random Forest gives potential insights into which variables make out the difference in distribution. An asymptotic power analysis for the proposed tests is conducted. Finally, two real-world applications illustrate the usefulness of the introduced

Metrics

Downloads

73 since deposited on 2022-02-07
Acq. date: 2025-11-13

Views

102 since deposited on 2022-02-07
Acq. date: 2025-11-13

Additional indexing

Creators (Authors)

  • Hediger, Simon
    affiliation.icon.alt
  • Michel, Loris
    affiliation.icon.alt
  • Näf, Jeffrey
    affiliation.icon.alt

Journal/Series Title

Journal/Series Title

Journal/Series Title

Volume

Volume

Volume
170

Page range/Item number

Page range/Item number

Page range/Item number
107435

Item Type

Item Type

Item Type
Journal Article

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Scope

Scope

Scope
Discipline-based scholarship (basic research)

Language

Language

Language
English

Publication date

Publication date

Publication date
2022-06-01

Date available

Date available

Date available
2022-02-07

Publisher

Publisher

Publisher

ISSN or e-ISSN

ISSN or e-ISSN

ISSN or e-ISSN
0167-9473

OA Status

OA Status

OA Status
Hybrid

Free Access at

Free Access at

Free Access at
DOI

Other Identification Number

Other Identification Number

Other Identification Number
merlin-id:21963

Related URLs

Related URLs

Related URLs

Metrics

Downloads

73 since deposited on 2022-02-07
Acq. date: 2025-11-13

Views

102 since deposited on 2022-02-07
Acq. date: 2025-11-13

Citations

Citation copied

Hediger, S., Michel, L., & Näf, J. (2022). On the use of random forest for two-sample testing. Computational Statistics & Data Analysis, 170, 107435. https://doi.org/10.1016/j.csda.2022.107435

Hybrid Open Access
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image