Publication:

Densify: An R package to reduce empty cells in dataframes of typological linguistic data

Date

Date

Date
2024
Journal Article
Published version

Citations

Citation copied

Graff, A., Lischka, M., Zakharko, T., Furrer, R., & Bickel, B. (2024). Densify: An R package to reduce empty cells in dataframes of typological linguistic data. Journal of Open Source Software, 9(101), 7024. https://doi.org/10.21105/joss.07024

Abstract

Abstract

Abstract

The R package densify provides a procedure to prune input data frames containing empty cells (or cells with values {?} or {NA}) to denser sub-matrices with fewer empty cells. The pruning process trades off a series of variably weighted concerns, including data retention, coding density (proportion of non-empty cells) and taxonomic diversity of rows (representing for example phylogenetic relations). Users can adapt the relative weights given to these concerns through various parameters so that the densification process best fits their

Metrics

Citations

Additional indexing

Creators (Authors)

Journal/Series Title

Journal/Series Title

Journal/Series Title

Volume

Volume

Volume
9

Number

Number

Number
101

Page range/Item number

Page range/Item number

Page range/Item number
7024

Item Type

Item Type

Item Type
Journal Article

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Language

Language

Language
English

Publication date

Publication date

Publication date
2024-09-06

Date available

Date available

Date available
2024-09-23

Publisher

Publisher

Publisher

ISSN or e-ISSN

ISSN or e-ISSN

ISSN or e-ISSN
2475-9066

Additional Information

Additional Information

Additional Information
Conclusions: The R package densify provides users with a flexible and explicit method to generate submatrices from an input matrix in a mathematically principled way. The package documents case examples using a standard sparse linguistic dataset (WALS) and the standard linguistic taxonomy provided by Glottolog. Examples and further usage details for this software are found in the vignette hosted in the software repository on GitHub. Acknowledgements: The authors declare that there are no conflicts of interest.

OA Status

OA Status

OA Status
Gold

Free Access at

Free Access at

Free Access at
DOI

Metrics

Citations

Citations

Citation copied

Graff, A., Lischka, M., Zakharko, T., Furrer, R., & Bickel, B. (2024). Densify: An R package to reduce empty cells in dataframes of typological linguistic data. Journal of Open Source Software, 9(101), 7024. https://doi.org/10.21105/joss.07024

Gold Open Access
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image