Publication: Densify: An R package to reduce empty cells in dataframes of typological linguistic data
Densify: An R package to reduce empty cells in dataframes of typological linguistic data
Date
Date
Date
| cris.virtual.orcid | 0000-0002-9087-0565 | |
| cris.virtual.orcid | 0000-0002-7703-3471 | |
| cris.virtual.orcid | 0000-0002-6319-2332 | |
| cris.virtualsource.orcid | 0a73188e-c464-488a-b544-64ea66244d77 | |
| cris.virtualsource.orcid | 19102f9f-d890-4292-ac62-bb028e4f3c1b | |
| cris.virtualsource.orcid | b9152a18-bf87-4222-a67d-211bfb1d8bf1 | |
| dc.contributor.institution | University of Zurich | |
| dc.date.accessioned | 2024-09-23T12:04:42Z | |
| dc.date.available | 2024-09-23T12:04:42Z | |
| dc.date.issued | 2024-09-06 | |
| dc.description.abstract | The R package densify provides a procedure to prune input data frames containing empty cells (or cells with values {?} or {NA}) to denser sub-matrices with fewer empty cells. The pruning process trades off a series of variably weighted concerns, including data retention, coding density (proportion of non-empty cells) and taxonomic diversity of rows (representing for example phylogenetic relations). Users can adapt the relative weights given to these concerns through various parameters so that the densification process best fits their needs. As such, the software is useful for several purposes, including the densification of sparse input matrices and the subsampling of large input matrices according to a procedure that is sensitive to taxonomic structure. | |
| dc.identifier.doi | 10.21105/joss.07024 | |
| dc.identifier.issn | 2475-9066 | |
| dc.identifier.uri | https://www.zora.uzh.ch/handle/20.500.14742/221416 | |
| dc.language.iso | eng | |
| dc.subject.ddc | 510 Mathematics | |
| dc.subject.ddc | 490 Other languages | |
| dc.subject.ddc | 890 Other literatures | |
| dc.subject.ddc | 410 Linguistics | |
| dc.title | Densify: An R package to reduce empty cells in dataframes of typological linguistic data | |
| dc.type | article | |
| dcterms.accessRights | info:eu-repo/semantics/openAccess | |
| dcterms.bibliographicCitation.journaltitle | Journal of Open Source Software | |
| dcterms.bibliographicCitation.number | 101 | |
| dcterms.bibliographicCitation.originalpublishername | Open Journals | |
| dcterms.bibliographicCitation.pagestart | 7024 | |
| dcterms.bibliographicCitation.volume | 9 | |
| dspace.entity.type | Publication | en |
| uzh.contributor.author | Graff, Anna | |
| uzh.contributor.author | Lischka, Marc | |
| uzh.contributor.author | Zakharko, Taras | |
| uzh.contributor.author | Furrer, Reinhard | |
| uzh.contributor.author | Bickel, Balthasar | |
| uzh.contributor.correspondence | Yes | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.document.availability | published_version | |
| uzh.eprint.datestamp | 2024-09-23 12:04:42 | |
| uzh.eprint.lastmod | 2025-02-04 19:40:17 | |
| uzh.eprint.statusChange | 2024-09-23 12:04:42 | |
| uzh.harvester.eth | Yes | |
| uzh.harvester.nb | No | |
| uzh.identifier.doi | 10.5167/uzh-262418 | |
| uzh.jdb.eprintsId | 42654 | |
| uzh.note.public | Conclusions: The R package densify provides users with a flexible and explicit method to generate submatrices from an input matrix in a mathematically principled way. The package documents case examples using a standard sparse linguistic dataset (WALS) and the standard linguistic taxonomy provided by Glottolog. Examples and further usage details for this software are found in the vignette hosted in the software repository on GitHub. Acknowledgements: The authors declare that there are no conflicts of interest. | |
| uzh.oastatus.unpaywall | gold | |
| uzh.oastatus.zora | Gold | |
| uzh.publication.citation | Graff, Anna; Lischka, Marc; Zakharko, Taras; Furrer, Reinhard; Bickel, Balthasar (2024). Densify: An R package to reduce empty cells in dataframes of typological linguistic data. Journal of Open Source Software, 9(101):7024. | |
| uzh.publication.freeAccessAt | doi | |
| uzh.publication.originalwork | original | |
| uzh.publication.publishedStatus | final | |
| uzh.workflow.doaj | uzh.workflow.doaj.true | |
| uzh.workflow.eprintid | 262418 | |
| uzh.workflow.fulltextStatus | public | |
| uzh.workflow.revisions | 18 | |
| uzh.workflow.rightsCheck | offen | |
| uzh.workflow.source | Crossref:10.21105/joss.07024 | |
| uzh.workflow.status | archive | |
| Files | ||
| Publication available in collections: |