Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

git2net - Mining Time-Stamped Co-Editing Networks from Large git Repositories

Gote, Christoph; Scholtes, Ingo; Schweitzer, Frank (2019). git2net - Mining Time-Stamped Co-Editing Networks from Large git Repositories. In: 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR), Montreal, QC, 25 June 2019 - 1 July 2019. IEEE, 433-444.

Abstract

Data from software repositories have become an important foundation for the empirical study of software engineering processes. A recurring theme in the repository mining literature is the inference of developer networks capturing e.g. collaboration, coordination, or communication from the commit history of projects. Most of the studied networks are based on the co-authorship of software artefacts defined at the level of files, modules, or packages. While this approach has led to insights into the social aspects of software development, it neglects detailed information on code changes and code ownership, e.g. which exact lines of code have been authored by which developers, that is contained in the commit log of software projects.
Addressing this issue, we introduce git2net, a scalable python software that facilitates the extraction of fine-grained co-editing networks in large git repositories. It uses text mining techniques to analyse the detailed history of textual modifications within files. This information allows us to construct directed, weighted, and time-stamped networks, where a link signifies that one developer has edited a block of source code originally written by another developer. Our tool is applied in case studies of an Open Source and a commercial software project. We argue that it opens up a massive new source of high-resolution data on human collaboration patterns.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:03 Faculty of Economics > Department of Informatics
Dewey Decimal Classification:000 Computer science, knowledge & systems
Scopus Subject Areas:Physical Sciences > Computer Science Applications
Physical Sciences > Software
Scope:Contributions to practice (applied research)
Language:English
Event End Date:1 July 2019
Deposited On:17 Sep 2019 12:32
Last Modified:06 Mar 2024 14:30
Publisher:IEEE
ISBN:978-1-7281-3412-3
OA Status:Closed
Publisher DOI:https://doi.org/10.1109/MSR.2019.00070
Other Identification Number:merlin-id:18272

Metadata Export

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

0 downloads since deposited on 17 Sep 2019
0 downloads since 12 months

Authors, Affiliations, Collaborations

Similar Publications