ZORA (Zurich Open Repository and Archive)

Reducing Redundancies in Multi-Revision Code Analysis

Alexandru, Carol V; Panichella, Sebastiano; Gall, Harald C (2017). Reducing Redundancies in Multi-Revision Code Analysis. In: IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), Klagenfurt, Austria, 20 February 2017 - 24 February 2017, IEEE.

Abstract

Software engineering research often requires analyzing multiple revisions of several software projects, be it to make and test predictions or to observe and identify patterns in how software evolves. However, code analysis tools are almost exclusively designed for the analysis of one specific version of the code, and the time and resource requirements grow linearly with each additional revision to be analyzed. Thus, code studies often observe a relatively small number of revisions and projects. Furthermore, each programming ecosystem provides dedicated tools, hence researchers typically only analyze code of one language, even when researching topics that should generalize to other ecosystems. To alleviate these issues, frameworks and models have been developed to combine analysis tools or automate the analysis of multiple revisions, but little research has gone into actually removing redundancies in multi-revision, multi-language code analysis. We present a novel end-to-end approach that systematically avoids redundancies at every step of the way: when reading sources from version control, during parsing, in the internal code representation, and during the actual analysis. We evaluate our open-source implementation, LISA, on the full history of 300 projects, written in 3 different programming languages, computing basic code metrics for over 1.1 million program revisions. When analyzing many revisions, LISA requires less than a second on average to compute basic code metrics for all files in a single revision, even for projects consisting of millions of lines of code.

Additional indexing

Item Type: Conference or Workshop Item (Paper), refereed, original work
Communities & Collections: 03 Faculty of Economics > Department of Informatics
Dewey Decimal Classification: 000 Computer science, knowledge & systems
Scopus Subject Areas: Physical Sciences > Software
Scope: Discipline-based scholarship (basic research)
Language: English
Event End Date: 24 February 2017
Deposited On: 10 Oct 2017 12:23
Last Modified: 06 Mar 2024 14:22
Publisher: IEEE
OA Status: Green
Publisher DOI: https://doi.org/10.1109/SANER.2017.7884617
Other Identification Number: merlin-id:14106

Statistics

Citations

6 citations in Web of Science®
7 citations in Scopus®

Downloads

161 downloads since deposited on 10 Oct 2017
18 downloads in the past 12 months