Tools and Datasets

AmPyfier by Ebert Schoofs, Mehrdad Abdi (University of Antwerp), 2022

test amplification for Python.

» https://ansymore.uantwerpen.be/artefacts/ampyfier

Related paper: AmPyfier: Test amplification in Python by Ebert Schoofs, Mehrdad Abdi and Serge Demeyer in Journal of Software: Evolution and Process

Continuous Integration Antipattern Analyzer by Freek De Sagher (University of Antwerp), 2022

a command line tool to analyze CI workflows in git repositories written in Python 3.10.

» https://github.com/FreekDS/CIAN

HyDRa by Maxime Gobert (University of Namur), 2022

a framework for hybrid polystore modeling and manipulation

» https://github.com/gobertm/HyDRa

License compatibility checking for npm dependencies by Ilyas Said Makari, Ahmed Zerouali, Coen De Roover (Vrije Universiteit Brussel), 2022

Tool to check license compatibilities for npm dependencies

» https://doi.org/10.5281/zenodo.5913761

LiFUSO by Camilo Velázquez-Rodríguez (Vrije Universiteit Brussel), 2022

A tool for library feature unveiling based on Stack Overflow posts

» https://github.com/softwarelanguageslab/lifuso

PaReco by Poedjadevie Kadjel Ramkisoen and John Businge (University of Antwerp), 2022

Patched and missed patch code clone detection tool.

» https://github.com/KadjelRamkisoen/PaReco

Related paper: PaReco - Patched Clones and Missed Patches among the Divergent Variants of a Software Family by Poedjadevie Kadjel Ramkisoen, John Businge, Brent van Bradel, Alexandre Decan, Serge Demeyer, Coen De Roover, Foutse Khomh in The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) 2022

Small-Mince by Mehrdad Abdi (University of Antwerp), 2022

A tool to slice tests in Pharo Smalltalk.

» https://github.com/mabdi/small-mince

Related paper: Test Transplantation through Dynamic Test Slicing by Mehrdad Abdi and Serge Demeyer in International Working Conference on Source Code Analysis and Manipulation (SCAM 2022), Nier Track

A dataset of missed Opportunity/Effort Duplication patches in divergent variants by Poedjadevie Kadjel Ramkisoen and John Businge (University of Antwerp), 2022

A dataset of 364 (source->target) variant pairs with 8,323 patches resulting in a curated dataset containing 1,116 cases of effort duplication and 1,008 cases of missed opportunities.

» https://figshare.com/s/4952174de6c21fa9874f

AmPyfier Dataset by Ebert Schoofs (University of Antwerp), 2022

Dataset for test amplification for Python.

» https://github.com/SchoofsEbert/AmPyfier_evaluation

Related paper: AmPyfier: Test amplification in Python by Ebert Schoofs, Mehrdad Abdi and Serge Demeyer in Journal of Software: Evolution and Process

Fork variant pair dataset by John Businge (University of Antwerp), 2022

A dataset of 11K variant pairs from three ecosystems of Android, .NET, and JavaScript.

» https://github.com/johnxu21/emse2021

Related paper: Reuse and Maintenance Practices among Divergent Forks in Three Software Ecosystems by John Businge, Moses Openja, Sarah Nadi, Thorsten Berger in Empirical Software Engineering

Test amplification in Pharo by Mehrdad Abdi (University of Antwerp), 2022

A dataset of real projects in Pharo and their amplified tests.

» https://github.com/mabdi/SmallAmp-evaluations

Related paper: Small-Amp: Test amplification in a dynamically typed language by Mehrdad Abdi, Henrique Rocha, Serge Demeyer and Alexandre Bergel in Empirical Software Engineering

Test transplantation feasibility dataset by Igor Schittekat (University of Antwerp), 2022

Dataset for test transplantation feasibility in Python.

» https://figshare.com/s/304e0f741c3879b6e068

Related paper: Can We Increase the Test-coverage in Libraries using Dependent Projects’ Test-suites? by Igor Schittekat and Mehrdad Abdi and Serge Demeyer in International Conference on Evaluation and Assessment in Software Engineering 2022, NIER track

Survey dataset for variant fork motivations by variant developers by John Businge, Ahmed Zerouali, Alexandre Decan, Tom Mens, Serge Demeyer (University of Antwerp), 2021

A dataset resulting from an online survey, conducted with 105 maintainers involved in different active variant forks hosted on Github.

» https://doi.org/10.5281/zenodo.5855808

Related paper: Variant Forks - Motivations and Impediments by John Businge, Ahmed Zerouali, Alexandre Decan, Tom Mens, Serge Demeyer, Coen De Roover in 29th International Conference on Software Analysis, Evolution and Reengineering (SANER 2022)

BoDeGha by Mehdi Golzadeh (University of Mons), 2020

BoDeGha is an automated tool to identify bots in GitHub repositories by analysing pull request and issue comments.

» https://github.com/mehdigolzadeh/BoDeGha

BoDeGiC by Mehdi Golzadeh (University of Mons), 2020

BoDeGiC is an automated tool to identify bots in GitHub repositories by analysing git commit messages.

» https://github.com/mehdigolzadeh/BoDeGiC

GAP by Alexandre Decan (University of Mons), 2020

GAP is a command-line tool for forecasting future commit activity of contributors involved in software projects distributed through git.

» https://github.com/AlexandreDecan/gap

Related paper: GAP: Forecasting Commit Activity in git Projects by Alexandre Decan, Eleni Constantinou, Tom Mens, Henrique Rocha in Journal of Systems and Software

MUTAMA by Camilo Velázquez-Rodríguez (Vrije Universiteit Brussel), 2020

An open source tool recommending MVNRepository tags for a given Java library.

» https://github.com/cvelazquezr/MUTAMA

RESICO by Camilo Velázquez-Rodríguez (Vrije Universiteit Brussel), 2020

An open source tool for resolving the simple names of API types in incomplete code snippets (e.g., from Stack Overflow) to their fully-qualified name.

» https://github.com/cvelazquezr/RESICO

SCARE by Ruben Opdebeeck (Vrije Universiteit Brussel), 2020

Tool to discover and collect a dataset of Ansible roles from Ansible Galaxy and extract structural changes between versions of a role.

» https://github.com/ROpdebee/SCARE

Ansible Galaxy roles, versions, and metadata. by Ruben Opdebeeck, Ahmed Zerouali, Camilo Velázquez-Rodríguez and Coen De Roover (Vrije Universiteit Brussel), 2020

A large dataset of 24,620 Ansible roles, repositories, metadata and structural changes between role releases.

» https://doi.org/10.5281/zenodo.4039514

Human/Bot Ground-Truth Dataset by M. Golzadeh, A. Decan, D. Legay and T. Mens (University of Mons), 2020

A ground-truth dataset of 5,000 commenters active in GitHub issues and pull requests, manually labeled (with high interrater agreement) as either "bot" or "human".

» http://doi.org/10.5281/zenodo.4000388

ConPan by Ahmed Zerouali and Valerio Cosentino (University of Mons, Bitergia), 2019

ConPan inspects Docker containers, extracts their installed packages and analyzes their technical lag, vulnerabilities and other type of bugs.

» https://github.com/neglectos/ConPan

Related paper: ConPan: A Tool to Analyze Packages in Software Containers by Ahmed Zerouali, Valerio Cosentino, Jesus M. Gonzalez-Barahona, Gregorio Robles, Tom Mens in International Conference on Mining Software Repositories (MSR 2019)

SmallAmp by Mehrdad Abdi (University of Antwerp), 2019

SmallAmp is a test amplification tool in Pharo Smalltalk; it creates new test methods based on manually written ones to increase mutation coverage.

» https://github.com/mabdi/small-amp

Related paper: Test amplification in the pharo smalltalk ecosystem by Mehrdad Abdi, Henrique Rocha, Serge Demeyer in In Proceedings IWST 2019 (International Workshop on Smalltalk Technologies). Cologne, 2019

portion by Alexandre Decan (University of Mons), 2018

portion is a Python library providing data structures and operations for intervals, supporting intervals of any comparable objects and interval sets out of the box.

» https://pypi.org/project/portion/
» https://github.com/AlexandreDecan/portion (documentation)

Sismic by Alexandre Decan and Tom Mens (University of Mons), 2018

Sismic is a Python library providing a tool suite to define, simulate, execute and test statecharts, supporting test-driven development, behaviour-driven development, design by contract, and property statecharts to monitor violations of behavioural properties during statechart execution.

» https://pypi.org/project/sismic/
» https://sismic.readthedocs.io/ (documentation)
» https://github.com/AlexandreDecan/sismic (source)

Related paper: Sismic - A Python library for statechart execution and testing by Alexandre Decan, Tom Mens in SoftwareX

SQLInspect by Csaba Nagy, Anthony Cleve (University of Namur), 2018

A static SQL analyzer with plug-in support for Eclipse to inspect database usage in Java applications. It statically extracts SQL queries embedded in Java and performs various analyses on them.

» https://bitbucket.org/csnagy/sqlinspect

Related paper: SQLInspect: a static analyzer to inspect database usage in Java applications by Csaba Nagy, Anthony Cleve in 40th International Conference on Software Engineering (ICSE 2018)