Tools and Datasets
AmPyfier by Ebert Schoofs, Mehrdad Abdi (University of Antwerp), 2022
test amplification for Python.
» https://ansymore.uantwerpen.be/artefacts/ampyfier
Related paper: AmPyfier: Test amplification in Python by Ebert Schoofs, Mehrdad Abdi and Serge Demeyer in Journal of Software: Evolution and Process
Continuous Integration Antipattern Analyzer by Freek De Sagher (University of Antwerp), 2022
a command line tool to analyze CI workflows in git repositories written in Python 3.10.
» https://github.com/FreekDS/CIAN
HyDRa by Maxime Gobert (University of Namur), 2022
a framework for hybrid polystore modeling and manipulation
» https://github.com/gobertm/HyDRa
License compatibility checking for npm dependencies by Ilyas Said Makari, Ahmed Zerouali, Coen De Roover (Vrije Universiteit Brussel), 2022
Tool to check license compatibilities for npm dependencies
» https://doi.org/10.5281/zenodo.5913761
LiFUSO by Camilo Velázquez-Rodríguez (Vrije Universiteit Brussel), 2022
A tool for library feature unveiling based on Stack Overflow posts
» https://github.com/softwarelanguageslab/lifuso
PaReco by Poedjadevie Kadjel Ramkisoen and John Businge (University of Antwerp), 2022
Patched and missed patch code clone detection tool.
» https://github.com/KadjelRamkisoen/PaReco
Related paper: PaReco - Patched Clones and Missed Patches among the Divergent Variants of a Software Family by Poedjadevie Kadjel Ramkisoen, John Businge, Brent van Bradel, Alexandre Decan, Serge Demeyer, Coen De Roover, Foutse Khomh in The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) 2022
Small-Mince by Mehrdad Abdi (University of Antwerp), 2022
A tool to slice tests in Pharo Smalltalk.
» https://github.com/mabdi/small-mince
Related paper: Test Transplantation through Dynamic Test Slicing by Mehrdad Abdi and Serge Demeyer in International Working Conference on Source Code Analysis and Manipulation (SCAM 2022), Nier Track
A dataset of missed Opportunity/Effort Duplication patches in divergent variants by Poedjadevie Kadjel Ramkisoen and John Businge (University of Antwerp), 2022
A dataset of 364 (source->target) variant pairs with 8,323 patches resulting in a curated dataset containing 1,116 cases of effort duplication and 1,008 cases of missed opportunities.
» https://figshare.com/s/4952174de6c21fa9874f
Related paper: PaReco - Patched Clones and Missed Patches among the Divergent Variants of a Software Family by Poedjadevie Kadjel Ramkisoen, John Businge, Brent van Bradel, Alexandre Decan, Serge Demeyer, Coen De Roover, Foutse Khomh in The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) 2022
AmPyfier Dataset by Ebert Schoofs (University of Antwerp), 2022
Dataset for test amplification for Python.
» https://github.com/SchoofsEbert/AmPyfier_evaluation
Related paper: AmPyfier: Test amplification in Python by Ebert Schoofs, Mehrdad Abdi and Serge Demeyer in Journal of Software: Evolution and Process
Fork variant pair dataset by John Businge (University of Antwerp), 2022
A dataset of 11K variant pairs from three ecosystems of Android, .NET, and JavaScript.
» https://github.com/johnxu21/emse2021
Related paper: Reuse and Maintenance Practices among Divergent Forks in Three Software Ecosystems by John Businge, Moses Openja, Sarah Nadi, Thorsten Berger in Empirical Software Engineering
Test amplification in Pharo by Mehrdad Abdi (University of Antwerp), 2022
A dataset of real projects in Pharo and their amplified tests.
» https://github.com/mabdi/SmallAmp-evaluations
Related paper: Small-Amp: Test amplification in a dynamically typed language by Mehrdad Abdi, Henrique Rocha, Serge Demeyer and Alexandre Bergel in Empirical Software Engineering
Test transplantation feasibility dataset by Igor Schittekat (University of Antwerp), 2022
Dataset for test transplantation feasibility in Python.
» https://figshare.com/s/304e0f741c3879b6e068
Related paper: Can We Increase the Test-coverage in Libraries using Dependent Projects’ Test-suites? by Igor Schittekat and Mehrdad Abdi and Serge Demeyer in International Conference on Evaluation and Assessment in Software Engineering 2022, NIER track
Survey dataset for variant fork motivations by variant developers by John Businge, Ahmed Zerouali, Alexandre Decan, Tom Mens, Serge Demeyer (University of Antwerp), 2021
A dataset resulting from an online survey, conducted with 105 maintainers involved in different active variant forks hosted on Github.
» https://doi.org/10.5281/zenodo.5855808
Related paper: Variant Forks - Motivations and Impediments by John Businge, Ahmed Zerouali, Alexandre Decan, Tom Mens, Serge Demeyer, Coen De Roover in 29th International Conference on Software Analysis, Evolution and Reengineering (SANER 2022)
BoDeGha by Mehdi Golzadeh (University of Mons), 2020
BoDeGha is an automated tool to identify bots in GitHub repositories by analysing pull request and issue comments.
» https://github.com/mehdigolzadeh/BoDeGha
BoDeGiC by Mehdi Golzadeh (University of Mons), 2020
BoDeGiC is an automated tool to identify bots in GitHub repositories by analysing git commit messages.
» https://github.com/mehdigolzadeh/BoDeGiC
GAP by Alexandre Decan (University of Mons), 2020
GAP is a command-line tool for forecasting future commit activity of contributors involved in software projects distributed through git.
» https://github.com/AlexandreDecan/gap
Related paper: GAP: Forecasting Commit Activity in git Projects by Alexandre Decan, Eleni Constantinou, Tom Mens, Henrique Rocha in Journal of Systems and Software
MUTAMA by Camilo Velázquez-Rodríguez (Vrije Universiteit Brussel), 2020
An open source tool recommending MVNRepository tags for a given Java library.
» https://github.com/cvelazquezr/MUTAMA
RESICO by Camilo Velázquez-Rodríguez (Vrije Universiteit Brussel), 2020
An open source tool for resolving the simple names of API types in incomplete code snippets (e.g., from Stack Overflow) to their fully-qualified name.
» https://github.com/cvelazquezr/RESICO
SCARE by Ruben Opdebeeck (Vrije Universiteit Brussel), 2020
Tool to discover and collect a dataset of Ansible roles from Ansible Galaxy and extract structural changes between versions of a role.
» https://github.com/ROpdebee/SCARE
Ansible Galaxy roles, versions, and metadata. by Ruben Opdebeeck, Ahmed Zerouali, Camilo Velázquez-Rodríguez and Coen De Roover (Vrije Universiteit Brussel), 2020
A large dataset of 24,620 Ansible roles, repositories, metadata and structural changes between role releases.
» https://doi.org/10.5281/zenodo.4039514
Human/Bot Ground-Truth Dataset by M. Golzadeh, A. Decan, D. Legay and T. Mens (University of Mons), 2020
A ground-truth dataset of 5,000 commenters active in GitHub issues and pull requests, manually labeled (with high interrater agreement) as either "bot" or "human".
» http://doi.org/10.5281/zenodo.4000388
ConPan by Ahmed Zerouali and Valerio Cosentino (University of Mons, Bitergia), 2019
ConPan inspects Docker containers, extracts their installed packages and analyzes their technical lag, vulnerabilities and other type of bugs.
» https://github.com/neglectos/ConPan
Related paper: ConPan: A Tool to Analyze Packages in Software Containers by Ahmed Zerouali, Valerio Cosentino, Jesus M. Gonzalez-Barahona, Gregorio Robles, Tom Mens in International Conference on Mining Software Repositories (MSR 2019)
SmallAmp by Mehrdad Abdi (University of Antwerp), 2019
SmallAmp is a test amplification tool in Pharo Smalltalk; it creates new test methods based on manually written ones to increase mutation coverage.
» https://github.com/mabdi/small-amp
Related paper: Test amplification in the pharo smalltalk ecosystem by Mehrdad Abdi, Henrique Rocha, Serge Demeyer in In Proceedings IWST 2019 (International Workshop on Smalltalk Technologies). Cologne, 2019
portion by Alexandre Decan (University of Mons), 2018
portion is a Python library providing data structures and operations for intervals, supporting intervals of any comparable objects and interval sets out of the box.
» https://pypi.org/project/portion/
» https://github.com/AlexandreDecan/portion (documentation)
Sismic by Alexandre Decan and Tom Mens (University of Mons), 2018
Sismic is a Python library providing a tool suite to define, simulate, execute and test statecharts, supporting test-driven development, behaviour-driven development, design by contract, and property statecharts to monitor violations of behavioural properties during statechart execution.
» https://pypi.org/project/sismic/
» https://sismic.readthedocs.io/ (documentation)
» https://github.com/AlexandreDecan/sismic (source)
Related paper: Sismic - A Python library for statechart execution and testing by Alexandre Decan, Tom Mens in SoftwareX
SQLInspect by Csaba Nagy, Anthony Cleve (University of Namur), 2018
A static SQL analyzer with plug-in support for Eclipse to inspect database usage in Java applications. It statically extracts SQL queries embedded in Java and performs various analyses on them.
» https://bitbucket.org/csnagy/sqlinspect
Related paper: SQLInspect: a static analyzer to inspect database usage in Java applications by Csaba Nagy, Anthony Cleve in 40th International Conference on Software Engineering (ICSE 2018)