FSE 2024 CoLos
32nd ACM International Conference on the Foundations of Software Engineering (FSE 2024)

Powered by

20th International Conference on Predictive Models and Data Analytics in Software Engineering (PROMISE 2024), July 16, 2024, Porto de Galinhas, Brazil

PROMISE 2024 – Proceedings

Contents - Abstracts - Authors

Twitter: https://twitter.com/esecfse

20th International Conference on Predictive Models and Data Analytics in Software Engineering (PROMISE 2024)

Frontmatter

Title Page

Welcome from the Chairs

PROMISE 2024 Organization

Keynote

The Ever-Evolving Promises of Data in Software Ecosystems: Models, AI, and Analytics (Keynote)
Raula Gaikovina Kula

(Nara Institute of Science and Technology, Japan)

Publisher's Version

Papers

Graph Neural Network vs. Large Language Model: A Comparative Analysis for Bug Report Priority and Severity Prediction
Jagrit Acharya

and Gouri Ginde

(University of Calgary, Canada)

Publisher's Version

Smarter Project Selection for Software Engineering Research
Tapajit Dey

, Jonathan Loungani

, and James Ivers

(Carnegie Mellon University, USA)

Publisher's Version

Published Artifact

Artifacts Available

Sociotechnical Dynamics in Open Source Smart Contract Repositories: An Exploratory Data Analysis of Curated High Market Value Projects
Saori Costa

, Matheus Paixao

, Igor Steinmacher

, Pamella Soares

, Allysson Allex Araújo

, and Jerffeson Souza

(State University of Ceará, Brazil; Northern Arizona University, USA; Federal University of Cariri, Brazil)

Publisher's Version

Info

A Curated Solidity Smart Contracts Repository of Metrics and Vulnerability
Giacomo Ibba

, Sabrina Aufiero

, Rumyana Neykova

, Silvia Bartolucci

, Marco Ortu

, Roberto Tonelli

, and Giuseppe Destefanis

(University of Cagliari, Italy; University College London, United Kingdom; Brunel University, United Kingdom)

Publisher's Version

MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository Discovery
Jafar Akhoundali

, Sajad Rahim Nouri

, Kristian Rietveld

, and Olga Gadyatskaya

(Leiden University, Netherlands; Islamic Azad University of Ramsar, Iran)
Vulnerability datasets have become an important instrument in software security research, being used to develop automated, machine learning-based vulnerability detection and patching approaches. Yet, any limitations of these datasets may translate into inadequate performance of the developed solutions. For example, the limited size of a vulnerability dataset may restrict the applicability of deep learning techniques. In our work, we have designed and implemented a novel workflow with several heuristic methods to combine state-of-the-art methods related to CVE fix commits gathering. As a consequence of our improvements, we have been able to gather the largest programming language-independent real-world dataset of CVE vulnerabilities with the associated fix commits. Our dataset containing 26,617 unique CVEs coming from 6,945 unique GitHub projects is, to the best of our knowledge, by far the biggest CVE vulnerability dataset with fix commits available today. These CVEs are associated with 31,883 unique commits that fixed those vulnerabilities. Compared to prior work, our dataset brings about a 397% increase in CVEs, a 295% increase in covered open-source projects, and a 480% increase in commit fixes. Our larger dataset thus substantially improves over the current real-world vulnerability datasets and enables further progress in research on vulnerability detection and software security. We release to the community a 14GB PostgreSQL database that contains information on CVEs up to January 24, 2024, CWEs of each CVE, files and methods changed by each commit, and repository metadata. Additionally, patch files related to the fix commits are available as a separate package. Furthermore, we make our dataset collection tool also available to the community.

Publisher's Version

Published Artifact

Artifacts Available

Prioritising GitHub Priority Labels
James Caddy

and Christoph Treude

(University of Adelaide, Australia; Singapore Management University, Singapore)

Publisher's Version

Published Artifact

Info

Artifacts Available

Predicting Fairness of ML Software Configurations
Salvador Robles Herrera

, Verya Monjezi

, Vladik Kreinovich

, Ashutosh Trivedi

, and Saeid Tizpaz-Niari

(University of Texas at El Paso, USA; University of Colorado Boulder, USA)

Publisher's Version

proc time: 2.74