Workshop SDD 2023 – Author Index |
Contents -
Abstracts -
Authors
|
Alamir, Salwa |
SDD '23: "Code Revert Prediction with ..."
Code Revert Prediction with Graph Neural Networks: A Case Study at J.P. Morgan Chase
Yulong Pei, Salwa Alamir, Rares Dolga, and Sameena Shah (JPMorgan AI Research, UK) Code revert prediction, a specialized form of software defect detection, aims to forecast or predict the likelihood of code changes being reverted or rolled back in software development. This task is very important in practice because by identifying code changes that are more prone to being reverted, developers and project managers can proactively take measures to prevent issues, improve code quality, and optimize development processes. However, compared to code defect detection, code revert prediction has been rarely studied in previous research. Additionally, many previous methods for code defect detection relied on independent features but ignored relationships between code scripts. Moreover, new challenges are introduced due to constraints in an industry setting such as company regulation, limited features and large-scale codebase. To overcome these limitations, this paper presents a systematic empirical study for code revert prediction that integrates the code import graph with code features. Different strategies to address anomalies and data imbalance have been implemented including graph neural networks with imbalance classification and anomaly detection. We conduct the experiments on real-world code commit data within J.P. Morgan Chase which is extremely imbalanced in order to make a comprehensive comparison of these different approaches for the code revert prediction problem. @InProceedings{SDD23p1, author = {Yulong Pei and Salwa Alamir and Rares Dolga and Sameena Shah}, title = {Code Revert Prediction with Graph Neural Networks: A Case Study at J.P. Morgan Chase}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {1--5}, doi = {10.1145/3617572.3617879}, year = {2023}, } Publisher's Version SDD '23: "Log Summarisation for Defect ..." Log Summarisation for Defect Evolution Analysis Rares Dolga, Ran Zmigrod, Rui Silva, Salwa Alamir, and Sameena Shah (JPMorgan Chase, UK) Log analysis and monitoring are essential aspects in software maintenance and identifying defects. In particular, the temporal nature and vast size of log data leads to an interesting and important research question: How can logs be summarised and monitored over time? While this has been a fundamental topic of research in the software engineering community, work has typically focused on heuristic-, syntax-, or static-based methods. In this work, we suggest an online semantic-based clustering approach to error logs that dynamically updates the log clusters to enable monitoring code error life-cycles. We also introduce a novel metric to evaluate the performance of temporal log clusters. We test our system and evaluation metric with an industrial dataset and find that our solution outperforms similar systems. We hope that our work encourages further temporal exploration in defect datasets. @InProceedings{SDD23p11, author = {Rares Dolga and Ran Zmigrod and Rui Silva and Salwa Alamir and Sameena Shah}, title = {Log Summarisation for Defect Evolution Analysis}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {11--16}, doi = {10.1145/3617572.3617881}, year = {2023}, } Publisher's Version |
|
Blum, Florian |
SDD '23: "Towards Assessing the Real-World ..."
Towards Assessing the Real-World Impact of Defects in Blockchain-Based Smart Contracts
Michael Hettmer, Benedikt Severin, Florian Blum, and Volker Gruhn (University of Duisburg-Essen, Germany) Blockchain-based smart contracts enable a wide range of use cases by relying on trustless and guaranteed code execution. The immutability of blockchains means that defects in smart contracts cannot be fixed once the code has been deployed. Previous research has focused on identifying defects and developing tools to improve defect detection. However, several existing blockchain defect datasets rely solely on expert knowledge or static code analysis. They do not provide an objective assessment of the real-world impact of defects. We show how blockchain transactions and execution traces can support an objective evaluation of the impact of a smart contract defect. As a foundation for future research, we propose three novel blockchain-specific metrics. Using the defect dataset SmartBugs, we demonstrate the computation of the metrics and the use cases that can benefit from them. This example focuses on five distinct types of the commonly occurring reentrancy vulnerability defect as detected by Slither, Manticore and Osiris. @InProceedings{SDD23p6, author = {Michael Hettmer and Benedikt Severin and Florian Blum and Volker Gruhn}, title = {Towards Assessing the Real-World Impact of Defects in Blockchain-Based Smart Contracts}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {6--10}, doi = {10.1145/3617572.3617880}, year = {2023}, } Publisher's Version Published Artifact Artifacts Available |
|
Dolga, Rares |
SDD '23: "Code Revert Prediction with ..."
Code Revert Prediction with Graph Neural Networks: A Case Study at J.P. Morgan Chase
Yulong Pei, Salwa Alamir, Rares Dolga, and Sameena Shah (JPMorgan AI Research, UK) Code revert prediction, a specialized form of software defect detection, aims to forecast or predict the likelihood of code changes being reverted or rolled back in software development. This task is very important in practice because by identifying code changes that are more prone to being reverted, developers and project managers can proactively take measures to prevent issues, improve code quality, and optimize development processes. However, compared to code defect detection, code revert prediction has been rarely studied in previous research. Additionally, many previous methods for code defect detection relied on independent features but ignored relationships between code scripts. Moreover, new challenges are introduced due to constraints in an industry setting such as company regulation, limited features and large-scale codebase. To overcome these limitations, this paper presents a systematic empirical study for code revert prediction that integrates the code import graph with code features. Different strategies to address anomalies and data imbalance have been implemented including graph neural networks with imbalance classification and anomaly detection. We conduct the experiments on real-world code commit data within J.P. Morgan Chase which is extremely imbalanced in order to make a comprehensive comparison of these different approaches for the code revert prediction problem. @InProceedings{SDD23p1, author = {Yulong Pei and Salwa Alamir and Rares Dolga and Sameena Shah}, title = {Code Revert Prediction with Graph Neural Networks: A Case Study at J.P. Morgan Chase}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {1--5}, doi = {10.1145/3617572.3617879}, year = {2023}, } Publisher's Version SDD '23: "Log Summarisation for Defect ..." Log Summarisation for Defect Evolution Analysis Rares Dolga, Ran Zmigrod, Rui Silva, Salwa Alamir, and Sameena Shah (JPMorgan Chase, UK) Log analysis and monitoring are essential aspects in software maintenance and identifying defects. In particular, the temporal nature and vast size of log data leads to an interesting and important research question: How can logs be summarised and monitored over time? While this has been a fundamental topic of research in the software engineering community, work has typically focused on heuristic-, syntax-, or static-based methods. In this work, we suggest an online semantic-based clustering approach to error logs that dynamically updates the log clusters to enable monitoring code error life-cycles. We also introduce a novel metric to evaluate the performance of temporal log clusters. We test our system and evaluation metric with an industrial dataset and find that our solution outperforms similar systems. We hope that our work encourages further temporal exploration in defect datasets. @InProceedings{SDD23p11, author = {Rares Dolga and Ran Zmigrod and Rui Silva and Salwa Alamir and Sameena Shah}, title = {Log Summarisation for Defect Evolution Analysis}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {11--16}, doi = {10.1145/3617572.3617881}, year = {2023}, } Publisher's Version |
|
Gruhn, Volker |
SDD '23: "Towards Assessing the Real-World ..."
Towards Assessing the Real-World Impact of Defects in Blockchain-Based Smart Contracts
Michael Hettmer, Benedikt Severin, Florian Blum, and Volker Gruhn (University of Duisburg-Essen, Germany) Blockchain-based smart contracts enable a wide range of use cases by relying on trustless and guaranteed code execution. The immutability of blockchains means that defects in smart contracts cannot be fixed once the code has been deployed. Previous research has focused on identifying defects and developing tools to improve defect detection. However, several existing blockchain defect datasets rely solely on expert knowledge or static code analysis. They do not provide an objective assessment of the real-world impact of defects. We show how blockchain transactions and execution traces can support an objective evaluation of the impact of a smart contract defect. As a foundation for future research, we propose three novel blockchain-specific metrics. Using the defect dataset SmartBugs, we demonstrate the computation of the metrics and the use cases that can benefit from them. This example focuses on five distinct types of the commonly occurring reentrancy vulnerability defect as detected by Slither, Manticore and Osiris. @InProceedings{SDD23p6, author = {Michael Hettmer and Benedikt Severin and Florian Blum and Volker Gruhn}, title = {Towards Assessing the Real-World Impact of Defects in Blockchain-Based Smart Contracts}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {6--10}, doi = {10.1145/3617572.3617880}, year = {2023}, } Publisher's Version Published Artifact Artifacts Available |
|
Hettmer, Michael |
SDD '23: "Towards Assessing the Real-World ..."
Towards Assessing the Real-World Impact of Defects in Blockchain-Based Smart Contracts
Michael Hettmer, Benedikt Severin, Florian Blum, and Volker Gruhn (University of Duisburg-Essen, Germany) Blockchain-based smart contracts enable a wide range of use cases by relying on trustless and guaranteed code execution. The immutability of blockchains means that defects in smart contracts cannot be fixed once the code has been deployed. Previous research has focused on identifying defects and developing tools to improve defect detection. However, several existing blockchain defect datasets rely solely on expert knowledge or static code analysis. They do not provide an objective assessment of the real-world impact of defects. We show how blockchain transactions and execution traces can support an objective evaluation of the impact of a smart contract defect. As a foundation for future research, we propose three novel blockchain-specific metrics. Using the defect dataset SmartBugs, we demonstrate the computation of the metrics and the use cases that can benefit from them. This example focuses on five distinct types of the commonly occurring reentrancy vulnerability defect as detected by Slither, Manticore and Osiris. @InProceedings{SDD23p6, author = {Michael Hettmer and Benedikt Severin and Florian Blum and Volker Gruhn}, title = {Towards Assessing the Real-World Impact of Defects in Blockchain-Based Smart Contracts}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {6--10}, doi = {10.1145/3617572.3617880}, year = {2023}, } Publisher's Version Published Artifact Artifacts Available |
|
Pei, Yulong |
SDD '23: "Code Revert Prediction with ..."
Code Revert Prediction with Graph Neural Networks: A Case Study at J.P. Morgan Chase
Yulong Pei, Salwa Alamir, Rares Dolga, and Sameena Shah (JPMorgan AI Research, UK) Code revert prediction, a specialized form of software defect detection, aims to forecast or predict the likelihood of code changes being reverted or rolled back in software development. This task is very important in practice because by identifying code changes that are more prone to being reverted, developers and project managers can proactively take measures to prevent issues, improve code quality, and optimize development processes. However, compared to code defect detection, code revert prediction has been rarely studied in previous research. Additionally, many previous methods for code defect detection relied on independent features but ignored relationships between code scripts. Moreover, new challenges are introduced due to constraints in an industry setting such as company regulation, limited features and large-scale codebase. To overcome these limitations, this paper presents a systematic empirical study for code revert prediction that integrates the code import graph with code features. Different strategies to address anomalies and data imbalance have been implemented including graph neural networks with imbalance classification and anomaly detection. We conduct the experiments on real-world code commit data within J.P. Morgan Chase which is extremely imbalanced in order to make a comprehensive comparison of these different approaches for the code revert prediction problem. @InProceedings{SDD23p1, author = {Yulong Pei and Salwa Alamir and Rares Dolga and Sameena Shah}, title = {Code Revert Prediction with Graph Neural Networks: A Case Study at J.P. Morgan Chase}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {1--5}, doi = {10.1145/3617572.3617879}, year = {2023}, } Publisher's Version |
|
Severin, Benedikt |
SDD '23: "Towards Assessing the Real-World ..."
Towards Assessing the Real-World Impact of Defects in Blockchain-Based Smart Contracts
Michael Hettmer, Benedikt Severin, Florian Blum, and Volker Gruhn (University of Duisburg-Essen, Germany) Blockchain-based smart contracts enable a wide range of use cases by relying on trustless and guaranteed code execution. The immutability of blockchains means that defects in smart contracts cannot be fixed once the code has been deployed. Previous research has focused on identifying defects and developing tools to improve defect detection. However, several existing blockchain defect datasets rely solely on expert knowledge or static code analysis. They do not provide an objective assessment of the real-world impact of defects. We show how blockchain transactions and execution traces can support an objective evaluation of the impact of a smart contract defect. As a foundation for future research, we propose three novel blockchain-specific metrics. Using the defect dataset SmartBugs, we demonstrate the computation of the metrics and the use cases that can benefit from them. This example focuses on five distinct types of the commonly occurring reentrancy vulnerability defect as detected by Slither, Manticore and Osiris. @InProceedings{SDD23p6, author = {Michael Hettmer and Benedikt Severin and Florian Blum and Volker Gruhn}, title = {Towards Assessing the Real-World Impact of Defects in Blockchain-Based Smart Contracts}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {6--10}, doi = {10.1145/3617572.3617880}, year = {2023}, } Publisher's Version Published Artifact Artifacts Available |
|
Shah, Sameena |
SDD '23: "Code Revert Prediction with ..."
Code Revert Prediction with Graph Neural Networks: A Case Study at J.P. Morgan Chase
Yulong Pei, Salwa Alamir, Rares Dolga, and Sameena Shah (JPMorgan AI Research, UK) Code revert prediction, a specialized form of software defect detection, aims to forecast or predict the likelihood of code changes being reverted or rolled back in software development. This task is very important in practice because by identifying code changes that are more prone to being reverted, developers and project managers can proactively take measures to prevent issues, improve code quality, and optimize development processes. However, compared to code defect detection, code revert prediction has been rarely studied in previous research. Additionally, many previous methods for code defect detection relied on independent features but ignored relationships between code scripts. Moreover, new challenges are introduced due to constraints in an industry setting such as company regulation, limited features and large-scale codebase. To overcome these limitations, this paper presents a systematic empirical study for code revert prediction that integrates the code import graph with code features. Different strategies to address anomalies and data imbalance have been implemented including graph neural networks with imbalance classification and anomaly detection. We conduct the experiments on real-world code commit data within J.P. Morgan Chase which is extremely imbalanced in order to make a comprehensive comparison of these different approaches for the code revert prediction problem. @InProceedings{SDD23p1, author = {Yulong Pei and Salwa Alamir and Rares Dolga and Sameena Shah}, title = {Code Revert Prediction with Graph Neural Networks: A Case Study at J.P. Morgan Chase}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {1--5}, doi = {10.1145/3617572.3617879}, year = {2023}, } Publisher's Version SDD '23: "Log Summarisation for Defect ..." Log Summarisation for Defect Evolution Analysis Rares Dolga, Ran Zmigrod, Rui Silva, Salwa Alamir, and Sameena Shah (JPMorgan Chase, UK) Log analysis and monitoring are essential aspects in software maintenance and identifying defects. In particular, the temporal nature and vast size of log data leads to an interesting and important research question: How can logs be summarised and monitored over time? While this has been a fundamental topic of research in the software engineering community, work has typically focused on heuristic-, syntax-, or static-based methods. In this work, we suggest an online semantic-based clustering approach to error logs that dynamically updates the log clusters to enable monitoring code error life-cycles. We also introduce a novel metric to evaluate the performance of temporal log clusters. We test our system and evaluation metric with an industrial dataset and find that our solution outperforms similar systems. We hope that our work encourages further temporal exploration in defect datasets. @InProceedings{SDD23p11, author = {Rares Dolga and Ran Zmigrod and Rui Silva and Salwa Alamir and Sameena Shah}, title = {Log Summarisation for Defect Evolution Analysis}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {11--16}, doi = {10.1145/3617572.3617881}, year = {2023}, } Publisher's Version |
|
Silva, Rui |
SDD '23: "Log Summarisation for Defect ..."
Log Summarisation for Defect Evolution Analysis
Rares Dolga, Ran Zmigrod, Rui Silva, Salwa Alamir, and Sameena Shah (JPMorgan Chase, UK) Log analysis and monitoring are essential aspects in software maintenance and identifying defects. In particular, the temporal nature and vast size of log data leads to an interesting and important research question: How can logs be summarised and monitored over time? While this has been a fundamental topic of research in the software engineering community, work has typically focused on heuristic-, syntax-, or static-based methods. In this work, we suggest an online semantic-based clustering approach to error logs that dynamically updates the log clusters to enable monitoring code error life-cycles. We also introduce a novel metric to evaluate the performance of temporal log clusters. We test our system and evaluation metric with an industrial dataset and find that our solution outperforms similar systems. We hope that our work encourages further temporal exploration in defect datasets. @InProceedings{SDD23p11, author = {Rares Dolga and Ran Zmigrod and Rui Silva and Salwa Alamir and Sameena Shah}, title = {Log Summarisation for Defect Evolution Analysis}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {11--16}, doi = {10.1145/3617572.3617881}, year = {2023}, } Publisher's Version |
|
Zmigrod, Ran |
SDD '23: "Log Summarisation for Defect ..."
Log Summarisation for Defect Evolution Analysis
Rares Dolga, Ran Zmigrod, Rui Silva, Salwa Alamir, and Sameena Shah (JPMorgan Chase, UK) Log analysis and monitoring are essential aspects in software maintenance and identifying defects. In particular, the temporal nature and vast size of log data leads to an interesting and important research question: How can logs be summarised and monitored over time? While this has been a fundamental topic of research in the software engineering community, work has typically focused on heuristic-, syntax-, or static-based methods. In this work, we suggest an online semantic-based clustering approach to error logs that dynamically updates the log clusters to enable monitoring code error life-cycles. We also introduce a novel metric to evaluate the performance of temporal log clusters. We test our system and evaluation metric with an industrial dataset and find that our solution outperforms similar systems. We hope that our work encourages further temporal exploration in defect datasets. @InProceedings{SDD23p11, author = {Rares Dolga and Ran Zmigrod and Rui Silva and Salwa Alamir and Sameena Shah}, title = {Log Summarisation for Defect Evolution Analysis}, booktitle = {Proc.\ SDD}, publisher = {ACM}, pages = {11--16}, doi = {10.1145/3617572.3617881}, year = {2023}, } Publisher's Version |
10 authors
proc time: 2.99