Impact of Mislabeled Changes by SZZ on Performance and Interpretation of Just-in-time Defect Prediction for Mobile APP

doi:10.13328/j.cnki.jos.007297


Authors:
LI Zhi-Qiang
School of Computer Science, Shaanxi Normal University, Xi’an 710119, China
MA Rui
School of Computer Science, Shaanxi Normal University, Xi’an 710119, China
ZHANG Hong-Yu
School of Big Data and Software Engineering, Chongqing University, Chongqing 401331, China
JING Xiao-Yuan
School of Computer Science, Wuhan University, Wuhan 430072, China; School of Computer, Guangdong University of Petrochemical Technology, Maoming 525011, China
REN Jie
School of Computer Science, Shaanxi Normal University, Xi’an 710119, China
LIU Jin-Hui
School of Cybersecurity, Northwestern Polytechnical University, Xi’an 710072, China

                    
CLC Number: TP311


Abstract:

In recent years, SZZ, an algorithm for identifying bug-introducing changes, has been widely used in just-in-time software defect prediction. Previous studies show that SZZ may mislabel changes during data annotation, which can degrade dataset quality and, in turn, the performance of defect prediction models. Researchers have therefore improved the algorithm and proposed multiple SZZ variants. However, no empirical study has examined how the quality of SZZ data annotation affects the performance and interpretability of just-in-time defect prediction for mobile APP. To investigate the influence of changes mislabeled by SZZ on just-in-time defect prediction for mobile APP, this study conducts an extensive empirical comparison of four SZZ algorithms. First, 17 large-scale mobile APP projects are selected from GitHub, and software metrics are extracted with the PyDriller tool. Second, B-SZZ (the original SZZ), AG-SZZ, MA-SZZ, and RA-SZZ are used for data annotation. Third, just-in-time defect prediction models are built with random forest, naive Bayes, and logistic regression classifiers on time-series data partitions. Finally, the models are evaluated with the traditional measures AUC, MCC, and G-mean and the effort-aware measures F-measure@20% and IFA, and the results are subjected to a statistical significance test with SKESD and an interpretability analysis with SHAP. Comparing the annotation performance of the four SZZ algorithms yields the following results. (1) The data annotation quality follows the progressive relationship among the SZZ variants. (2) Changes mislabeled by B-SZZ, AG-SZZ, and MA-SZZ reduce AUC and MCC to varying degrees but do not reduce G-mean. (3) B-SZZ is likely to reduce F-measure@20%, while B-SZZ, AG-SZZ, and MA-SZZ are unlikely to increase the effort required during code inspection. (4) In terms of model interpretation, the choice of SZZ algorithm affects which three metrics contribute most to the prediction, and the la metric has a significant influence on the prediction results.
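
The measures MCC, G-mean, and IFA named in the abstract can be illustrated with a minimal sketch. It assumes binary labels (1 for a bug-introducing change, 0 for a clean change), the standard confusion-matrix definitions of MCC and G-mean, and IFA counted as the number of clean changes ranked ahead of the first truly bug-introducing change when changes are sorted by predicted score. The function names and the ranking-by-score convention are illustrative assumptions, not the authors' implementation; the study may rank changes differently.

```python
import math


def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary labels, where 1 = bug-introducing."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def mcc(y_true, y_pred):
    """Matthews correlation coefficient; 0.0 when the denominator is zero."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


def g_mean(y_true, y_pred):
    """Geometric mean of recall (true positive rate) and specificity."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(recall * specificity)


def ifa(y_true, scores):
    """Initial false alarms: clean changes inspected before the first
    bug-introducing one, inspecting in descending order of predicted score
    (an assumed ranking convention)."""
    ranked = sorted(zip(scores, y_true), key=lambda pair: pair[0], reverse=True)
    false_alarms = 0
    for _, label in ranked:
        if label == 1:
            return false_alarms
        false_alarms += 1
    return false_alarms  # no bug-introducing change in the ranking


# Hypothetical toy data: four changes, two of them bug-introducing.
labels = [1, 0, 1, 0]
predictions = [1, 0, 0, 0]
risk_scores = [0.2, 0.9, 0.8, 0.1]
print(mcc(labels, predictions), g_mean(labels, predictions), ifa(labels, risk_scores))
```

Because G-mean combines recall and specificity symmetrically while MCC uses all four confusion-matrix cells, the two measures can respond differently to the same label noise.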

Keywords: just-in-time software defect prediction; mobile APP; SZZ method; mining software repository; interpretability; effort aware; empirical software engineering

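Citation:

LI Zhi-Qiang, MA Rui, ZHANG Hong-Yu, JING Xiao-Yuan, REN Jie, LIU Jin-Hui. Impact of Mislabeled Changes by SZZ on Performance and Interpretation of Just-in-time Defect Prediction for Mobile APP. Journal of Software (Ruan Jian Xue Bao),,():1-32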


History

Received: September 28, 2023
Revised: December 26, 2023
Online: February 19, 2025

