SZZ误标变更对移动APP即时缺陷预测性能和解释的影响

doi:10.13328/j.cnki.jos.007297

微信服务号

微信订阅号

2025年5月1日 21:56 星期四

首页 > 过刊浏览>年第卷第期 >1-32. DOI:10.13328/j.cnki.jos.007297

PDF HTML阅读 XML下载导出引用引用提醒

SZZ误标变更对移动APP即时缺陷预测性能和解释的影响
DOI:
                        10.13328/j.cnki.jos.007297
                    
CSTR:
                        
                    
作者:
                        李志强李志强
陕西师范大学 计算机科学学院, 陕西 西安 710119
在期刊界中查找
在百度中查找
在本站中查找
马睿马睿
陕西师范大学 计算机科学学院, 陕西 西安 710119
在期刊界中查找
在百度中查找
在本站中查找
张洪宇张洪宇
重庆大学 大数据与软件学院, 重庆 401331
在期刊界中查找
在百度中查找
在本站中查找
荆晓远荆晓远
武汉大学 计算机学院, 湖北 武汉 430072;广东石油化工学院 计算机学院, 广东 茂名 525011
在期刊界中查找
在百度中查找
在本站中查找
任杰任杰
陕西师范大学 计算机科学学院, 陕西 西安 710119
在期刊界中查找
在百度中查找
在本站中查找
刘金会刘金会
西北工业大学 网络空间安全学院, 陕西 西安 710072
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:TP311
基金项目:国家自然科学基金(61902228, 62176069, U23A20302); 陕西省自然科学基础研究计划(2024JC-YBMS-497); 陕西省重点研发计划 (2023-YBGY-265)

Impact of Mislabeled Changes by SZZ on Performance and Interpretation of Just-in-time Defect Prediction for Mobile APP

Author:

LI Zhi-Qiang
LI Zhi-Qiang
School of Computer Science, Shaanxi Normal University, Xi’an 710119, China
在期刊界中查找
在百度中查找
在本站中查找
MA Rui
MA Rui
School of Computer Science, Shaanxi Normal University, Xi’an 710119, China
在期刊界中查找
在百度中查找
在本站中查找
ZHANG Hong-Yu
ZHANG Hong-Yu
School of Big Data and Software Engineering, Chongqing University, Chongqing 401331, China
在期刊界中查找
在百度中查找
在本站中查找
JING Xiao-Yuan
JING Xiao-Yuan
School of Computer Science, Wuhan University, Wuhan 430072, China;School of Computer, Guangdong University of Petrochemical Technology, Maoming 525011, China
在期刊界中查找
在百度中查找
在本站中查找
REN Jie
REN Jie
School of Computer Science, Shaanxi Normal University, Xi’an 710119, China
在期刊界中查找
在百度中查找
在本站中查找
LIU Jin-Hui
LIU Jin-Hui
School of Cybersecurity, Northwestern Polytechnical University, Xi’an 710072, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

近年来, SZZ作为一种识别引入缺陷的变更算法, 被广泛应用于即时软件缺陷预测技术中. 先前的研究表明, SZZ算法在对数据进行标注时会存在误标问题, 这将影响数据集的质量, 进而影响预测模型的性能. 因此, 研究人员对SZZ算法进行了改进, 并提出多个SZZ变体. 然而, 目前尚未有文献研究数据标注质量对移动APP即时缺陷预测性能和解释的影响. 为探究SZZ错误标注的变更对移动APP即时软件缺陷预测模型的影响, 对4种SZZ算法进行广泛而深入的实证研究. 首先, 选取GitHub库中17个大型移动APP项目, 借助PyDriller工具抽取软件度量元. 其次, 采用B-SZZ (原始SZZ版本)、AG-SZZ、MA-SZZ和RA-SZZ这4种算法标注数据. 然后, 根据时间序列划分数据, 利用随机森林、朴素贝叶斯和逻辑回归分类器分别建立即时缺陷预测模型. 最后, 使用AUC、MCC、G-mean传统指标和F-measure@20%、IFA工作量感知指标评估模型性能, 并使用SKESD和SHAP算法对结果进行统计显著性检验与可解释性分析. 通过对比4种SZZ算法的标注性能, 研究发现: (1) 数据的标注质量符合SZZ变体之间的递进关系; (2) B-SZZ、AG-SZZ 和MA-SZZ错误标注的变更会造成AUC、MCC 得分不同程度的下降, 但不会造成G-mean得分下降; (3) B-SZZ会造成F-measure@20%得分下降, 而在代码审查时, B-SZZ、AG-SZZ 和MA-SZZ不会导致审查工作量的增加; (4)在模型解释方面, 不同SZZ算法会影响预测过程中贡献程度排名前3的度量元, 并且la度量元对预测结果有重要影响.

关键词:即时软件缺陷预测;移动APP;SZZ算法;挖掘软件存储库;可解释性;工作量感知;实证软件工程

Abstract:

In recent years, as an algorithm for identifying bug-introducing changes, SZZ has been widely employed in just-in-time software defect prediction. Previous studies show that the SZZ algorithm may mislabel data during data annotation, which could influence the dataset quality and consequently the performance of the defect prediction model. Therefore, researchers have made improvements to the SZZ algorithm and proposed multiple variants of SZZ. However, there is no empirical study to explore the effect of data annotation quality by SZZ on the performance and interpretability of just-in-time defect prediction for mobile APP. To investigate the influence of mislabeled changes by SZZ on just-in-time defect prediction for mobile APP, this study conducts an extensive and in-depth empirical comparison of four SZZ algorithms. Firstly, 17 large-scale mobile APP projects are selected from the GitHub repository, and software metrics are extracted by adopting the PyDriller tool. Then, B-SZZ (original SZZ), AG-SZZ, MA-SZZ, and RA-SZZ are employed for data annotation. Then, the just-in-time defect prediction models are built with random forest, naive Bayes, and logistic regression classifiers based on the time-series data partitioning. Finally, the performance of the models is evaluated by traditional measures of AUC, MCC, and G-mean, and effort-aware measures of F-measure@20% and IFA, and a statistical significance test and interpretability analysis are conducted on the results by employing SKESD and SHAP respectively. By comparing the annotation performance of the four SZZ algorithms, the results are as follows. (1) The data annotation quality conforms to the progressive relationship among SZZ variants. (2) The mislabeled changes by B-SZZ, AG-SZZ, and MA-SZZ can cause performance reduction of AUC and MCC of different levels, but cannot lead to performance reduction of G-mean. (3) B-SZZ is likely to cause a performance reduction of F-measure@20%, while B-SZZ, AG-SZZ, and MA-SZZ are unlikely to increase effort during code inspection. (4) In terms of model interpretation, different SZZ algorithms will influence the three metrics with the largest contribution during the prediction, and the la metric has a significant influence on the prediction results.

Key words:just-in-time software defect prediction;mobile APP;SZZ method;mining software repository;interpretability;effort aware;empirical software engineering

引用本文

李志强,马睿,张洪宇,荆晓远,任杰,刘金会. SZZ误标变更对移动APP即时缺陷预测性能和解释的影响.软件学报,,():1-32

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-09-28
最后修改日期:2023-12-26
录用日期:
在线发布日期: 2025-02-19
出版日期:

微信服务号

微信订阅号

引用本文

分享

文章指标

历史

文章二维码

微信服务号

微信订阅号

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码