基于数据变异的神经网络测试用例选择方法

doi:10.13328/j.cnki.jos.007005

微信服务号

微信订阅号

2025年6月16日 13:41 星期一

首页 > 过刊浏览>2024年第35卷第11期 >4973-4992. DOI:10.13328/j.cnki.jos.007005

PDF HTML阅读 XML下载导出引用引用提醒

基于数据变异的神经网络测试用例选择方法
DOI:
                        10.13328/j.cnki.jos.007005
                    
CSTR:
                        32375.14.jos.007005
                    
作者:
                        曹雪洁曹雪洁
天津大学 智能与计算学部, 天津 300350
在期刊界中查找
在百度中查找
在本站中查找
陈俊洁陈俊洁
天津大学 智能与计算学部, 天津 300350
在期刊界中查找
在百度中查找
在本站中查找
闫明闫明
天津大学 智能与计算学部, 天津 300350
在期刊界中查找
在百度中查找
在本站中查找
尤翰墨尤翰墨
天津大学 智能与计算学部, 天津 300350
在期刊界中查找
在百度中查找
在本站中查找
吴卓吴卓
天津大学 新媒体与传播学院, 天津 300350
在期刊界中查找
在百度中查找
在本站中查找
王赞王赞
天津大学 智能与计算学部, 天津 300350;天津大学 新媒体与传播学院, 天津 300350
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:曹雪洁(2001－), 女, 硕士, 主要研究领域为深度学习测试. ;陈俊洁(1992－), 男, 博士, 副教授, 博士生导师, CCF高级会员, 主要研究领域为软件分析与测试. ;闫明(1996－), 男, 博士生, CCF学生会员, 主要研究领域为深度学习系统测试, 芯片设计程序测试. ;尤翰墨(1997－), 男, 博士生, CCF学生会员, 主要研究领域为软件测试, 深度学习系统测试. ;吴卓(1996－), 女, 博士生, CCF学生会员, 主要研究领域为软件测试. ;王赞(1979－), 男, 博士, CCF专业会员, 主要研究领域为软件测试, 深度学习.
通讯作者:陈俊洁, E-mail: junjiechen@tju.edu.cn
中图分类号:TP311
基金项目:国家自然科学基金(62002256)

Test Case Selection for Neural Network via Data Mutation

Author:

CAO Xue-Jie
CAO Xue-Jie
College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
在期刊界中查找
在百度中查找
在本站中查找
CHEN Jun-Jie
CHEN Jun-Jie
College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
在期刊界中查找
在百度中查找
在本站中查找
YAN Ming
YAN Ming
College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
在期刊界中查找
在百度中查找
在本站中查找
YOU Han-Mo
YOU Han-Mo
College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
在期刊界中查找
在百度中查找
在本站中查找
WU Zhuo
WU Zhuo
School of New Media and Communication, Tianjin University, Tianjin 300350, China
在期刊界中查找
在百度中查找
在本站中查找
WANG Zan
WANG Zan
College of Intelligence and Computing, Tianjin University, Tianjin 300350, China;School of New Media and Communication, Tianjin University, Tianjin 300350, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

深度神经网络目前已被广泛应用于自动驾驶、医疗诊断、语音识别、人脸识别等安全攸关领域, 因此深度神经网络测试对于保证其质量非常关键. 然而, 为判断DNN模型预测是否正确而对测试用例进行标注的成本很高. 因此, 筛选出能够揭示DNN模型错误行为的测试用例并优先对其进行标注, 能够尽快修复模型缺陷, 从而提升DNN测试的效率、保证DNN模型质量. 提出一种基于数据变异的测试用例选择方法DMS. 该方法设计并实现数据变异算子生成变异模型, 以模拟模型缺陷并捕获测试用例揭错时的动态模式, 从而评估测试用例的揭错能力. 在25个深度学习测试集和模型的组合上进行实验, 结果表明, 无论是筛选出的样本中揭错用例的比例还是揭错方向的多样性, DMS都要显著优于现有的测试用例选择方法. 具体来说, 以原始测试集作为候选集时, 在选择10%的测试用例时, DMS能够筛选出候选集中53.85%–99.22%的揭错用例, 在选择5%的测试用例时, DMS筛选出的测试用例已经几乎能覆盖所有的揭错方向. 相较于8种对比方法, DMS平均多找出12.38%–71.81%的揭错用例, 证明了DMS在测试用例选择任务中的显著有效性.

关键词:深度学习;软件测试;测试用例选择;数据变异

Abstract:

Nowadays, deep neural network (DNN) is widely used in autonomous driving, medical diagnosis, speech recognition, face recognition, and other safety-critical fields. Therefore, DNN testing is critical to ensure the quality of DNN. However, labeling test cases to judge whether the DNN model predictions are correct is costly. Therefore, selecting test cases that reveal incorrect behavior of DNN models and labeling them earlier can help developers debug DNN models as soon as possible, thus improving the efficiency of DNN testing and ensuring the quality of DNN models. This study proposes a test case selection method based on data mutation, namely DMS. In this method, a data mutation operator is designed and implemented to generate a mutation model to simulate model defects and capture the dynamic pattern of test case bug-revealing, so as to evaluate the ability of test case bug-revealing. Experiments are conducted on the combination of 25 deep learning test sets and models. The results show that DMS is significantly better than the existing test case selection methods in terms of both the proportion of bug-revealing and the diversity of bug-revealing directions in the selected samples. Specifically, taking the original test set as the candidate set, DMS can filter out 53.85%–99.22% of all bug-revealing test cases when selecting 10% of the test cases. Moreover, when 5% of the test cases are selected, the selected cases by DMS can cover almost all bug-revealing directions. Compared with the eight comparison methods, DMS finds 12.38%–71.81% more bug-revealing cases on average, which proves the significant effectiveness of DMS in the task of test case selection.

Key words:deep learning;software testing;test case selection;data mutation

引用本文

曹雪洁,陈俊洁,闫明,尤翰墨,吴卓,王赞.基于数据变异的神经网络测试用例选择方法.软件学报,2024,35(11):4973-4992

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2022-11-28
最后修改日期:2023-04-06
录用日期:
在线发布日期: 2023-11-29
出版日期: 2024-11-06

微信服务号

微信订阅号

引用本文

相关视频

分享

文章指标

历史

文章二维码

微信服务号

微信订阅号

引用本文

相关视频

分享

微信扫一扫：分享

文章指标

历史

文章二维码