基于大语言模型的模糊测试研究综述

doi:10.13328/j.cnki.jos.007323

微信服务号

微信订阅号

2025年6月15日 16:43 星期日

首页 > 过刊浏览>2025年第36卷第6期 >2404-2431. DOI:10.13328/j.cnki.jos.007323

PDF HTML阅读 XML下载导出引用引用提醒

基于大语言模型的模糊测试研究综述
DOI:
                        10.13328/j.cnki.jos.007323
                    
CSTR:
                        32375.14.jos.007323
                    
作者:
                        李岩李岩
中国科学技术大学 软件学院, 安徽 合肥 230026
在期刊界中查找
在百度中查找
在本站中查找
杨文章杨文章
中国科学技术大学 计算机科学与技术学院, 安徽 合肥 230027
在期刊界中查找
在百度中查找
在本站中查找
张翼张翼
中国科学技术大学 软件学院, 安徽 合肥 230026
在期刊界中查找
在百度中查找
在本站中查找
薛吟兴薛吟兴
中国科学技术大学 计算机科学与技术学院, 安徽 合肥 230027;中国科学技术大学 苏州高等研究院, 江苏 苏州 215123
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:薛吟兴,E-mail:yxxue@ustc.edu.cn
中图分类号:TP311
基金项目:国家自然科学基金(61972373)

Survey on Fuzzing Based on Large Language Model

Author:

LI Yan
LI Yan
School of Software Engineering, University of Science and Technology of China, Hefei 230026, China
在期刊界中查找
在百度中查找
在本站中查找
YANG Wen-Zhang
YANG Wen-Zhang
School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China
在期刊界中查找
在百度中查找
在本站中查找
ZHANG Yi
ZHANG Yi
School of Software Engineering, University of Science and Technology of China, Hefei 230026, China
在期刊界中查找
在百度中查找
在本站中查找
XUE Yin-Xing
XUE Yin-Xing
School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China;Suzhou Institute for Advanced Study, University of Science and Technology of China, Suzhou 215123, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

模糊测试是一种自动化的软件测试方法, 通过向目标软件系统输入大量自动生成的测试数据, 以发现系统潜在的安全漏洞、软件缺陷或异常行为. 然而, 传统模糊测试技术受限于自动化程度低、测试效率低、代码覆盖率低等因素, 无法应对现代的大型软件系统. 近年来, 大语言模型的迅猛发展不仅为自然语言处理领域带来重大突破, 也为模糊测试领域带来了新的自动化方案. 因此, 为了更好地提升模糊测试技术的效果, 现有的工作提出了多种结合大语言模型的模糊测试方法, 涵盖了测试输入生成、缺陷检测、后模糊处理等模块. 但是现有工作缺乏对基于大语言模型的模糊测试技术的系统性调研和梳理讨论, 为了填补上述综述方面的空白, 对现有的基于大语言模型的模糊测试技术的研究发展现状进行全面的分析和总结. 主要内容包括: (1)概述模糊测试的整体流程和模糊测试研究中常用的大语言模型相关技术; (2)讨论大模型时代之前的基于深度学习的模糊测试方法的局限性; (3)分析大语言模型在模糊测试方法中不同环节的应用方式; (4)探讨大语言模型技术在模糊测试中的主要挑战和今后可能的发展方向.

关键词:大语言模型;模糊测试;测试输入生成;缺陷检测;后模糊处理

Abstract:

Fuzzing, as an automated software testing method, aims to detect potential security vulnerabilities, software defects, or abnormal behaviors by inputting a large quantity of automatically generated test data into the target software system. However, traditional fuzzing techniques are restricted by such factors as low automation level, low testing efficiency, and low code coverage, being unable to handle modern large-scale software systems. In recent years, the rapid development of large language models has not only brought significant breakthroughs to the field of natural language processing but also introduced new automation solutions to the field of fuzzing. Therefore, to better enhance the effectiveness of fuzzing technology, existing works have proposed various fuzzing methods combined with large language models, covering modules like test input generation, defect detection, and post-fuzzing. Nevertheless, the existing works lack systematic investigation and discussion on fuzzing techniques based on large language models. To fill the above-mentioned gaps in the review, this study comprehensively analyzes and summarizes the current research and development status of fuzzing techniques based on large language models. The main contents include (1) summarizing the overall process of fuzzing and the relevant technologies related to large language models commonly used in fuzzing research; (2) discussing the limitations of deep learning based fuzzing methods before the era of large language model (LLM); (3) analyzing the application methods of large language models in different stages of fuzzing; (4) exploring the main challenges and possible future development directions of large language model technology in fuzzing.

Key words:large language model (LLM);fuzzing;test input generation;defect detection;post-fuzzing

引用本文

李岩,杨文章,张翼,薛吟兴.基于大语言模型的模糊测试研究综述.软件学报,2025,36(6):2404-2431

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-07-17
最后修改日期:2024-10-14
录用日期:
在线发布日期: 2024-12-10
出版日期:

微信服务号

微信订阅号

引用本文

相关视频

分享

文章指标

历史

文章二维码

微信服务号

微信订阅号

引用本文

相关视频

分享

微信扫一扫：分享

文章指标

历史

文章二维码