李政亮,陈翔,蒋智威,顾庆.基于信息检索的软件缺陷定位方法综述.软件学报,2021,32(2):0 |
基于信息检索的软件缺陷定位方法综述 |
Survey of Information Retrieval-based Software Bug Localization |
投稿时间:2020-01-16 修订日期:2020-04-05 |
DOI:10.13328/j.cnki.jos.006130 |
中文关键词: 软件维护 软件缺陷定位 信息检索 缺陷报告 程序模块 |
英文关键词:software maintenance software bug localization information retrieval bug report program module |
基金项目:国家自然科学基金(61972192,61202006,61906085,41972111);第二次青藏高原综合科学考察研究(2019QZKK0204);南京大学计算机软件新技术国家重点实验室开放课题(KFKT2019B14,KFKT2018B17) |
|
摘要点击次数: 830 |
全文下载次数: 794 |
中文摘要: |
基于信息检索的软件缺陷定位方法是当前软件缺陷定位领域中的一个研究热点.该方法主要分析缺陷报告文本和程序模块代码,通过计算缺陷报告和程序模块间的相似度,选取与缺陷报告相似度最高的若干程序模块,将其推荐给开发人员.本文对近些年国内外研究人员在该综述主题上取得的成果进行了系统的梳理和总结.首先,给出研究框架并阐述影响方法性能的三个重要因素:数据源、检索模型和场景应用;其次,依次对这三个影响因素的已有研究成果进行总结;然后,总结基于信息检索的软件缺陷定位研究中常用的性能评测指标和评测数据集;最后总结全文并对未来值得关注的研究方向进行展望. |
英文摘要: |
Information retrieval-based software bug localization is an active research topic in the domain of software fault localization. The phases of bug localization include analyzing the contents of the bug reports and program modules, calculating the similarity between the bug reports and program modules and recommending the most similar program modules to developers when given a bug report. This paper offers a systematic survey of existing research achievements of the domestic and foreign researchers in recent years. First, a research framework is proposed and three key factors (i.e. data sources, retrieval model, and application scenario) which influence the performance of bug localization are identified. Next, existing research achievements in these three key factors are discussed in sequence. Then, the performance evaluation metrics and evaluation datasets commonly used in information retrieval-based bug localization are summarized. Finally conclusions of this paper are made and a perspective of the future work in this research area is discussed. |
HTML 下载PDF全文 查看/发表评论 下载PDF阅读器 |