Software Bug Location Method Combining Information Retrieval and Deep Model Features

doi:10.13328/j.cnki.jos.007111

微信服务号

微信订阅号

2025-5-15- 14

Home > Archive>Volume 35, Issue 7, 2024 >3245-3264. DOI:10.13328/j.cnki.jos.007111

PDF HTML XML Export Cite reminder

Software Bug Location Method Combining Information Retrieval and Deep Model Features
DOI:
                        10.13328/j.cnki.jos.007111
                    
Author:
                        SHEN Zong-WenSHEN Zong-Wen
National Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
NIU Fei-FeiNIU Fei-Fei
National Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
LI Chuan-YiLI Chuan-Yi
National Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
CHEN XiangCHEN Xiang
School of Information Science and Technology, Nantong University, Nantong 226019, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
LI QiLI Qi
National Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
GE Ji-DongGE Ji-Dong
National Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
LUO BinLUO Bin
National Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Automated bug localization methods can accelerate the process of programmers locating complex software system defects using bug reports. Early researchers treated bug localization as a retrieval task, constructing defect features by analyzing bug reports and related code, and applying information retrieval techniques for bug localization. With the development of deep learning, bug localization methods utilizing deep model features have also achieved certain effectiveness. Nevertheless, existing deep learning-based bug localization research methods suffer from experimental search space mismatching real-world scenarios due to the high time and resource costs of deep model training. These research methods do not consider all the files in the project as the search space during testing; they only search for code related to marked defects, such as the DNNLOC method, DreamLoc method, and DeepLocator method. This approach is inconsistent with the actual search scenario for programmers to localize real bug. In order to simulate the real-world scenario of bug localization, this study proposes the TosLoc method, which combines information retrieval and deep model features for bug localization. Firstly, information retrieval is employed to retrieve all source codes of real projects to ensure comprehensive utilization of existing features. Subsequently, deep models are utilized to extract semantics from source codes and bug reports. The TosLoc method achieves rapid localization of all code in a single project through two-stage retrieval. Experimental results conducted on four popular Java projects demonstrate that the proposed TosLoc method outperforms existing benchmark methods in terms of retrieval speed and accuracy. Compared to the best method called DreamLoc, the TosLoc method achieves an average MRR improvement of 2.5% and an average MAP improvement of 6.0% while only requiring 35% of the retrieval time of the DreamLoc method.

Key words:bug location;bug report;informational retrieval;deep learning;search space

Get Citation

申宗汶,牛菲菲,李传艺,陈翔,李奇,葛季栋,骆斌.融合信息检索和深度模型特征的软件缺陷定位方法.软件学报,2024,35(7):3245-3264

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:September 11,2023
Revised:October 30,2023
Adopted:
Online: January 05,2024
Published: July 06,2024

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History