Bug Triaging Approach Based on Recurrent Neural Networks

doi:10.13328/j.cnki.jos.005532

微信服务号

微信订阅号

2025-4-6- 8

Home > Archive>Volume 29, Issue 8, 2018 >2322-2335. DOI:10.13328/j.cnki.jos.005532

PDF HTML XML Export Cite reminder

Bug Triaging Approach Based on Recurrent Neural Networks
DOI:
                        10.13328/j.cnki.jos.005532
                    
Author:
                        XI Sheng-QuXI Sheng-Qu
Department of Computer Science and Technology, Nanjing University, Nanjing 210023, China;State Key Laboratory for Novel Software Technology(Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
YAO YuanYAO Yuan
Department of Computer Science and Technology, Nanjing University, Nanjing 210023, China;State Key Laboratory for Novel Software Technology(Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
XU FengXU Feng
Department of Computer Science and Technology, Nanjing University, Nanjing 210023, China;State Key Laboratory for Novel Software Technology(Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
LÜ JianLÜ Jian
Department of Computer Science and Technology, Nanjing University, Nanjing 210023, China;State Key Laboratory for Novel Software Technology(Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:National Key Research and Development Program of China (2016YFB1000802); National Natural Science Foundation of China (61702252, 61672274)

Article

Figures

Metrics

Reference [27]

Cited by

Materials

Comments

Abstract:

With the increasing size of open source software projects, assigning suitable developers for bug reports (i.e., bug triaging) is becoming more and more difficult. Moreover, the efficiency of bug repairing will likely be reduced if the bugs are assigned to inappropriate developers. Therefore, it is necessary to provide an automatic bug triaging technique for the project managers to better assign bug reports. Existing work for this task mainly focuses on analyzing the text and metadata in bug reports to characterize the relationships between developers and bug reports, while the active level of developers is largely ignored. A shortcoming of these methods is that they may lead to poor performance when developers with different active levels have similar characteristics. This paper proposes a learning model named DeepTriage based on the recurrent neural networks. On the one hand, the ordered natural language text in bug reports is mapped into high-level features by a bidirectional RNN. On the other hand, developer's active level is extracted and transformed into high-level features through a single directional RNN. Then, the features of text and developer's active level are combined and learned from bug reports with known fixers. Experimental results on four different open-source data sets (e.g., Eclipse) show that DeepTriage has significantly improved the accuracy of bug triaging compared with existing work.

Key words:bug triaging;recurrent neural network;deep learning

Reference

[1] Xia X, Lo D, Ding Y, Al-Kofahi JM, Nguyen TN, Wang XY. Improving automated bug triaging with specialized topic model. IEEE Trans. on Software Engineering, 2017,43(3):272-297.

[2] Čubranić D, Murphy GC. Automatic bug triage using text categorization. In:Proc. of the 16th Int'l Conf. on Software Engineering & Knowledge Engineering. DBLP, 2004. 92-97.

[3] Anvik J, Hiew L, Murphy GC. Who should fix this bug? In:Proc. of the Int'l Conf. on Software Engineering. DBLP, 2006. 361-370.

[4] Yang G, Zhang T, Lee B. Towards semi-automatic bug triage and severity prediction based on topic model and multi-feature of bug reports. In:Proc. of the Computer Software and Applications Conf. IEEE, 2014. 97-106.

[5] Somasundaram K, Murphy GC. Automatic categorization of bug reports using latent Dirichlet allocation. In:Proc. of the India Software Engineering Conf. ACM Press, 2012. 125-130.

[6] Johnson R, Zhang T. Supervised and semi-supervised text categorization using LSTM for region embeddings. In:Proc. of the 33rd Int'l Conf. on Machine Learning. 2016. 526-534.

[7] Eclipse bugzilla. 2017. https://bugs.eclipse.org/bugs/

[8] Netbeans bugzilla. 2017. https://netbeans.org/bugzilla/

[9] Mozilla bugzilla. 2017. https://bugzilla.mozilla.org/

[10] Redhat bugzilla. 2017. https://partner-bugzilla.redhat.com/

[11] Tamrawi A, Nguyen TT, Al-Kofahi JM, Nguyen TN. Fuzzy set and cache-based approach for bug triaging. In:Proc. of the ACM Sigsoft Symp. on the Foundations of Software Engineering. DBLP, 2011. 365-375.

[12] Jeong G, Kim S, Zimmermann T. Improving bug triage with bug tossing graphs. In:Proc. of the Joint Meeting of the European Software Engineering Conf. and the ACM Sigsoft Symp. on the Foundations of Software Engineering. ACM Press, 2009. 111-120.

[13] Bhattacharya P, Neamtiu I. Fine-Grained incremental learning and multi-feature tossing graphs to improve bug triaging. In:Proc. of the IEEE Int'l Conf. on Software Maintenance. IEEE, 2010. 1-10.

[14] Naguib H, Narayan N, Brügge B, Helal D. Bug report assignee recommendation using activity profiles. In:Proc. of the Mining Software Repositories. IEEE, 2013. 22-30.

[15] Huzefa K, Malcom G, Denys P, Maen H. Assigning change requests to software developers. Journal of Software Maintenance & Evolution Research & Practice, 2012,24(1):3-33.

[16] Linares-Vasquez M, Hossen K, Dang H, Kagdi H, Gethers M, Poshyvanyk D. Triaging incoming change requests:Bug or commit history, or code authorship? In:Proc. of the IEEE Int'l Conf. on Software Maintenance. IEEE, 2013. 451-460.

[17] Shokripour R, Anvik J, Kasirun ZM, Zamani S. Why so complicated? Simple term filtering and weighting for location-based bug report assignment recommendation. In:Proc. of the Mining Software Repositories. IEEE, 2013. 2-11.

[18] Graves A. Long short-term memory. In:Proc. of the Supervised Sequence Labelling with Recurrent Neural Networks. Berlin, Heidelberg:Springer-Verlag, 2012. 1735-1780.

[19] Mikolov T, Karafiát M, Burget L, Cernocký J, Khudanpur S. Recurrent neural network based language model. In:Proc. of the Conf. of the Int'l Speech Communication Association. DBLP, 2010. 1045-1048.

[20] Dai AM, Le QV. Semi-Supervised Sequence Learning. 2015. 3079-3087.

[21] Johnson R, Zhang T. Effective use of word order for text categorization with convolutional neural networks. Eprint Arxiv, ArXiv:1412.1058, 2014.

[22] Allamanis M, Tarlow D, Gordon A, Wei Y. Bimodal modelling of source code and natural language. In:Proc. of the Int'l Conf. on Machine Learning. JMLR.org, 2015. 2123-2132.

[23] Kingma DP, Ba J. Adam:A method for stochastic optimization. In:Proc. of the 3rd Int'l Conf. for Learning Representations. 2015.

[24] Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR. Improving neural networks by preventing co-adaptation of feature detectors. Computer Science, 2012,3(4):212-223.

[25] Chang CC, Lin CJ. LIBSVM:A library for support vector machines. ACM Trans. on Intelligent Systems & Technology, 2011,2(3):1-27.

[26] Asuncion A, Welling M, Smyth P, Teh YW. On smoothing and inference for topic models. In:Proc. of the Conf. on Uncertainty in Artificial Intelligence. AUAI Press, 2009. 27-34.

[27] Griffiths TL, Steyvers M. Finding scientific topics. Proc. of the National Academy of Sciences of the United States of America, 2004,101(Suppl.1):5228-5235.

Get Citation

席圣渠,姚远,徐锋,吕建.基于循环神经网络的缺陷报告分派方法.软件学报,2018,29(8):2322-2335

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:July 18,2017
Revised:September 28,2017
Adopted:
Online: March 13,2018
Published:

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History