Bug Triaging Approach Based on Recurrent Neural Networks
Author:
Affiliation:

Fund Project:

National Key Research and Development Program of China (2016YFB1000802); National Natural Science Foundation of China (61702252, 61672274)

  • Article
  • | |
  • Metrics
  • |
  • Reference [27]
  • |
  • Related
  • |
  • Cited by
  • | |
  • Comments
    Abstract:

    With the increasing size of open source software projects, assigning suitable developers for bug reports (i.e., bug triaging) is becoming more and more difficult. Moreover, the efficiency of bug repairing will likely be reduced if the bugs are assigned to inappropriate developers. Therefore, it is necessary to provide an automatic bug triaging technique for the project managers to better assign bug reports. Existing work for this task mainly focuses on analyzing the text and metadata in bug reports to characterize the relationships between developers and bug reports, while the active level of developers is largely ignored. A shortcoming of these methods is that they may lead to poor performance when developers with different active levels have similar characteristics. This paper proposes a learning model named DeepTriage based on the recurrent neural networks. On the one hand, the ordered natural language text in bug reports is mapped into high-level features by a bidirectional RNN. On the other hand, developer's active level is extracted and transformed into high-level features through a single directional RNN. Then, the features of text and developer's active level are combined and learned from bug reports with known fixers. Experimental results on four different open-source data sets (e.g., Eclipse) show that DeepTriage has significantly improved the accuracy of bug triaging compared with existing work.

    Reference
    [1] Xia X, Lo D, Ding Y, Al-Kofahi JM, Nguyen TN, Wang XY. Improving automated bug triaging with specialized topic model. IEEE Trans. on Software Engineering, 2017,43(3):272-297.
    [2] Čubranić D, Murphy GC. Automatic bug triage using text categorization. In:Proc. of the 16th Int'l Conf. on Software Engineering & Knowledge Engineering. DBLP, 2004. 92-97.
    [3] Anvik J, Hiew L, Murphy GC. Who should fix this bug? In:Proc. of the Int'l Conf. on Software Engineering. DBLP, 2006. 361-370.
    [4] Yang G, Zhang T, Lee B. Towards semi-automatic bug triage and severity prediction based on topic model and multi-feature of bug reports. In:Proc. of the Computer Software and Applications Conf. IEEE, 2014. 97-106.
    [5] Somasundaram K, Murphy GC. Automatic categorization of bug reports using latent Dirichlet allocation. In:Proc. of the India Software Engineering Conf. ACM Press, 2012. 125-130.
    [6] Johnson R, Zhang T. Supervised and semi-supervised text categorization using LSTM for region embeddings. In:Proc. of the 33rd Int'l Conf. on Machine Learning. 2016. 526-534.
    [7] Eclipse bugzilla. 2017. https://bugs.eclipse.org/bugs/
    [8] Netbeans bugzilla. 2017. https://netbeans.org/bugzilla/
    [9] Mozilla bugzilla. 2017. https://bugzilla.mozilla.org/
    [10] Redhat bugzilla. 2017. https://partner-bugzilla.redhat.com/
    [11] Tamrawi A, Nguyen TT, Al-Kofahi JM, Nguyen TN. Fuzzy set and cache-based approach for bug triaging. In:Proc. of the ACM Sigsoft Symp. on the Foundations of Software Engineering. DBLP, 2011. 365-375.
    [12] Jeong G, Kim S, Zimmermann T. Improving bug triage with bug tossing graphs. In:Proc. of the Joint Meeting of the European Software Engineering Conf. and the ACM Sigsoft Symp. on the Foundations of Software Engineering. ACM Press, 2009. 111-120.
    [13] Bhattacharya P, Neamtiu I. Fine-Grained incremental learning and multi-feature tossing graphs to improve bug triaging. In:Proc. of the IEEE Int'l Conf. on Software Maintenance. IEEE, 2010. 1-10.
    [14] Naguib H, Narayan N, Brügge B, Helal D. Bug report assignee recommendation using activity profiles. In:Proc. of the Mining Software Repositories. IEEE, 2013. 22-30.
    [15] Huzefa K, Malcom G, Denys P, Maen H. Assigning change requests to software developers. Journal of Software Maintenance & Evolution Research & Practice, 2012,24(1):3-33.
    [16] Linares-Vasquez M, Hossen K, Dang H, Kagdi H, Gethers M, Poshyvanyk D. Triaging incoming change requests:Bug or commit history, or code authorship? In:Proc. of the IEEE Int'l Conf. on Software Maintenance. IEEE, 2013. 451-460.
    [17] Shokripour R, Anvik J, Kasirun ZM, Zamani S. Why so complicated? Simple term filtering and weighting for location-based bug report assignment recommendation. In:Proc. of the Mining Software Repositories. IEEE, 2013. 2-11.
    [18] Graves A. Long short-term memory. In:Proc. of the Supervised Sequence Labelling with Recurrent Neural Networks. Berlin, Heidelberg:Springer-Verlag, 2012. 1735-1780.
    [19] Mikolov T, Karafiát M, Burget L, Cernocký J, Khudanpur S. Recurrent neural network based language model. In:Proc. of the Conf. of the Int'l Speech Communication Association. DBLP, 2010. 1045-1048.
    [20] Dai AM, Le QV. Semi-Supervised Sequence Learning. 2015. 3079-3087.
    [21] Johnson R, Zhang T. Effective use of word order for text categorization with convolutional neural networks. Eprint Arxiv, ArXiv:1412.1058, 2014.
    [22] Allamanis M, Tarlow D, Gordon A, Wei Y. Bimodal modelling of source code and natural language. In:Proc. of the Int'l Conf. on Machine Learning. JMLR.org, 2015. 2123-2132.
    [23] Kingma DP, Ba J. Adam:A method for stochastic optimization. In:Proc. of the 3rd Int'l Conf. for Learning Representations. 2015.
    [24] Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR. Improving neural networks by preventing co-adaptation of feature detectors. Computer Science, 2012,3(4):212-223.
    [25] Chang CC, Lin CJ. LIBSVM:A library for support vector machines. ACM Trans. on Intelligent Systems & Technology, 2011,2(3):1-27.
    [26] Asuncion A, Welling M, Smyth P, Teh YW. On smoothing and inference for topic models. In:Proc. of the Conf. on Uncertainty in Artificial Intelligence. AUAI Press, 2009. 27-34.
    [27] Griffiths TL, Steyvers M. Finding scientific topics. Proc. of the National Academy of Sciences of the United States of America, 2004,101(Suppl.1):5228-5235.
    Related
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

席圣渠,姚远,徐锋,吕建.基于循环神经网络的缺陷报告分派方法.软件学报,2018,29(8):2322-2335

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:July 18,2017
  • Revised:September 28,2017
  • Online: March 13,2018
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063