Automatic Generation of Large-Granularity Pull Request Description

doi:10.13328/j.cnki.jos.006239


Journal of Software, Volume 32, Issue 6, 2021, pp. 1597-1611.


Author:
KUANG Li
School of Computer Science and Engineering, Central South University, Changsha 410083, China
SHI Ru-Yi
School of Computer Science and Engineering, Central South University, Changsha 410083, China
ZHAO Lei-Hao
School of Computer Science and Engineering, Central South University, Changsha 410083, China
ZHANG Huan
School of Computer Science and Engineering, Central South University, Changsha 410083, China
GAO Hong-Hao
School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China

Fund Project: National Key R&D Program of China (2018YFB1003800); National Natural Science Foundation of China (61772560)


Abstract:

On GitHub, many contributors neglect to write a description when submitting a pull request (PR), so their PRs are easily overlooked or rejected by reviewers. Generating PR descriptions automatically can therefore help raise the PR pass rate. The performance of existing PR description generation methods usually depends on PR granularity, and they struggle to generate effective descriptions for large-granularity PRs. This work therefore focuses on generating descriptions for large-granularity PRs. First, the text in a PR is preprocessed and a word-sentence heterogeneous graph is constructed in which words serve as secondary nodes, establishing connections between PR sentences. Next, features are extracted from the heterogeneous graph and fed into a graph neural network for further graph representation learning, so that sentence nodes learn richer content information through message passing between nodes. Finally, the sentences carrying key information are selected to form the PR description. Because the dataset lacks manually labeled tags, supervised learning cannot be used for training, so reinforcement learning is used to guide the generation of PR descriptions. The training objective is to minimize the negative expected reward, which does not require ground truth and directly improves the quality of the generated results. Experiments on a real dataset show that the proposed method outperforms existing methods in F1 and readability.
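
As an illustration of the word-sentence heterogeneous graph mentioned in the abstract, the following minimal Python sketch builds a bipartite graph in which sentences are linked only through the word nodes they share. It is not the authors' implementation: the function names, the regular-expression tokenizer, the stopword handling, and the use of raw term frequency as the edge weight are all assumptions made for this example.

```python
import re
from collections import defaultdict


def build_word_sentence_graph(sentences, stopwords=frozenset()):
    """Build a bipartite word-sentence graph (hypothetical helper).

    Returns:
        edges: dict mapping (word, sentence_index) to the word's term
            frequency in that sentence (the edge weight is an assumption).
        word_to_sentences: dict mapping each word node to the set of
            sentence indices it is connected to.
    """
    edges = {}
    word_to_sentences = defaultdict(set)
    for idx, sentence in enumerate(sentences):
        # Simple lowercase tokenizer; real preprocessing may differ.
        for token in re.findall(r"[a-z0-9_]+", sentence.lower()):
            if token in stopwords:
                continue
            edges[(token, idx)] = edges.get((token, idx), 0) + 1
            word_to_sentences[token].add(idx)
    return edges, dict(word_to_sentences)


def connected_sentence_pairs(word_to_sentences):
    """Return sentence pairs that share at least one word node."""
    pairs = set()
    for sent_ids in word_to_sentences.values():
        ordered = sorted(sent_ids)
        for i, a in enumerate(ordered):
            for b in ordered[i + 1:]:
                pairs.add((a, b))
    return pairs


# Toy PR text (invented for illustration only).
pr_sentences = [
    "Fix null pointer in parser",
    "Add unit test for parser",
    "Update README badges",
]
edges, word_to_sentences = build_word_sentence_graph(
    pr_sentences, stopwords={"in", "for"}
)
pairs = connected_sentence_pairs(word_to_sentences)
print(sorted(pairs))  # [(0, 1)]
```

In this toy input, the first two sentences are connected through the shared word node `parser`, while the third sentence stays isolated. A graph neural network would then pass messages along such word-sentence edges.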

Key words: Pull Request description; heterogeneous graph neural network; reinforcement learning; unstructured document; summarization generation

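Citation:

KUANG Li, SHI Ru-Yi, ZHAO Lei-Hao, ZHANG Huan, GAO Hong-Hao. Automatic Generation of Large-Granularity Pull Request Description. Journal of Software (软件学报), 2021, 32(6): 1597-1611.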


History

Received: August 9, 2020
Revised: October 26, 2020
Online: February 7, 2021
Published: June 6, 2021

