Feature Representation Method for Heterogeneous Defect Prediction Based on Variational Autoencoders

doi:10.13328/j.cnki.jos.006257

微信服务号

微信订阅号

2025-4-24- 9

Home > Archive>Volume 32, Issue 7, 2021 >2204-2218. DOI:10.13328/j.cnki.jos.006257

PDF HTML XML Export Cite reminder

Feature Representation Method for Heterogeneous Defect Prediction Based on Variational Autoencoders
DOI:
                        10.13328/j.cnki.jos.006257
                    
Author:
                        JIA Xiu-YiJIA Xiu-Yi
School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
ZHANG Wen-ZhouZHANG Wen-Zhou
School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
LI Wei-WeiLI Wei-Wei
College of Aerospace Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
HUANG Zhi-QiuHUANG Zhi-Qiu
College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:National Natural Science Foundation of China (61906090, U20B2064, 61773208); Natural Science Foundation of Jiangsu Province, China (BK20191287, BK20170809); Fundamental Research Funds for the Central Universities (30920021131); China Postdoctoral Science Foundation (2018M632304)

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Cross-project defect prediction technology can use the existing labeled defect data to predict new unlabeled data, but it needs to have the same metric features for two projects, which is difficult to be applied in actual development. Heterogeneous defect prediction can perform prediction without requiring the source and target project to have the same set of metrics and thus has attracted great interest. Existing heterogeneous defect prediction models use naive or traditional machine learning methods to learn feature representations between source and target projects, and perform prediction based on it. The feature representation learned by previous studies is weak, causing poor performance in predicting defect-prone instances. In view of the powerful feature extraction and representation capabilities of deep neural networks, this study proposes a feature representation method for heterogeneous defect prediction based on variational autoencoders. By combining the variational autoencoder and maximum mean discrepancy, this method can effectively learn the common feature representation of the source and target projects. Then, an effective defect prediction model can be trained based on it. The validity of the proposed method is verified by comparing it with traditional cross-project defect prediction methods and heterogeneous defect prediction methods on various datasets.

Key words:heterogeneous defect prediction;variational autoencoders;feature representation

Get Citation

贾修一,张文舟,李伟湋,黄志球.基于变分自编码器的异构缺陷预测特征表示方法.软件学报,2021,32(7):2204-2218

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:April 13,2020
Revised:October 26,2020
Adopted:
Online: January 22,2021
Published: July 06,2021

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History