Automated Static Warning Identification via Path-based Semantic Representation

doi:10.13328/j.cnki.jos.006982

微信服务号

微信订阅号

2025-5-16- 6

Home > Archive>Volume 35, Issue 10, 2024 >4662-4680. DOI:10.13328/j.cnki.jos.006982

PDF HTML XML Export Cite reminder

Automated Static Warning Identification via Path-based Semantic Representation
DOI:
                        10.13328/j.cnki.jos.006982
                    
Author:
                        ZHANG Yu-WeiZHANG Yu-Wei
School of Computer Science, Peking University, Beijing 100871, China;Key Lab of High Confidence Software Technologies (Peking University), Ministry of Education, Beijing 100871, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
XING YingXING Ying
School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
LI GeLI Ge
School of Computer Science, Peking University, Beijing 100871, China;Key Lab of High Confidence Software Technologies (Peking University), Ministry of Education, Beijing 100871, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
JIN ZhiJIN Zhi
School of Computer Science, Peking University, Beijing 100871, China;Key Lab of High Confidence Software Technologies (Peking University), Ministry of Education, Beijing 100871, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:TP311
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Static analysis tools often suffer from high false positive rates of reported alarms, despite their ability to aid developers in detecting potential defects early in the software development life cycle. To improve the availability of these tools, many automated warning identification techniques have been proposed to assist developers in classifying false positive alarms. However, existing approaches mainly focus on using hand-engineered features or statement-level abstract syntax tree token sequences to represent the defective code, failing to capture semantics from the reported alarms. To overcome the limitations of traditional approaches, this study employs deep neural networks with powerful feature extraction and representation abilities to generate code semantics from control flow graph paths for warning identification. The control flow graph abstractly represents the execution process of a given program. Thus, the generated path sequences of the control flow graph can guide the deep neural networks to learn semantic information about the potential defect more accurately. In this study, the pre-trained language model is fine-tuned to encode the path sequences and capture the semantic representations for model building. Finally, the study conducts extensive experiments on eight open-source projects to verify the effectiveness of the proposed approach by comparing it with the state-of-the-art baselines.

Key words:automated warning identification;path analysis;deep learning;pre-trained language model

Get Citation

张俞炜,邢颖,李戈,金芝.基于路径语义表示的静态警报自动确认方法.软件学报,2024,35(10):4662-4680

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:October 17,2022
Revised:January 04,2023
Adopted:
Online: October 11,2023
Published: October 06,2024

You are the first2044844Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History