Robustness Verification Method for Artificial Intelligence Systems Based on Source Code Processing
Abstract:

The development of artificial intelligence (AI) technology provides strong support for AI systems based on source code processing. Compared with natural language text, source code occupies a special semantic space, so machine learning tasks related to source code processing usually employ abstract syntax trees, data dependency graphs, and control flow graphs to capture the structured information of code and extract features. Through in-depth analysis of source code structures and flexible application of classifiers, existing studies achieve excellent results in experimental scenarios. However, in real application scenarios, where source code structures are more complex, most AI systems for source code processing perform poorly and are difficult to deploy in industry, which prompts practitioners to consider the robustness of such AI systems. As AI-based systems are generally data-driven black-box systems, their robustness is difficult to measure directly. With the emergence of adversarial attack techniques, some scholars in natural language processing have designed adversarial attacks for different tasks to verify model robustness and have conducted large-scale empirical studies. To address the instability of AI systems based on source code processing in complex code scenarios, this study proposes robustness verification by the Metropolis-Hastings attack method (RVMHM). First, a code preprocessing tool based on abstract syntax trees extracts the variable pool of the model; then, the MHM source code attack algorithm perturbs the model's predictions by replacing variables. The robustness of an AI system is measured by interfering with the data-model interaction process and observing the change in a robustness verification index before and after the attack. Taking vulnerability prediction as a typical binary classification scenario of source code processing, this study verifies the robustness of 12 groups of AI vulnerability prediction models on three open source project datasets to illustrate the effectiveness of RVMHM in verifying the robustness of AI systems based on source code processing.
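    The pipeline described above (AST-based variable pool extraction followed by Metropolis-Hastings variable replacement) can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the `true_label_prob` scoring interface and the Python `ast`-based extraction are assumptions made for self-containment, the paper's datasets are C/C++ open source projects, and the real MHM attack rewrites identifiers at the AST level rather than at the token level.

```python
import ast
import math
import random
import re

def extract_variable_pool(source: str) -> list[str]:
    """Collect assigned variable names from the abstract syntax tree.

    Simplified stand-in for RVMHM's AST-based preprocessing step;
    parses Python here purely so the sketch is self-contained.
    """
    names = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Name) and isinstance(node.ctx, ast.Store):
            names.add(node.id)
    return sorted(names)

def mhm_attack(source, true_label_prob, candidates,
               steps=200, temperature=1.0, seed=0):
    """Metropolis-Hastings style identifier renaming, in the spirit of MHM.

    true_label_prob(source) -> probability the classifier assigns to the
    ground-truth label (an assumed interface, not the paper's API).
    Lower is better for the attacker; below 0.5 a binary prediction
    flips, which counts as a successful attack.
    """
    rng = random.Random(seed)
    current, current_p = source, true_label_prob(source)
    for _ in range(steps):
        pool = extract_variable_pool(current)
        if not pool:
            break
        old = rng.choice(pool)
        new = rng.choice(candidates)
        if new in pool:
            continue  # avoid capturing an existing name
        # Token-level rename for brevity; a real tool rewrites the AST so
        # only genuine identifier occurrences are replaced.
        proposal = re.sub(rf"\b{re.escape(old)}\b", new, current)
        proposal_p = true_label_prob(proposal)
        # MH acceptance: always take proposals that lower the true-label
        # probability; take worse ones with probability exp(-delta / T).
        if rng.random() < math.exp(min(0.0, (current_p - proposal_p) / temperature)):
            current, current_p = proposal, proposal_p
        if current_p < 0.5:
            break  # binary prediction flipped
    return current, current_p
```

    A robustness index can then be computed as the fraction of originally correct predictions that survive the attack; comparing this index before and after perturbation mirrors the before/after measurement of the verification index described in the abstract.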

Get Citation

Yang Yanjing, Mao Runfeng, Tan Rui, Shen Haifeng, Rong Guoping. Robustness Verification Method for Artificial Intelligence Systems Based on Source Code Processing. Journal of Software, 2023, 34(9): 4018-4036

History
  • Received: September 05, 2022
  • Revised: October 13, 2022
  • Online: January 13, 2023