Data-free Model Stealing Attack Method Based on Visual Feature Decoupling
Author: Zhang Jinhong, Liu Renyang, Wei Tingchu, Dong Yunyun, Zhou Wei
Affiliation:

CLC Number: TP309

Abstract:

With the deepening of research on the security and privacy of deep learning models, researchers have found that model stealing attacks pose a serious threat to neural networks. A typical data-dependent model stealing attack uses a certain proportion of real data to query the target model and trains a substitute model locally to steal the target. Since 2020, data-free model stealing attacks have been proposed that can steal deep neural networks using only fake query examples produced by generative models. Because they do not rely on real data, data-free model stealing attacks can cause more serious damage. However, the query examples constructed by existing data-free methods lack diversity and effectiveness, so the stealing process requires a large number of queries yet achieves a relatively low attack success rate. This study therefore proposes a visual feature decoupling-based model stealing attack (VFDA), which uses a multi-decoder structure to decouple the visual features of the query examples generated during data-free model stealing, thereby improving the diversity of the query examples and the effectiveness of model stealing. Specifically, VFDA first uses three decoders to generate the texture information, region encoding, and smoothing information of a query example, respectively, completing the decoupling of its visual features. Second, to make the generated query examples better match the visual features of real examples, the sparsity of the texture information is constrained and the generated smoothing information is filtered. VFDA exploits the property that the representational tendency of neural networks depends on image texture features and can generate query examples with inter-class diversity, which effectively improves both the similarity of the stolen model to the target and the attack success rate. In addition, VFDA imposes an intra-class diversity loss on the smoothing information produced by the decoupling so that the query examples better match the distribution of real examples. Compared with multiple model stealing attack methods, the proposed VFDA achieves better model stealing similarity and attack success rates. In particular, on the higher-resolution GTSRB and Tiny-ImageNet datasets, its attack success rate exceeds that of the stronger EBFA method by 3.86% and 4.15% on average, respectively.
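The abstract does not give implementation details, so the following PyTorch sketch is only illustrative: the class MultiDecoderGenerator, its layer sizes, the fixed Gaussian filter, and the intra_class_diversity_loss helper are all assumptions, not the authors' actual architecture. It shows the general idea the abstract describes: three decoders sharing one latent code emit texture, region-encoding, and smoothing components; the texture branch carries an L1 sparsity penalty; the smoothing branch is low-pass filtered; and an intra-class diversity term spreads out same-class smooth components.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class MultiDecoderGenerator(nn.Module):
    """Illustrative three-decoder generator: texture, region encoding, smoothing."""

    def __init__(self, z_dim=128, img_ch=3, img_size=32):
        super().__init__()
        s = img_size // 4
        self.fc = nn.Linear(z_dim, 128 * s * s)
        self.unflatten = nn.Unflatten(1, (128, s, s))

        def make_decoder():
            return nn.Sequential(
                nn.Upsample(scale_factor=2),
                nn.Conv2d(128, 64, 3, padding=1),
                nn.BatchNorm2d(64),
                nn.ReLU(),
                nn.Upsample(scale_factor=2),
                nn.Conv2d(64, img_ch, 3, padding=1),
            )

        self.texture_dec = make_decoder()  # high-frequency texture component
        self.region_dec = make_decoder()   # spatial region encoding
        self.smooth_dec = make_decoder()   # low-frequency smooth base
        # Fixed 3x3 Gaussian-like kernel used to filter the smoothing branch.
        k = torch.tensor([[1., 2., 1.], [2., 4., 2.], [1., 2., 1.]]) / 16.0
        self.register_buffer("blur", k.expand(img_ch, 1, 3, 3).clone())
        self.img_ch = img_ch

    def forward(self, z):
        h = self.unflatten(self.fc(z))
        texture = torch.tanh(self.texture_dec(h))
        region = torch.sigmoid(self.region_dec(h))  # soft per-pixel mask
        smooth = torch.tanh(self.smooth_dec(h))
        # Low-pass filtering keeps the smoothing branch genuinely smooth.
        smooth = F.conv2d(smooth, self.blur, padding=1, groups=self.img_ch)
        # Compose the query example: smooth base plus region-gated texture.
        x = torch.tanh(smooth + region * texture)
        # L1 penalty constrains texture sparsity toward real-image statistics.
        sparsity_loss = texture.abs().mean()
        return x, smooth, sparsity_loss


def intra_class_diversity_loss(smooth, labels):
    """Hypothetical intra-class diversity term: push the smooth components
    of same-class queries apart (negative mean pairwise distance)."""
    loss = smooth.new_zeros(())
    for c in labels.unique():
        f = smooth[labels == c].flatten(1)
        if f.size(0) > 1:
            loss = loss - torch.cdist(f, f).mean()
    return loss
```

Under these assumptions, a training step would draw a latent batch z = torch.randn(64, 128), generate queries with the three decoders, query the target model for labels, and combine the substitute-training loss with the sparsity and intra-class diversity terms when updating the generator.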

Get Citation

Zhang JH, Liu RY, Wei TC, Dong YY, Zhou W. Data-free model stealing attack method based on visual feature decoupling. Journal of Software,,():1-15 (in Chinese)

History
  • Received: June 27, 2023
  • Revised: January 11, 2024
  • Online: March 12, 2025