Adversarial Sample Generation Method Based on Chinese Features

doi:10.13328/j.cnki.jos.006744


Volume 34, Issue 11, 2023, pp. 5143-5161

Author:
LI Xiang-Ge, LUO Hong, SUN Yan
School of Computer Science (National Pilot Software Engineering School), Beijing University of Posts and Telecommunications, Beijing 100876, China

CLC Number: TP18


Abstract:

Deep neural networks are vulnerable to adversarial samples. For instance, in a text classification task, modifying a few characters, words, or punctuation marks in the original text can fool the model into changing its classification result. Studies of Chinese adversarial samples in natural language processing (NLP) remain limited, and they fail to give due consideration to the language features of Chinese. Starting from Chinese sentiment classification scenarios and taking into account the pictographic, phonetic, and other language features of Chinese, this study proposes CWordCheater, a high-quality method that generates adversarial samples at the character and word levels and covers pronunciation, glyphs, and punctuation marks. For the replacement of visually similar characters, a ConvAE network is adopted to embed Chinese characters as visual vectors, from which the candidate pool of replacement characters is obtained. Moreover, a semantic constraint method based on universal sentence encoder (USE) distance is proposed to avoid semantic drift in the adversarial samples. Finally, the study proposes a set of multi-dimensional evaluation methods that assess the quality of adversarial samples in terms of attack effect and attack cost. Experimental results show that CWordAttacker can reduce the classification accuracy by at least 27.9% on multiple classification models and multiple datasets and has a lower perturbation cost in terms of vision and semantics.
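
The two mechanisms named in the abstract, a candidate pool of visually similar characters and a semantic constraint on the perturbed sentence, can be illustrated with a minimal sketch. This is not the authors' implementation. The glyph vectors below are made-up toy values standing in for ConvAE embeddings, cosine similarity is assumed as the closeness measure in both steps, the 0.8 threshold is arbitrary, and the function names `candidate_pool` and `passes_semantic_constraint` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def candidate_pool(char, glyph_vectors, k=2):
    """Return the k characters whose glyph vectors are closest to char's.

    glyph_vectors maps a character to its visual embedding. In the paper
    these come from a ConvAE network; here they are supplied by the caller.
    """
    target = glyph_vectors[char]
    scored = [
        (cosine_similarity(target, vec), other)
        for other, vec in glyph_vectors.items()
        if other != char
    ]
    scored.sort(reverse=True)  # most similar first
    return [other for _, other in scored[:k]]


def passes_semantic_constraint(orig_vec, adv_vec, min_similarity=0.8):
    """Accept a perturbed sentence only if its sentence vector stays close
    to the original's. The vectors would come from a sentence encoder such
    as USE; the threshold is an assumed, illustrative value."""
    return cosine_similarity(orig_vec, adv_vec) >= min_similarity


# Toy glyph vectors (hypothetical values, not real ConvAE output).
GLYPH_VECTORS = {
    "未": [0.90, 0.10, 0.30],
    "末": [0.88, 0.12, 0.31],
    "木": [0.70, 0.30, 0.20],
    "口": [0.00, 1.00, 0.00],
}

if __name__ == "__main__":
    print(candidate_pool("未", GLYPH_VECTORS, k=2))
    print(passes_semantic_constraint([1.0, 0.0], [1.0, 0.1]))
```

In this sketch a replacement is drawn from the pool first and the resulting sentence is then checked against the semantic constraint, so a visually plausible substitution is discarded if it moves the sentence vector too far from the original.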

Key words: Chinese sentiment classification; adversarial sample; Chinese feature

Get Citation

LI Xiang-Ge, LUO Hong, SUN Yan. Adversarial Sample Generation Method Based on Chinese Features. Journal of Software, 2023, 34(11): 5143-5161

History

Received: January 08, 2022
Revised: April 13, 2022
Online: June 16, 2023
Published: November 06, 2023

Copyright: Institute of Software, Chinese Academy of Sciences