Enhancement of Textual Adversarial Attack Ability Based on Sememe-level Sentence Dilution Algorithm

doi:10.13328/j.cnki.jos.006525


Journal of Software, Volume 34, Issue 7, 2023, pp. 3313-3328

Author:
YE Wen-Tao
Software Engineering Institute, East China Normal University, Shanghai 200062, China
ZHANG Min
Shanghai Key Laboratory of Trustworthy Computing, Shanghai 200062, China
CHEN Yi-Xiang
MOE Engineering Research Center for Software/Hardware Co-design Technology and Application, Shanghai 200062, China

                    
CLC Number: TP309


Abstract:

With machine learning widely applied to natural language processing (NLP) in recent years, the security of NLP tasks has received growing concern. Existing studies have found that small modifications to examples can lead to wrong machine learning predictions; crafting such modifications is known as an adversarial attack. Textual adversarial attacks can effectively reveal the vulnerabilities of NLP models so that they can be improved. Nevertheless, existing textual adversarial attack methods all focus on designing complex strategies for generating adversarial examples, which yield only limited improvement in success rate, and their highly invasive modifications degrade textual quality. A simple and effective method that produces high-quality adversarial examples is therefore in demand. To address this problem, the sememe-level sentence dilution algorithm (SSDA) and the dilution pool construction algorithm (DPCA) are proposed from the new perspective of improving the adversarial attack process itself. SSDA is a new step that can be freely embedded into the classical adversarial attack workflow. It first uses dilution pools constructed by DPCA to dilute the original examples and then generates adversarial examples from the diluted examples. SSDA not only improves the success rate of any adversarial attack method, without restriction on dataset or victim model, but also yields higher adversarial example quality than the original method. Experiments across different datasets, dilution pools, victim models, and textual adversarial attack methods verify that SSDA improves the success rate and show that dilution pools constructed by DPCA further enhance the dilution ability of SSDA. The experimental results demonstrate that SSDA reveals more model vulnerabilities than classical methods and that DPCA helps SSDA improve the success rate while achieving higher adversarial example quality.
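The abstract describes the workflow only at a high level: dilute the original example with a pool, then attack the diluted example. The following is a minimal Python sketch of that two-step shape under stated assumptions, not the authors' implementation. It assumes that dilution means appending pool sentences while the victim model's prediction stays unchanged. The keyword classifier, the word-substitution attack, and every name (`predict`, `dilute`, `substitute_attack`, the pool contents) are hypothetical stand-ins. The sketch does not model sememes or DPCA.

```python
from typing import List, Optional

# Hypothetical toy victim model: label 1 if positive words outnumber negative ones.
POSITIVE = {"good", "great", "fine"}
NEGATIVE = {"bad", "awful", "poor"}


def score(text: str) -> int:
    words = text.lower().split()
    return sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)


def predict(text: str) -> int:
    return 1 if score(text) > 0 else 0


def dilute(text: str, pool: List[str], label: int) -> str:
    """Assumed dilution step: append a pool sentence only if the label is preserved."""
    diluted = text
    for sentence in pool:
        candidate = diluted + " " + sentence
        if predict(candidate) == label:
            diluted = candidate
    return diluted


def substitute_attack(text: str, label: int, budget: int = 1) -> Optional[str]:
    """Toy attack: replace at most `budget` positive words with a neutral word.

    Returns the adversarial text if the prediction flips, otherwise None.
    """
    words = text.split()
    changed = 0
    for i, word in enumerate(words):
        if changed >= budget:
            break
        if word.lower() in POSITIVE:
            words[i] = "decent"
            changed += 1
            candidate = " ".join(words)
            if predict(candidate) != label:
                return candidate
    return None


original = "the food was good and the service was great"
label = predict(original)

# Classical workflow: attack the original example directly.
direct = substitute_attack(original, label)

# Workflow with a dilution step embedded before the attack.
pool = ["the parking was poor", "the weather was mild"]
diluted = dilute(original, pool, label)
adversarial = substitute_attack(diluted, label)
```

In this toy setting the one-word attack fails on the original example but succeeds on the diluted one, because dilution leaves the label unchanged while moving the example closer to the decision boundary. Whether the real SSDA works this way is an assumption of the sketch.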

Key words: adversarial attack; machine learning; natural language processing (NLP); boundary value analysis; sememe

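Citation:

YE Wen-Tao, ZHANG Min, CHEN Yi-Xiang. Enhancement of Textual Adversarial Attack Ability Based on Sememe-level Sentence Dilution Algorithm. Journal of Software (软件学报), 2023, 34(7): 3313-3328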


History

Received: June 22, 2021
Revised: September 22, 2021
Online: September 09, 2022
Published: July 06, 2023

