Adversarial Examples Generation Approach for Tendency Classification on Chinese Texts
Authors:
Author biographies:

Wang Wenqi (1992-), male, from Xiangyang, Hubei, Ph.D. candidate; his research interests include artificial intelligence security and natural language processing. Wang Lina (1964-), female, Ph.D., professor, doctoral supervisor; her research interests include system security and information hiding. Wang Run (1991-), male, Ph.D.; his research interests include mobile device privacy protection and machine learning. Tang Benxiao (1991-), male, Ph.D., CCF student member; his research interests include Android privacy protection and machine learning.

Corresponding author:

Wang Lina, E-mail: lnawang@163.com

CLC number:

TP309

Fund projects:

National Natural Science Foundation of China (61876134); National Key Research and Development Program of China (2016YFB0801100); Fundamental Research Funds for the Central Universities (2042018kf1028)

Abstract:

Studies have shown that adding small perturbations to the input of a deep neural network (DNN) can cause it to misclassify; such attacks are known as adversarial example attacks. These attacks also exist in DNN-based sentiment tendency classification of Chinese text, and this paper therefore proposes WordHanding, an adversarial example generation method for Chinese text. The method designs a new algorithm for computing word importance and replaces the important words with homophones to generate adversarial examples, which are used to mount adversarial attacks in a black-box setting. The effectiveness of the method is verified on two DNN models, a long short-term memory network (LSTM) and a convolutional neural network (CNN), using real data sets (Jingdong shopping reviews and Ctrip hotel reviews). Experimental results show that the generated adversarial examples can effectively mislead tendency detection systems for Chinese text.
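The abstract describes a two-step pipeline: score each word's importance against the black-box classifier, then replace the top-scoring words with homophones. A minimal, self-contained sketch of that idea follows; the homophone table, the toy classifier, and all function names here are illustrative assumptions, not the authors' implementation, and the deletion-based importance score is only a common black-box proxy standing in for the paper's own word-importance algorithm:

```python
# Sketch of a black-box homophone-substitution attack (illustrative only).
from typing import Callable, Dict, List

# Tiny hand-made homophone table (same pronunciation, different character);
# a real system would use a pinyin-based lookup over a large vocabulary.
HOMOPHONES: Dict[str, List[str]] = {
    "好": ["号"],  # hao
    "贵": ["桂"],  # gui
}

def word_importance(words: List[str], classify: Callable[[str], float]) -> List[float]:
    """Score each word by how much the classifier's positive-class
    probability drops when that word is deleted (a black-box proxy)."""
    base = classify("".join(words))
    scores = []
    for i in range(len(words)):
        reduced = "".join(words[:i] + words[i + 1:])
        scores.append(base - classify(reduced))
    return scores

def generate_adversarial(words: List[str],
                         classify: Callable[[str], float],
                         max_subs: int = 2) -> str:
    """Replace up to max_subs of the most important words with homophones."""
    scores = word_importance(words, classify)
    order = sorted(range(len(words)), key=lambda i: scores[i], reverse=True)
    adv = list(words)
    subs = 0
    for i in order:
        if subs >= max_subs:
            break
        if adv[i] in HOMOPHONES:
            adv[i] = HOMOPHONES[adv[i]][0]
            subs += 1
    return "".join(adv)

def toy_classify(text: str) -> float:
    """Toy stand-in for a sentiment model: 'positive' iff 好 appears."""
    return 0.9 if "好" in text else 0.1

# The word 好 carries all of the toy classifier's decision, so it is
# scored most important and swapped for its homophone 号.
adv = generate_adversarial(["这", "个", "真", "好"], toy_classify)
```

The sketch preserves the pronunciation of the sentence (so a human reader still recovers the sentiment) while changing the surface characters the classifier sees, which is the intuition behind homophone substitution.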

Cite this article:

Wang Wenqi, Wang Run, Wang Lina, Tang Benxiao. Adversarial examples generation approach for tendency classification on Chinese texts. Ruan Jian Xue Bao/Journal of Software, 2019,30(8):2415-2427 (in Chinese).

History
  • Received: 2018-05-31
  • Revised: 2018-09-21
  • Published online: 2019-04-03