Abstract: Detecting out-of-distribution (OOD) samples that fall outside the training distribution is crucial for deploying deep neural network (DNN) classifiers in open environments. OOD detection is a binary classification problem: input samples are classified as either in-distribution (ID) or OOD. However, the detector itself can be bypassed by malicious adversarial attacks. OOD samples carrying such malicious perturbations are called adversarial OOD samples, and building OOD detectors robust enough to detect them is considerably more challenging. Existing methods usually train the DNN on adversarial OOD samples crafted within the neighborhood of auxiliary clean OOD samples, so that it learns representations that are separable and robust to malicious perturbations. However, because of the distributional difference between the auxiliary OOD training set and the original ID training set, training on adversarial OOD samples alone is not effective enough to make the ID boundary robust to adversarial perturbations. Adversarial ID samples, generated within the neighborhood of clean ID samples, lie closer to the ID boundary and are therefore also effective at improving its adversarial robustness. This study proposes a semi-supervised adversarial training approach, DiTing, for building robust OOD detectors that detect both clean and adversarial OOD samples. The approach treats adversarial ID samples as auxiliary "near-OOD" samples and trains on them jointly with the auxiliary clean and adversarial OOD samples to improve the robustness of OOD detection. Experiments show that DiTing detects adversarial OOD samples generated by strong attacks significantly better than prior methods, while maintaining state-of-the-art performance in classifying clean ID samples and detecting clean OOD samples.
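The abstract does not spell out the training objective, so the following PyTorch sketch only illustrates one plausible reading of the joint training it describes: clean ID samples keep their classification loss, while clean OOD, adversarial OOD, and adversarial ID samples (the "near-OOD" role) all receive an OOD-side loss. The Outlier-Exposure-style uniform-softmax loss, the L-infinity PGD attack, and all names and weights (`pgd_attack`, `diting_step`, the 0.5 coefficient) are assumptions for illustration, not details taken from the paper.

```python
import torch
import torch.nn.functional as F

def uniform_loss(logits):
    """Cross-entropy to the uniform distribution over classes
    (Outlier-Exposure-style OOD objective; assumed, not from the paper)."""
    return -F.log_softmax(logits, dim=1).mean()

def pgd_attack(model, x, loss_fn, eps=8/255, alpha=2/255, steps=10):
    """L-inf PGD: ascend loss_fn(model(x_adv)) within an eps-ball around x."""
    model.eval()
    x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1).detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = loss_fn(model(x_adv))
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()
        # Project back into the eps-ball and the valid pixel range.
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps).clamp(0, 1)
    return x_adv.detach()

def diting_step(model, x_id, y_id, x_ood, optimizer, eps=8/255):
    """One hypothetical training step: adversarial ID samples are treated
    as auxiliary near-OOD samples alongside clean/adversarial OOD samples."""
    # Adversarial ID samples: maximize the classification loss near clean ID.
    x_id_adv = pgd_attack(model, x_id,
                          lambda lo: F.cross_entropy(lo, y_id), eps)
    # Adversarial OOD samples: maximize the OOD-side loss, i.e. push
    # auxiliary OOD samples toward confident, ID-like predictions.
    x_ood_adv = pgd_attack(model, x_ood, uniform_loss, eps)

    model.train()
    loss = F.cross_entropy(model(x_id), y_id)          # clean ID: classify
    for x_out in (x_ood, x_ood_adv, x_id_adv):         # OOD side: uniform
        loss = loss + 0.5 * uniform_loss(model(x_out)) # 0.5: assumed weight
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

At test time, a sample would be flagged as OOD when its maximum softmax probability falls below a threshold; under the uniform-loss objective above, both perturbed outliers and boundary-hugging adversarial ID neighborhoods are pushed toward low-confidence predictions, which is the intuition the abstract attributes to treating adversarial ID samples as near-OOD.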