Source Free Robust Domain Adaptation Based on Pseudo Label Uncertainty Estimation

doi:10.13328/j.cnki.jos.006467


Journal of Software, 2022, Vol. 33, No. 4, pp. 1183-1199

Authors:
WANG Fan, School of Software, Shandong University, Jinan 250100, China
HAN Zhong-Yi, School of Software, Shandong University, Jinan 250100, China
YIN Yi-Long, School of Software, Shandong University, Jinan 250100, China
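Author biographies: WANG Fan (1999-), female, master's student; research interests: machine learning, domain adaptation, source-free domain adaptation.
YIN Yi-Long (1972-), male, professor, doctoral supervisor, CCF Distinguished Member; research interests: machine learning, data mining.
HAN Zhong-Yi (1994-), male, PhD student; research interests: machine learning, data mining.
Corresponding authors: HAN Zhong-Yi, E-mail: hanzhongyicn@gmail.com; YIN Yi-Long, E-mail: ylyin@sdu.edu.cn
Funding: National Natural Science Foundation of China (62176139)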


Abstract:

Unsupervised domain adaptation is an effective way to address the distribution mismatch between a training set (source domain) and a test set (target domain). Existing theories and methods for unsupervised domain adaptation have achieved some success in relatively closed, static environments. In open, dynamic task environments, however, constraints such as privacy protection and data silos often make the source domain data directly inaccessible, which severely challenges the robustness of existing methods. This paper therefore investigates a more challenging yet under-explored problem: source-free unsupervised domain adaptation, whose goal is to achieve positive transfer from the source domain to the target domain using only a pre-trained source domain model and unlabeled target domain data. It proposes a method named PLUE-SFRDA (pseudo label uncertainty estimation for source free robust domain adaptation). The core idea of PLUE-SFRDA is to start from the predictions of the source domain model, combine information entropy and an energy function to mine the implicit information in the target domain data, and explore class prototypes and class anchors so as to estimate the pseudo labels of the target domain data accurately; the domain adaptation model is then tuned accordingly, achieving robust domain adaptation without source domain data. PLUE-SFRDA has three components. First, a proposed binary soft constraint information entropy addresses the failure of standard information entropy to estimate the uncertainty of samples near the decision boundary; it increases the reliability of the mined class prototypes and class anchors and thereby improves the accuracy of target domain pseudo label estimation. Second, a proposed weighted contrastive filtering method compares each sample's weighted distance to the anchor of its own class with its weighted distances to the anchors of the other classes and filters out samples near the decision boundary whose class information is ambiguous, which further improves the safety of pseudo label uncertainty estimation. Third, an information maximization loss drives the iterative optimization of the source domain classifier and the pseudo label estimator, gradually transferring the source domain knowledge embedded in the source domain model to the target domain and further improving the robustness of pseudo label uncertainty estimation. Extensive experiments on three public benchmark datasets, Office-31, Office-Home, and VisDA-C, show that PLUE-SFRDA not only outperforms state-of-the-art source-free domain adaptation methods but also significantly outperforms existing domain adaptation methods that depend on source domain data.
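The abstract names information entropy, an energy function, and an information maximization loss but does not give their formulas. As a rough illustration only, the following Python sketch uses the standard textbook forms: Shannon entropy of the softmax output, the free energy score -T * log(sum_k exp(f_k / T)), and the common information maximization objective (mean per-sample entropy minus the entropy of the mean prediction). The function names, the temperature parameter, and the entropy-then-energy ranking rule are assumptions made for this sketch; in particular, it does not implement the binary soft constraint information entropy proposed in the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over one sample's class logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def shannon_entropy(probs):
    """Standard Shannon entropy (in nats); zero-probability terms contribute 0."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def free_energy(logits, temperature=1.0):
    """Energy score E = -T * log(sum_k exp(f_k / T)), via a stable log-sum-exp."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    return -temperature * (m + math.log(sum(math.exp(s - m) for s in scaled)))


def rank_by_confidence(batch_logits):
    """Sample indices ordered from most to least confident.

    Assumption for this sketch: sort by entropy first, then by energy
    (lower is treated as more confident for both).
    """
    scores = [
        (shannon_entropy(softmax(logits)), free_energy(logits), i)
        for i, logits in enumerate(batch_logits)
    ]
    return [i for _, _, i in sorted(scores)]


def information_maximization_loss(batch_probs):
    """Mean per-sample entropy minus the entropy of the mean prediction.

    Minimizing it favors predictions that are individually confident
    and collectively spread across classes.
    """
    n = len(batch_probs)
    num_classes = len(batch_probs[0])
    mean_entropy = sum(shannon_entropy(p) for p in batch_probs) / n
    mean_probs = [sum(p[k] for p in batch_probs) / n for k in range(num_classes)]
    return mean_entropy - shannon_entropy(mean_probs)
```

For example, rank_by_confidence([[0.0, 0.0, 0.0], [10.0, 0.0, 0.0], [2.0, 1.0, 0.0]]) places the sharply peaked second sample first and the uniform first sample last. In a source-free setting, the most confident target samples under such a ranking would be natural candidates for building class prototypes, although the selection rule the paper actually uses may differ.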

Keywords: unsupervised domain adaptation; source-free domain adaptation; pseudo label learning; information entropy; energy function; uncertainty estimation
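The weighted contrastive filtering step is described in the abstract only at a high level: a sample is kept or discarded by comparing its weighted distance to the anchor of its own class with its weighted distances to the anchors of the other classes. A minimal Python sketch of one way such a filter could look is given below. The Euclidean distance, the single margin weight, the keep rule, and the use of per-class feature means as anchors are all assumptions for illustration; the paper's actual weighting scheme and anchor construction are not specified in this abstract.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def class_anchors(features, pseudo_labels, num_classes):
    """Hypothetical anchors: the mean feature of the samples assigned to each class."""
    dim = len(features[0])
    sums = [[0.0] * dim for _ in range(num_classes)]
    counts = [0] * num_classes
    for feat, label in zip(features, pseudo_labels):
        counts[label] += 1
        for d in range(dim):
            sums[label][d] += feat[d]
    # A class with no assigned samples gets no anchor (None).
    return [
        [s / counts[k] for s in sums[k]] if counts[k] else None
        for k in range(num_classes)
    ]


def contrastive_filter(features, pseudo_labels, anchors, margin=1.5):
    """Indices of samples that sit clearly closer to their own class anchor.

    Assumed keep rule: margin * d(sample, own anchor) < d(sample, nearest other anchor).
    Samples failing the rule are treated as ambiguous boundary samples and dropped.
    """
    kept = []
    for i, (feat, label) in enumerate(zip(features, pseudo_labels)):
        own = euclidean(feat, anchors[label])
        others = [
            euclidean(feat, anchor)
            for k, anchor in enumerate(anchors)
            if k != label and anchor is not None
        ]
        if not others or margin * own < min(others):
            kept.append(i)
    return kept
```

With anchors at [0.0, 0.0] and [4.0, 0.0] and margin=1.5, a class-0 sample at [1.5, 0.0] is kept, while class-0 samples at [1.7, 0.0] and [2.1, 0.0] are filtered out as too close to the boundary. A larger margin makes the filter stricter, trading pseudo label coverage for reliability.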

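Cite this article

WANG Fan, HAN Zhong-Yi, YIN Yi-Long. Source free robust domain adaptation based on pseudo label uncertainty estimation. Journal of Software, 2022, 33(4): 1183-1199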


History

Received: 2021-03-10
Last revised: 2021-07-16
Published online: 2021-10-26
Published: 2022-04-06
