[关键词]
[摘要]
无监督域自适应是解决训练集(源域)和测试集(目标域)分布不一致的有效途径之一.现有的无监督域自适应的理论和方法在相对封闭、静态的环境下取得了一定成功,但面向开放动态任务环境时,在隐私保护、数据孤岛等限制条件下,源域数据往往不可直接获取,现有无监督域自适应方法的鲁棒性将面临严峻的挑战.鉴于此,研究了一个更具挑战性却又未被充分探索的问题:源域无关的无监督域自适应,目标是仅依据预训练的源域模型和无标签目标域数据,实现源域向目标域的正向迁移.提出一种基于伪标签不确定性估计的源域无关鲁棒域自适应的方法PLUE-SFRDA (pseudo label uncertainty estimation for source free robust domain adaptation).PLUE-SFRDA的核心思想是:根据源域模型的预测结果,联合信息熵和能量函数充分挖掘目标域数据的隐含信息,探索类原型和类锚点,以准确估计目标域数据的伪标签,进而调优域自适应模型,实现源域数据无关的鲁棒域自适应.PLUE-SFRDA包含提出的二元软约束信息熵,解决了标准信息熵不能有效估计处于决策边界样本的不确定性的问题,增强了所挖掘的类原型和类锚点的可信度,进而提高了目标域伪标签估计的准确率.PLUE-SFRDA包含了提出的加权对比过滤方法,通过比较每个样本距离该类的类锚点和其他类的类锚点的加权距离,过滤掉处于决策边界的类别信息模糊样本,进一步提高了伪标签不确定性估计的安全性.PLUE-SFDRA还包含一个信息最大化损失,实现源域分类器和伪标签估计器迭代优化,逐渐将源域模型中蕴含的源域知识迁移至目标域,进一步提高了伪标签不确定性估计的鲁棒性.在Office-31,Office-Home和VisDA-C这3个公开的基准数据集上的大量实验表明:PLUE-SFRDA不仅超过了最新的源域无关的域自适应方法的表现,还显著优于现有的依赖源域数据的域自适应方法.
[Key word]
[Abstract]
Unsupervised domain adaptation is one of the effective ways to solve the inconsistent distribution of training set (source domain) and test set (target domain). Existing unsupervised domain adaptation theories and methods have achieved some success in relatively closed and static environments. However, for open dynamic task environments, the robustness of existing unsupervised domain adaptation methods will face serious challenges under the constraints of privacy protection and data silos, where source domain data are often not directly accessible. In view of this, this paper investigates a more challenging yet under-explored problem: source free unsupervised domain adaptation, with the goal of achieving positive transfer from the source domain to the target domain based only on the pre-trained source domain model and unlabeled target domain data. In this paper, we propose a method called PLUE-SFRDA (pseudo label uncertainty estimation for source free robust domain adaptation). The core idea of PLUE-SFRDA is to combine information entropy and energy function to fully explore the implicit information of the target domain data based on the prediction results of the source domain model, explore the class prototypes and class anchors to accurately estimate the pseudo label of the target domain data, and then tune the domain adaptation model to achieve the source free robust domain adaptation. PLUE-SFRDA contains a proposed binary soft constraint information entropy, which solves the problem that the standard information entropy cannot effectively estimate the pseudo label uncertainty of samples at the decision boundary, enhances the confidence of the mined class prototypes, and thus improves the accuracy of pseudo label estimation in the target domain. PLUE-SFRDA contains a weighted comparison filtering method proposed by this paper. By comparing the weighted distances of each sample to the class anchors of other classes, the fuzzy samples of class information at the decision boundary are filtered out, which further improves the security of the new pseudo label uncertainty estimation. PLUE-SFRDA also contains an information maximization loss to achieve iterative optimization of the source domain classifier and the pseudo label estimator, which gradually migrates the source domain knowledge embedded in the source domain model to the target domain, further improving the robustness of the pseudo label uncertainty estimation. Extensive experiments on three publicly available datasets, Office-31, Office-Home and VisDA-C, show that PLUE-SFRDA not only outperforms the state-of-the-art source-free domain adaptation methods but also significantly outperforms standard domain adaptation methods which depend on the source-domain data.
[中图分类号]
[基金项目]
国家自然科学基金(62176139)