Pseudo-labeling Algorithm Based on Optimal Transport for Deep Semi-supervised Learning

doi:10.13328/j.cnki.jos.007054

微信服务号

微信订阅号

2025-4-24- 14

Home > Archive>Volume 35, Issue 11, 2024 >5196-5209. DOI:10.13328/j.cnki.jos.007054

PDF HTML XML Export Cite reminder

Pseudo-labeling Algorithm Based on Optimal Transport for Deep Semi-supervised Learning
DOI:
                        10.13328/j.cnki.jos.007054
                    
Author:
                        ZHAI De-MingZHAI De-Ming
Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
SHEN Si-XianSHEN Si-Xian
Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
ZHOU XiongZHOU Xiong
Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
JIANG Jun-JunJIANG Jun-Jun
Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
LIU Xian-MingLIU Xian-Ming
Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
JI Xiang-YangJI Xiang-Yang
Department of Automation, Tsinghua University, Beijing 100084, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:TP18
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Deep learning has been widely employed in many fields and yields excellent performance. However, this often requires the support of large amounts of labeled data, which usually means high costs and harsh application conditions. Therefore, with the development of deep learning, how to break through data limitations in practical scenarios has become an important research problem. Specifically, as one of the most important research directions, semi-supervised learning greatly relieves the data requirement pressure of deep learning by conducting learning with the assistance of abundant unlabeled data and a small number of labeled data. The pseudo-labeling method plays a significant role in semi-supervised learning, and the quality of its generated pseudo labels will influence the final results of semi-supervised learning. Focusing on pseudo-labeling in semi-supervised learning, this study proposes the pseudo-labeling method based on optimal transport theory, which introduces the pseudo-labeling procedure constraint with labeled data as generation process guidance. On this basis, the pseudo-labeling procedure is converted to the optimization problem of optimal transport, which offers a new form for solving pseudo-labeling. Meanwhile, to solve this problem, this study introduces the Sinkhorn-Knopp algorithm for approximate fast solutions to avoid the heavy computation burden. As an independent module, the proposed method can be combined with other semi-supervised learning tricks such as consistency regularization for complete semi-supervised learning. Finally, this study conducts experiments on four classic public image classification datasets of CIFAR-10, SVHN, MNIST, and FashionMNIST to verify the effectiveness of the proposed method. The experimental results show that compared with the state-of-the-art semi-supervised learning methods, this method yields better performance, especially under fewer labeled data.

Key words:semi-supervised learning;pseudo-labeling;optimal transport;image classification;deep learning

Get Citation

翟德明,沈斯娴,周雄,江俊君,刘贤明,季向阳.基于最优传输理论的深度半监督学习伪标签生成算法.软件学报,2024,35(11):5196-5209

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:March 03,2023
Revised:May 29,2023
Adopted:
Online: April 03,2024
Published: November 06,2024

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History