Abstract: Fine-grained video categorization is a highly challenging task that aims to discriminate among similar subcategories belonging to the same basic-level category. Given the significant advances in fine-grained image categorization and the high cost of labeling video data, it is natural to adapt knowledge learned from images to videos in an unsupervised manner. However, a clear gap exists when models learned from images are directly applied to recognize fine-grained instances in videos, owing to the domain distinction and modality distinction between images and videos. Therefore, this study proposes the unsupervised discriminative adaptation network (UDAN), which transfers the ability of discriminative localization from images to videos. A progressive pseudo-labeling strategy is adopted to iteratively guide UDAN to approximate the distribution of the target video data. To verify the effectiveness of the proposed UDAN approach, adaptation tasks between images and videos are performed, transferring the knowledge learned from the CUB-200-2011/Cars-196 datasets (images) to the YouTube Birds/YouTube Cars datasets (videos). Experimental results illustrate the advantage of the proposed UDAN approach for unsupervised fine-grained video categorization.
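To make the progressive pseudo-labeling idea mentioned above concrete, the following is a minimal PyTorch-style sketch of one common realization: a source-trained model labels unlabeled target samples, only confident predictions are kept, and the confidence threshold is relaxed over rounds so more target data is included. The model, threshold schedule, and loss here are illustrative assumptions, not the paper's exact UDAN formulation.

```python
# Illustrative sketch of progressive pseudo-labeling for unsupervised adaptation.
# NOTE: the network, confidence schedule, and training loop are assumptions for
# demonstration only; they are not the exact procedure described in the paper.
import torch
import torch.nn.functional as F


def progressive_pseudo_label_adapt(model, source_loader, target_loader,
                                   num_rounds=5, start_conf=0.9, end_conf=0.6,
                                   epochs_per_round=1, lr=1e-4, device="cpu"):
    """Iteratively pseudo-label unlabeled target (video) samples with the
    source-trained (image) model and fine-tune on them, lowering the
    confidence threshold each round to cover more of the target data."""
    model = model.to(device)
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)

    for r in range(num_rounds):
        # Linearly relax the confidence threshold across rounds.
        conf_thresh = start_conf - (start_conf - end_conf) * r / max(num_rounds - 1, 1)

        # Step 1: assign pseudo labels to confident target samples.
        pseudo_inputs, pseudo_labels = [], []
        model.eval()
        with torch.no_grad():
            for x, _ in target_loader:              # target labels are ignored
                probs = F.softmax(model(x.to(device)), dim=1)
                conf, pred = probs.max(dim=1)
                keep = conf >= conf_thresh           # keep confident predictions only
                if keep.any():
                    pseudo_inputs.append(x[keep.cpu()])
                    pseudo_labels.append(pred[keep].cpu())
        if not pseudo_inputs:
            continue
        pseudo_set = torch.utils.data.TensorDataset(
            torch.cat(pseudo_inputs), torch.cat(pseudo_labels))
        pseudo_loader = torch.utils.data.DataLoader(
            pseudo_set, batch_size=32, shuffle=True)

        # Step 2: fine-tune on labeled source data plus pseudo-labeled target data.
        model.train()
        for _ in range(epochs_per_round):
            for loader in (source_loader, pseudo_loader):
                for x, y in loader:
                    optimizer.zero_grad()
                    loss = F.cross_entropy(model(x.to(device)), y.to(device))
                    loss.backward()
                    optimizer.step()
    return model
```

In this sketch, each round's pseudo-labeled set is rebuilt from scratch with the current model, so early mistakes are not frozen in; the gradually lowered threshold is one simple way to let the model approximate the target video distribution step by step.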