Abstract: With the prevalence of depth cameras, video data in different modalities have become increasingly common, and human action recognition based on multi-modal data has attracted growing attention. Data of different modalities describe human actions from distinct perspectives, so how to effectively exploit their complementary information is a key topic in this area. In this study, we propose a modality-compensation-based method for action recognition. Taking RGB/optical flow as the source modality and skeletons as the auxiliary modality, we aim to compensate feature learning from the source modality by exploring the common space shared by the source and auxiliary modalities. The proposed model builds on a deep convolutional neural network (CNN) and a long short-term memory (LSTM) network to extract spatial and temporal features, respectively. With the help of residual learning, a modality adaptation block is proposed to align the distributions of different modalities and achieve modality compensation. To handle different degrees of alignment between source and auxiliary modal data, we propose hierarchical modality adaptation schemes. The proposed model requires auxiliary modal data only during training and improves recognition performance with source modal data alone at test time, which broadens its application scenarios. Experimental results show that the proposed method outperforms other state-of-the-art approaches.
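To make the core idea concrete, the following is a minimal, hypothetical sketch (in PyTorch) of a residual modality adaptation block and a training-only alignment loss; the class names, dimensions, and the squared-error alignment term are illustrative assumptions, not the authors' exact architecture or objective.

```python
import torch
import torch.nn as nn

class ModalityAdaptationBlock(nn.Module):
    """Residual block that maps source-modality features (e.g., RGB/optical flow)
    toward the auxiliary (skeleton) feature space. Hypothetical sketch only."""
    def __init__(self, feat_dim: int, hidden_dim: int = 512):
        super().__init__()
        # Residual branch: learns the correction added to the source features.
        self.residual = nn.Sequential(
            nn.Linear(feat_dim, hidden_dim),
            nn.ReLU(inplace=True),
            nn.Linear(hidden_dim, feat_dim),
        )

    def forward(self, source_feat: torch.Tensor) -> torch.Tensor:
        # Identity shortcut plus learned residual: adapted = x + F(x).
        return source_feat + self.residual(source_feat)


def alignment_loss(adapted_feat: torch.Tensor, skeleton_feat: torch.Tensor) -> torch.Tensor:
    """Distance between adapted source features and auxiliary skeleton features
    in the shared space; used only at training time (assumed squared error)."""
    return torch.mean((adapted_feat - skeleton_feat) ** 2)


if __name__ == "__main__":
    block = ModalityAdaptationBlock(feat_dim=256)
    rgb_feat = torch.randn(8, 256)    # features from the RGB/flow (source) stream
    skel_feat = torch.randn(8, 256)   # features from the skeleton (auxiliary) stream
    adapted = block(rgb_feat)
    # Training objective would combine a classification loss with this alignment
    # term; at test time the skeleton stream and the alignment term are dropped.
    loss = alignment_loss(adapted, skel_feat)
    print(adapted.shape, loss.item())
```

Under these assumptions, the auxiliary skeleton features act purely as a training-time target for the adapted source features, which is why inference can proceed with the source modality alone.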