Temporal Structure Learning with Grenander Inference for Action Recognition

doi:10.13328/j.cnki.jos.006202

微信服务号

微信订阅号

2025-5-5- 21

Home > Archive>Volume 33, Issue 5, 2022 >1865-1879. DOI:10.13328/j.cnki.jos.006202

PDF HTML XML Export Cite reminder

Temporal Structure Learning with Grenander Inference for Action Recognition
DOI:
                        10.13328/j.cnki.jos.006202
                    
Author:
                        WU Ke-WeiWU Ke-Wei
Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei 230601, China;Anhui Province Key Laboratory of Affective Computing & Advanced Intelligent Machine (Hefei University of Technology), Hefei 230601, China;School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
GAO TaoGAO Tao
Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei 230601, China;Anhui Province Key Laboratory of Affective Computing & Advanced Intelligent Machine (Hefei University of Technology), Hefei 230601, China;School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
XIE ZhaoXIE Zhao
Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei 230601, China;Anhui Province Key Laboratory of Affective Computing & Advanced Intelligent Machine (Hefei University of Technology), Hefei 230601, China;School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
GUO Wen-BinGUO Wen-Bin
Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei 230601, China;Anhui Province Key Laboratory of Affective Computing & Advanced Intelligent Machine (Hefei University of Technology), Hefei 230601, China;School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:TP181
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Action recognition is one crucial and very challenging task in computer vision. Most of the existing methods use the temporal structure of the whole video and ignore its temporal noise and ambiguity feature, which leads to failure in action recognition. To address this problem, a novel temporal graph model is proposed with Grenander inference, namely, TGM-GI. First, a 3D CNN+ LSTM module is constructed to learn deep features, in which 3D CNN extracts the dynamic feature of video clips and LSTM optimizes the time dependence between features of two clips. Second, a temporal graph model is constructed with these deep features which use the generator space of Grenander theory. The original temporal pattern is modified using two operations, in which combination operation can remove redundancy clips like slow motion and denoise operation can remove low-frequency clips like abnormal motion. Third, an incremental Viterbi algorithm is proposed for temporal pattern learning with Grenander inference, in which a Grenander measure is designed with both feature bond and semantic bond. Finally, the dynamic time warping is used to match the Grenander temporal pattern of test video with the Grenander temporal pattern of the training set and the label of the test video is predicted. The experimental results show that the proposed TGM-GI outperforms the state-of-the-art methods on two acknowledge databases. The TGM-GI is superior to the baseline method of 3D CNN-LSTM, and its accuracy improves 6.41% on the UCF101 dataset and 5.67% on the Olympic Sports dataset respectively.

Key words:action recognition;temporal pattern;Grenander’s temporal graph model;deep model;dynamic time warping

Get Citation

吴克伟,高涛,谢昭,郭文斌. Grenander时间结构学习与推理优化下的行为识别.软件学报,2022,33(5):1865-1879

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:May 08,2020
Revised:June 27,2020
Adopted:
Online: May 09,2022
Published: May 06,2022

You are the first2042819Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History