Temporal Structure Learning with Grenander Inference for Action Recognition
Author:
  • WU Ke-Wei

    Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei 230601, China;Anhui Province Key Laboratory of Affective Computing & Advanced Intelligent Machine (Hefei University of Technology), Hefei 230601, China;School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
  • GAO Tao

    Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei 230601, China;Anhui Province Key Laboratory of Affective Computing & Advanced Intelligent Machine (Hefei University of Technology), Hefei 230601, China;School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
  • XIE Zhao

    Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei 230601, China;Anhui Province Key Laboratory of Affective Computing & Advanced Intelligent Machine (Hefei University of Technology), Hefei 230601, China;School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
  • GUO Wen-Bin

    Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei 230601, China;Anhui Province Key Laboratory of Affective Computing & Advanced Intelligent Machine (Hefei University of Technology), Hefei 230601, China;School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
CLC Number: TP181

    Abstract:

    Action recognition is a crucial and challenging task in computer vision. Most existing methods model the temporal structure of the whole video and ignore the temporal noise and ambiguity it contains, which leads to recognition failures. To address this problem, a novel temporal graph model with Grenander inference, named TGM-GI, is proposed. First, a 3D CNN+LSTM module is constructed to learn deep features: the 3D CNN extracts dynamic features from video clips, and the LSTM models the temporal dependence between clip features. Second, a temporal graph model is built on these deep features using the generator space of Grenander theory. The original temporal pattern is modified by two operations: the combination operation removes redundant clips such as slow motion, and the denoising operation removes low-frequency clips such as abnormal motion. Third, an incremental Viterbi algorithm is proposed for temporal pattern learning with Grenander inference, in which the Grenander measure is designed with both a feature bond and a semantic bond. Finally, dynamic time warping is used to match the Grenander temporal pattern of a test video against the Grenander temporal patterns of the training set, and the label of the test video is predicted. Experimental results show that the proposed TGM-GI outperforms state-of-the-art methods on two widely used datasets. Compared with the 3D CNN-LSTM baseline, TGM-GI improves accuracy by 6.41% on the UCF101 dataset and by 5.67% on the Olympic Sports dataset.
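The pattern-editing and matching steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each clip has already been reduced to a discrete label, whereas the paper works on deep features and Grenander generators. The function names, the `min_count` threshold, the 0/1 clip cost, and the order in which the two operations are applied are all hypothetical choices made for illustration.

```python
import math
from collections import Counter
from itertools import groupby


def combine(seq):
    """Collapse runs of identical consecutive clip labels (e.g. slow motion)."""
    return [label for label, _ in groupby(seq)]


def denoise(seq, min_count=2):
    """Drop clip labels that occur fewer than min_count times (assumed rule)."""
    counts = Counter(seq)
    return [label for label in seq if counts[label] >= min_count]


def dtw_distance(a, b):
    """Classic dynamic time warping with a 0/1 cost between clip labels."""
    n, m = len(a), len(b)
    d = [[math.inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0.0 if a[i - 1] == b[j - 1] else 1.0
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]


def predict(test_pattern, train_patterns):
    """Return the label of the training pattern closest under DTW."""
    best = min(train_patterns, key=lambda item: dtw_distance(test_pattern, item[0]))
    return best[1]


# Hypothetical clip labels for one test video.
raw = ["stand", "run", "run", "run", "blink", "jump", "jump", "land", "land"]
# Denoise on raw counts first, then merge repeats (an ordering chosen for this sketch).
pattern = combine(denoise(raw))

# Hypothetical training patterns with their action labels.
train = [
    (["run", "jump", "land"], "long_jump"),
    (["run", "throw"], "javelin"),
]
label = predict(pattern, train)
```

In this toy setup the rare labels are dropped, the repeated clips are merged, and the resulting pattern is compared with each training pattern by DTW. A real system would replace the 0/1 cost with a distance between clip features.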

Get Citation

WU Ke-Wei, GAO Tao, XIE Zhao, GUO Wen-Bin. Temporal Structure Learning with Grenander Inference for Action Recognition. Journal of Software, 2022, 33(5): 1865-1879

History
  • Received: May 08, 2020
  • Revised: June 27, 2020
  • Online: May 09, 2022
  • Published: May 06, 2022