DING Jing, SHU Xiang-Bo, HUANG Peng, YAO Ya-Zhou, SONG Yan
School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
TP183
As population aging becomes more serious, increasing attention is being paid to the safety of elderly people who are at home alone. To provide early warning, alarm, and reporting of dangerous behaviors, several domestic and international research institutions are studying intelligent monitoring of the daily activities of the elderly from a robot's view. To promote the industrialization of these technologies, this work studies how to automatically recognize daily activities of the elderly, such as “drinking water”, “washing hands”, “reading a book”, and “reading a newspaper”. An investigation of daily activity videos of the elderly shows that the semantics of these activities are distinctly fine-grained. For example, “drinking water” and “taking medicine” are highly similar in semantics, and only a small number of video frames accurately reflect their category semantics. To address this problem in elderly behavior recognition, this work proposes a new multimodal multi-granularity graph convolutional network (MM-GCN), which applies graph convolutional networks to four modalities, i.e., the skeleton (“point”), bone (“line”), frame (“frame”), and proposal (“segment”), to model the activities of the elderly and capture the semantics at the four granularities of “point-line-frame-proposal”. Finally, experiments are conducted to validate the activity recognition performance of the proposed method on ETRI-Activity3D (
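DING Jing, SHU Xiang-Bo, HUANG Peng, YAO Ya-Zhou, SONG Yan. Daily activity recognition of the elderly based on multimodal multi-granularity graph convolutional network. Journal of Software (软件学报), 2023, 34(5): 2350-2364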