Action recognition using spatial-optical data organization and sequential learning framework

doi:10.1016/j.neucom.2018.06.071

CORC > 西安光学精密机械研究所 > 中国科学院西安光学精密机械研究所 > 光学影像学习与分析中心

	Action recognition using spatial-optical data organization and sequential learning framework
	Yuan, Yuan 1; Zhao, Yang 1,2; Wang, Qi 3,4
刊名	NEUROCOMPUTING
	2018-11-13
卷号	315 页码:221-233
关键词	Action Recognition Spatiotemporal Feature Deep Learning Sequential Learning Framework
ISSN号	0925-2312;1872-8286
DOI	10.1016/j.neucom.2018.06.071
产权排序	1
英文摘要	Recognizing human actions in videos is a challenging problem owning to complex motion appearance, various backgrounds and semantic gap between low-level features and high-level semantics. Existing methods have scored some achievements and many new thoughts have been proposed for action recognition. They focus on designing a robust feature description and training an elaborate learning model, and many of them can benefit from a two-stream network with a stack of RGB frames and optical flow frames. However, these features for human action representation are struggling with the limited feature representation as RGB videos are confused by static appearance redundancy and optical flow videos cannot represent the detailed appearance. To solve these problems, we propose an efficient algorithm based on the spatial-optical data organization and the sequential learning framework. There are two contributions of our method: a novel data organization based on hierarchical weighting segmentation and optical flow for video representation, and a lightweight deep learning model based on the Convolutional 3D (C3D) network and the Recurrent Neural Network (RNN) for complicated action recognition. The new data organization aggregates the merits of motion appearance, movement trajectories and optical flow in a creative way to highlight the meaningful information. And the proposed lightweight model has an insight into patterns and semantics of sequential data by low-level spatiotemporal feature extraction and high-level information mining. The proposed method is evaluated on the state-of-the-art dataset and the results demonstrate that our method have a good performance for complex human action recognition. (c) 2018 Elsevier B.V. All rights reserved.
语种	英语
出版者	ELSEVIER SCIENCE BV
WOS记录号	WOS:000445934400022
内容类型	期刊论文
源URL	[http://ir.opt.ac.cn/handle/181661/30659]
专题	西安光学精密机械研究所_光学影像学习与分析中心
通讯作者	Wang, Qi
作者单位	1.Chinese Acad Sci, Ctr Opt IMagery Anal & Learning OPTIMAL, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China 2.Univ Chinese Acad Sci, Beijing 049, Peoples R China 3.Northwestern Polytech Univ, Sch Comp Sci, Ctr Opt IMagery Anal & Learning OPTIMAL, Xian 710072, Shaanxi, Peoples R China 4.Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Shaanxi, Peoples R China
推荐引用方式 GB/T 7714	Yuan, Yuan,Zhao, Yang,Wang, Qi. Action recognition using spatial-optical data organization and sequential learning framework[J]. NEUROCOMPUTING,2018,315:221-233.
APA	Yuan, Yuan,Zhao, Yang,&Wang, Qi.(2018).Action recognition using spatial-optical data organization and sequential learning framework.NEUROCOMPUTING,315,221-233.
MLA	Yuan, Yuan,et al."Action recognition using spatial-optical data organization and sequential learning framework".NEUROCOMPUTING 315(2018):221-233.