We propose a deep action recognition model based on a convolutional architecture with multiplicative interactions. The model generates feature representations that are sensitive to both the temporal dynamics and the static appearance of a video. We show that both kinds of information arise from the intrinsic properties of products of adjacent frames, which distinguishes our approach from recent convolutional methods. Our model also remedies the difficulty energy-based methods have in scaling to more realistic datasets with larger images, because convolution is dramatically more efficient than dense matrix multiplication in terms of both memory requirements and statistical efficiency. Experimental results show that the model outperforms baseline methods on the UCF101 dataset and achieves competitive performance on the KTH dataset. They also suggest that, to model actions reliably, static appearance should be captured in addition to motion information. The research described in this paper is supported by the Science Research Foundation of the Hunan Provincial Education Department under grant number 12B023.
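The following is a minimal sketch, not the authors' implementation, of the core idea stated above: multiplicative interactions between adjacent frames computed convolutionally rather than with dense matrix multiplication. All names (`conv_multiplicative_features`, `filters_x`, `filters_y`, `n_factors`) and the random toy data are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def conv_multiplicative_features(frame_t, frame_t1, filters_x, filters_y):
    """Factor responses for a pair of adjacent frames (illustrative sketch).

    frame_t, frame_t1 : (1, 1, H, W) grayscale frames at times t and t+1
    filters_x, filters_y : (n_factors, 1, k, k) convolutional filter banks
    Returns element-wise products of the two filtered frames; the products
    encode the transformation (motion) between the frames, while the
    individual filter responses retain static appearance information.
    """
    pad = filters_x.shape[-1] // 2
    fx = F.conv2d(frame_t, filters_x, padding=pad)
    fy = F.conv2d(frame_t1, filters_y, padding=pad)
    return fx * fy  # multiplicative interaction, one map per factor

# Toy usage on random frames
torch.manual_seed(0)
n_factors, k = 16, 9
filters_x = 0.01 * torch.randn(n_factors, 1, k, k)
filters_y = 0.01 * torch.randn(n_factors, 1, k, k)
frame_t = torch.rand(1, 1, 64, 64)
frame_t1 = torch.rand(1, 1, 64, 64)

products = conv_multiplicative_features(frame_t, frame_t1, filters_x, filters_y)
print(products.shape)  # torch.Size([1, 16, 64, 64])
```

Because the filters are applied convolutionally, memory grows with the filter size rather than the full image dimension, which is the scaling advantage over dense energy-based formulations noted in the abstract.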