SkeletonNet: Mining Deep Part Features for 3-D Action Recognition

Ke, Q.; An, Senjian; Bennamoun, M.; Sohel, F.; Boussaid, F.

doi:10.1109/LSP.2017.2690339

Access Status

Fulltext not available

Authors

Ke, Q.

An, Senjian

Bennamoun, M.

Sohel, F.

Boussaid, F.

Date

2017

Type

Journal Article

Metadata

Show full item record

Citation

Ke, Q. and An, S. and Bennamoun, M. and Sohel, F. and Boussaid, F. 2017. SkeletonNet: Mining Deep Part Features for 3-D Action Recognition. IEEE Signal Processing Letters. 24 (6): pp. 731-735.

Source Title

IEEE Signal Processing Letters

DOI

10.1109/LSP.2017.2690339

ISSN

1070-9908

School

School of Electrical Engineering, Computing and Mathematical Science (EECMS)

URI

http://hdl.handle.net/20.500.11937/69968

Collection

Curtin Research Publications

Abstract

This letter presents SkeletonNet, a deep learning framework for skeleton-based 3-D action recognition. Given a skeleton sequence, the spatial structure of the skeleton joints in each frame and the temporal information between multiple frames are two important factors for action recognition. We first extract body-part-based features from each frame of the skeleton sequence. Compared to the original coordinates of the skeleton joints, the proposed features are translation, rotation, and scale invariant. To learn robust temporal information, instead of treating the features of all frames as a time series, we transform the features into images and feed them to the proposed deep learning network, which contains two parts: one to extract general features from the input images, while the other to generate a discriminative and compact representation for action recognition. The proposed method is tested on the SBU kinect interaction dataset, the CMU dataset, and the large-scale NTU RGB+D dataset and achieves state-of-the-art performance.