Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition

Chen, Tailin; Zhou, Desen; Wang, Jian; Wang, Shidong; Guan, Yu; He, Xuming; Ding, Errui

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.04536 (cs)

[Submitted on 10 Aug 2021]

Title:Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition

Authors:Tailin Chen, Desen Zhou, Jian Wang, Shidong Wang, Yu Guan, Xuming He, Errui Ding

View PDF

Abstract:The task of skeleton-based action recognition remains a core challenge in human-centred scene understanding due to the multiple granularities and large variation in human motion. Existing approaches typically employ a single neural representation for different motion patterns, which has difficulty in capturing fine-grained action classes given limited training data. To address the aforementioned problems, we propose a novel multi-granular spatio-temporal graph network for skeleton-based action classification that jointly models the coarse- and fine-grained skeleton motion patterns. To this end, we develop a dual-head graph network consisting of two interleaved branches, which enables us to extract features at two spatio-temporal resolutions in an effective and efficient manner. Moreover, our network utilises a cross-head communication strategy to mutually enhance the representations of both heads. We conducted extensive experiments on three large-scale datasets, namely NTU RGB+D 60, NTU RGB+D 120, and Kinetics-Skeleton, and achieves the state-of-the-art performance on all the benchmarks, which validates the effectiveness of our method.

Comments:	Accepted by ACM MM'21
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2108.04536 [cs.CV]
	(or arXiv:2108.04536v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.04536

Submission history

From: Tailin Chen [view email]
[v1] Tue, 10 Aug 2021 09:25:07 UTC (12,266 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2021-08

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jian Wang
Yu Guan
Xuming He
Errui Ding

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators