Cross-Stream Contrastive Learning for Self-Supervised Skeleton-Based Action Recognition

CORC > 自动化研究所 > 中国科学院自动化研究所 > 精密感知与控制研究中心 > 人工智能与机器学习

	Cross-Stream Contrastive Learning for Self-Supervised Skeleton-Based Action Recognition
	Ding Li 1,2; Yongqiang Tang 1; Wensheng Zhang 1,2
刊名	Image and Vision Computing
	2023
期号	135 页码:15-26
英文摘要	Self-supervised skeleton-based action recognition enjoys a rapid growth alongwith the development of contrastive learning. The existing methods rely on imposing invariance to augmentations of 3D skeleton within a single data stream, which merely leverages the easy positive pairs and limits the ability to explore the complicated movement patterns. In this paper, we advocate that the defect of single-stream contrast and the lack of necessary feature transformation are responsible for easy positives, and therefore propose a Cross-Stream Contrastive Learning framework for skeleton-based action Representation learning (CSCLR). Specifically, the proposed CSCLR not only utilizes intra-stream contrast pairs, but introduces inter-stream contrast pairs as hard samples to formulate a better representation learning. Besides, to further exploit the potential of positive pairs and increase the robustness of self-supervised representation learning, we propose a Positive Feature Transformation (PFT) strategy which adopts feature-level manipulation to increase the variance of positive pairs. To validate the effectiveness of our method, we conduct extensive experiments on three benchmark datasets NTURGB + D 60, NTU-RGB + D 120 and PKU-MMD. Experimental results show that our proposed CSCLR exceeds the state-of-the-art methods on a diverse range of evaluation protocols.
语种	英语
内容类型	期刊论文
源URL	[http://ir.ia.ac.cn/handle/173211/52223]
专题	精密感知与控制研究中心_人工智能与机器学习
作者单位	1.Institute of Automation 2.Univerisity of Chinese Academy of Science
推荐引用方式 GB/T 7714	Ding Li,Yongqiang Tang,Wensheng Zhang. Cross-Stream Contrastive Learning for Self-Supervised Skeleton-Based Action Recognition[J]. Image and Vision Computing,2023(135):15-26.
APA	Ding Li,Yongqiang Tang,&Wensheng Zhang.(2023).Cross-Stream Contrastive Learning for Self-Supervised Skeleton-Based Action Recognition.Image and Vision Computing(135),15-26.
MLA	Ding Li,et al."Cross-Stream Contrastive Learning for Self-Supervised Skeleton-Based Action Recognition".Image and Vision Computing .135(2023):15-26.