• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Zhao, Zhifeng (Zhao, Zhifeng.) [1] | Chen, Guodong (Chen, Guodong.) [2] (Scholars:陈国栋) | Lin, Yuxiang (Lin, Yuxiang.) [3]

Indexed by:

EI Scopus SCIE

Abstract:

Recent years have seen the resurgence of self-supervised learning in visual representation thanks to Contrastive Learning and Masked Image Modeling. The existing self-supervised methods for skeleton-based action recognition typically learn feature invariance of the data only through contrastive learning. In this paper, we propose a contrast learning method combined with a temporal-masking mechanism of skeleton sequences to encourage the network able to learn action representations other than feature invariance, e.g., occlusion invariance, by implicitly reconstructing the masked sequences. However, the direct masking mechanism destroys the feature consistency of the samples, for which we propose Supervised Positive Sample Mining and self-attention module for embeddings to improve the generalization of the model. First of all, supervised contrastive learning can improve the robustness of models using prior knowledge of labels. Secondly, to avoid excessive masking mechanism that hinders the model from learning the correct occlusion invariance, a self-attention mechanism is necessary, which further discriminate the distance for each action class in the feature space. The results of various experimental protocols on NTU 60, NTU 120, PKU-MMD datasets demonstrate the advantages of our method and that our method outperforms the existing state-of-the-art contrastive methods. Code is available at https://github.com/ZZFCV/SASOiCLR.

Keyword:

Action recognition Contrastive learning Masking mechanism Supervised positive mining

Community:

  • [ 1 ] [Zhao, Zhifeng]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350116, Peoples R China
  • [ 2 ] [Chen, Guodong]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350116, Peoples R China
  • [ 3 ] [Lin, Yuxiang]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350116, Peoples R China

Reprint 's Address:

  • [Chen, Guodong]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350116, Peoples R China;;

Show more details

Related Keywords:

Related Article:

Source :

SIGNAL IMAGE AND VIDEO PROCESSING

ISSN: 1863-1703

Year: 2023

Issue: 5

Volume: 17

Page: 2267-2275

2 . 0

JCR@2023

2 . 0 0 0

JCR@2023

ESI Discipline: ENGINEERING;

ESI HC Threshold:35

JCR Journal Grade:3

CAS Journal Grade:4

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Online/Total:93/9994799
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1