• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Lin, Zhenghong (Lin, Zhenghong.) [1] | Wu, Yuze (Wu, Yuze.) [2] | Chen, Jiawei (Chen, Jiawei.) [3] | Wang, Shiping (Wang, Shiping.) [4]

Abstract:

Transformers designed for natural language processing have originally been explored for computer vision in recent research. Various Vision Transformers (ViTs) play an increasingly important role in the field of image tasks such as computer vision, multimodal fusion and multimedia analysis. However, to obtain promising performance, most existing ViTs usually rely on artificially filtered high-quality images, which may suffer from inherent noise risk. Generally, such well-constructed images are not always available in every situation. To this end, we propose a Robust ViT (RViT) to focus on the relevant and robust representation learning for image classification tasks. Specifically, we first develop a novel Denoising VTUnet module, where we conceptualize the nonrobust noise as the uncertainty under the variational conditions. Furthermore, we design a fusion transformer backbone with a tailored fusion attention mechanism to perform image classification based on the extracted robust representations effectively. To demonstrate the superiority of our model, the compared experiments are conducted on several popular datasets. Benefiting from the sequence regularity of the Transformer and captured robust feature, the proposed method exceeds compared Transformer-based models with superior performance in visual tasks.

Keyword:

fusion attention Image classification robust representation learning variational inference vision transformer

Community:

  • [ 1 ] [Lin, Zhenghong]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China
  • [ 2 ] [Wu, Yuze]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China
  • [ 3 ] [Chen, Jiawei]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China
  • [ 4 ] [Wang, Shiping]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China
  • [ 5 ] [Wang, Shiping]Fujian Prov Univ, Key Lab Intelligent Metro, Fuzhou 350108, Peoples R China

Reprint 's Address:

  • [Wang, Shiping]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China;;[Wang, Shiping]Fujian Prov Univ, Key Lab Intelligent Metro, Fuzhou 350108, Peoples R China

Email:

Show more details

Related Keywords:

Source :

GUIDANCE NAVIGATION AND CONTROL

ISSN: 2737-4807

Year: 2024

Issue: 03

Volume: 04

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Online/Total:136/10026245
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1