• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Ke, X. (Ke, X..) [1] | Cai, Y. (Cai, Y..) [2] | Chen, B. (Chen, B..) [3] | Liu, H. (Liu, H..) [4] | Guo, W. (Guo, W..) [5]

Indexed by:

Scopus

Abstract:

Fine-grained visual classification (FGVC) is a highly challenging task that aims to learn subtle differences between visually similar objects. Most existing methods for FGVC rely on deep convolutional neural networks to mine local fine-grained features, which neglect the learning of relationships between global and local semantics. Moreover, the feature encoding stage inevitably constructs complex feature representations, leading to overfitting to specific feature patterns, which is not beneficial for fine-grained visual classification. To address these issues, we propose a Transformer-based FGVC model, called the Multi-Granularity Interaction and Feature Recombination Network(MGIFR-Net), which consists of three modules. Firstly, a self-attention guided localization module is designed to locate and amplify discriminative local regions, enabling the sufficient learning of local detail information. Secondly, to enhance the perception of multi-granularity semantic interaction information, we construct a multi-granularity feature interaction learning module to jointly learn local and global feature representations. Finally, a dynamic feature recombination enhancement method is proposed, which explores diverse feature pattern combinations while retaining invariant features, effectively alleviating the overfitting problem caused by complex feature representations. Our method achieves state-of-the-art performance on four benchmark FGVC datasets (CUB-200-2011, Stanford Cars, FGVC-Aircraft, and NAbirds), and experimental results demonstrate the superiority of our method on different visual classification benchmarks. © 2025 Elsevier Ltd

Keyword:

Feature recombination Fine-grained visual classification Multi-granularity feature interaction Vision transformer

Community:

  • [ 1 ] [Ke X.]Fujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Computer and Data Science, Fuzhou University, Fuzhou, 350116, China
  • [ 2 ] [Ke X.]Engineering Research Center of Big Data Intelligence, Ministry of Education, Fuzhou University, Fuzhou, 350116, China
  • [ 3 ] [Cai Y.]Fujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Computer and Data Science, Fuzhou University, Fuzhou, 350116, China
  • [ 4 ] [Cai Y.]Engineering Research Center of Big Data Intelligence, Ministry of Education, Fuzhou University, Fuzhou, 350116, China
  • [ 5 ] [Chen B.]Fujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Computer and Data Science, Fuzhou University, Fuzhou, 350116, China
  • [ 6 ] [Chen B.]Engineering Research Center of Big Data Intelligence, Ministry of Education, Fuzhou University, Fuzhou, 350116, China
  • [ 7 ] [Liu H.]Fujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Computer and Data Science, Fuzhou University, Fuzhou, 350116, China
  • [ 8 ] [Liu H.]Engineering Research Center of Big Data Intelligence, Ministry of Education, Fuzhou University, Fuzhou, 350116, China
  • [ 9 ] [Guo W.]Fujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Computer and Data Science, Fuzhou University, Fuzhou, 350116, China
  • [ 10 ] [Guo W.]Engineering Research Center of Big Data Intelligence, Ministry of Education, Fuzhou University, Fuzhou, 350116, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Pattern Recognition

ISSN: 0031-3203

Year: 2025

Volume: 166

7 . 5 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 3

Affiliated Colleges:

Online/Total:425/11097500
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1