• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Ke, Xiao (Ke, Xiao.) [1] (Scholars:柯逍) | Liu, Hao (Liu, Hao.) [2] | Xu, Peirong (Xu, Peirong.) [3] | Lin, Xinru (Lin, Xinru.) [4] | Guo, Wenzhong (Guo, Wenzhong.) [5] (Scholars:郭文忠)

Indexed by:

EI Scopus SCIE

Abstract:

Text -based person search aims to use text descriptions to search for corresponding person images. However, due to the obvious pattern differences in image and text modalities, it is still a challenging problem to align the two modalities. Most existing approaches only consider semantic alignment within a global context or partial parts, lacking consideration of how to match image and text in terms of differences in model information. Therefore, in this paper, we propose an efficient Modality -Aligned Person Search network (MAPS) to address this problem. First, we suppress image -specific information by image feature style normalization to achieve modality knowledge alignment and reduce information differences between text and image. Second, we design a multi -granularity modal feature fusion and optimization method to enrich the modal features. To address the problem of useless and redundant information in the multi -granularity fused features, we propose a Multigranularity Feature Self -optimization Module (MFSM) to adaptively adjust the corresponding contributions of different granularities in the fused features of the two modalities. Finally, to address the problem of information inconsistency in the training and inference stages, we propose a Cross -instance Feature Alignment (CFA) to help the network enhance category -level generalization ability and improve retrieval performance. Extensive experiments demonstrate that our MAPS achieves state-of-the-art performance on all text -based person search datasets, and significantly outperforms other existing methods.

Keyword:

CNN Cross-modality Image-text retrieval Person re-identification

Community:

  • [ 1 ] [Ke, Xiao]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
  • [ 2 ] [Liu, Hao]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
  • [ 3 ] [Xu, Peirong]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
  • [ 4 ] [Lin, Xinru]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
  • [ 5 ] [Guo, Wenzhong]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
  • [ 6 ] [Ke, Xiao]Minist Educ, Engn Res Ctr Big Data Intelligence, Fuzhou 350116, Peoples R China
  • [ 7 ] [Xu, Peirong]Minist Educ, Engn Res Ctr Big Data Intelligence, Fuzhou 350116, Peoples R China
  • [ 8 ] [Lin, Xinru]Minist Educ, Engn Res Ctr Big Data Intelligence, Fuzhou 350116, Peoples R China
  • [ 9 ] [Guo, Wenzhong]Minist Educ, Engn Res Ctr Big Data Intelligence, Fuzhou 350116, Peoples R China

Reprint 's Address:

  • 刘浩

    [Liu, Hao]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China

Show more details

Related Keywords:

Source :

PATTERN RECOGNITION

ISSN: 0031-3203

Year: 2024

Volume: 152

7 . 5 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Online/Total:255/10059277
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1