Text-based person search via cross-modal alignment learning - Details

Authors：

Ke, Xiao [1] (Scholars：柯逍) | Liu, Hao [2] | Xu, Peirong [3] | Lin, Xinru [4] | Guo, Wenzhong [5] (Scholars：郭文忠)

Indexed by：

EI Scopus SCIE

Abstract：

Text-based person search aims to use text descriptions to search for corresponding person images. However, due to the obvious pattern differences in image and text modalities, it is still a challenging problem to align the two modalities. Most existing approaches only consider semantic alignment within a global context or partial parts, lacking consideration of how to match image and text in terms of differences in model information. Therefore, in this paper, we propose an efficient Modality-Aligned Person Search network (MAPS) to address this problem. First, we suppress image-specific information by image feature style normalization to achieve modality knowledge alignment and reduce information differences between text and image. Second, we design a multi-granularity modal feature fusion and optimization method to enrich the modal features. To address the problem of useless and redundant information in the multi-granularity fused features, we propose a Multigranularity Feature Self-optimization Module (MFSM) to adaptively adjust the corresponding contributions of different granularities in the fused features of the two modalities. Finally, to address the problem of information inconsistency in the training and inference stages, we propose a Cross-instance Feature Alignment (CFA) to help the network enhance category-level generalization ability and improve retrieval performance. Extensive experiments demonstrate that our MAPS achieves state-of-the-art performance on all text-based person search datasets, and significantly outperforms other existing methods.

Keywords：

CNN; Cross-modality; Image-text retrieval; Person re-identification

Affiliations：

[ 1 ] [Ke, Xiao]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
[ 2 ] [Liu, Hao]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
[ 3 ] [Xu, Peirong]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
[ 4 ] [Lin, Xinru]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
[ 5 ] [Guo, Wenzhong]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
[ 6 ] [Ke, Xiao]Minist Educ, Engn Res Ctr Big Data Intelligence, Fuzhou 350116, Peoples R China
[ 7 ] [Xu, Peirong]Minist Educ, Engn Res Ctr Big Data Intelligence, Fuzhou 350116, Peoples R China
[ 8 ] [Lin, Xinru]Minist Educ, Engn Res Ctr Big Data Intelligence, Fuzhou 350116, Peoples R China
[ 9 ] [Guo, Wenzhong]Minist Educ, Engn Res Ctr Big Data Intelligence, Fuzhou 350116, Peoples R China

Reprint Address：

刘浩
[Liu, Hao]Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China

Email：

nathan40@163.com

Version：

Text-based person search via cross-modal alignment learning
2024，Pattern Recognition

Source ：

PATTERN RECOGNITION

ISSN： 0031-3203

Year： 2024

Volume： 152

Impact Factor： 7.500 (JCR@2023)

CAS Journal Grade：1

ESI Highly Cited Papers on the List： 0

Affiliated Colleges：

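College of Computer and Data Science, College of Software (data not explicitly attributed to a unit within this college/department)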
