Home>Results

  • Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

[期刊论文]

Multimodal Collaboration Networks for Geospatial Vehicle Detection in Dense, Occluded, and Large-Scale Events

Share
Edit Delete 报错

author:

Wu, Xin (Wu, Xin.) [1] | Huang, Zhanchao (Huang, Zhanchao.) [2] (Scholars:黄展超) | Wang, Li (Wang, Li.) [3] | Unfold

Indexed by:

EI Scopus SCIE

Abstract:

In large-scale disaster events, the planning of optimal rescue routes depends on the object detection ability at the disaster scene, with one of the main challenges being the presence of dense and occluded objects. Existing methods, which are typically based on the RGB modality, struggle to distinguish targets with similar colors and textures in crowded environments and are unable to identify obscured objects. To this end, we first construct two multimodal dense and occlusion vehicle detection datasets for large-scale events, utilizing RGB and height map modalities. Based on these datasets, we propose a multimodal collaboration network (MuDet) for dense and occluded vehicle detection, MuDet for short. MuDet hierarchically enhances the completeness of discriminable information within and across modalities and differentiates between simple and complex samples. MuDet includes three main modules: Unimodal Feature Hierarchical Enhancement (Uni-Enh), Multimodal Cross Learning (Mul-Lea), and Hard-easy Discriminative (He-Dis) Pattern. Uni-Enh and Mul-Lea enhance the features within each modality and facilitate the cross-integration of features from two heterogeneous modalities. He-Dis effectively separates densely occluded vehicle targets with significant intra-class differences and minimal inter-class differences by defining and thresholding confidence values, thereby suppressing the complex background. Experimental results on two re-labeled multimodal benchmark datasets, the 4K Stereo Aerial Imagery of a Large Camping Site (4K-SAI-LCS) dataset, and the ISPRS Potsdam dataset, demonstrate the robustness and generalization of the MuDet.

Keyword:

Convolutional neural networks Convolutional neural networks (CNNs) dense and occluded Disasters Feature extraction hard-easy balanced attention large-scale disaster events multimodal vehicle detection (MVD) Object detection Remote sensing remote Sensing (RS) Streaming media Vehicle detection

Community:

  • [ 1 ] [Wu, Xin]Beijing Univ Posts & Telecommun, Sch Comp Sci, Nat Pilot Software Engn Sch, Beijing 100876, Peoples R China
  • [ 2 ] [Wang, Li]Beijing Univ Posts & Telecommun, Sch Comp Sci, Nat Pilot Software Engn Sch, Beijing 100876, Peoples R China
  • [ 3 ] [Huang, Zhanchao]Fuzhou Univ, Acad Digital China, Key Lab Spatial Data Min & Informat Sharing, Minist Educ, Fuzhou 350108, Peoples R China
  • [ 4 ] [Huang, Zhanchao]Fuzhou Univ, Natl & Local Joint Engn Res Ctr Satellite Geospati, Fuzhou 350108, Peoples R China
  • [ 5 ] [Chanussot, Jocelyn]Univ Grenoble Alpes, GIPSA Lab, CNRS, Grenoble INP, F-38000 Grenoble, France
  • [ 6 ] [Chanussot, Jocelyn]Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
  • [ 7 ] [Tian, Jiaojiao]German Aerosp Ctr, Remote Sensing Technol Inst, D-82205 Wessling, Germany

Reprint 's Address:

  • 黄展超

    [Huang, Zhanchao]Fuzhou Univ, Acad Digital China, Key Lab Spatial Data Min & Informat Sharing, Minist Educ, Fuzhou 350108, Peoples R China;;[Huang, Zhanchao]Fuzhou Univ, Natl & Local Joint Engn Res Ctr Satellite Geospati, Fuzhou 350108, Peoples R China

Show more details

Source :

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

ISSN: 0196-2892

Year: 2024

Volume: 62

7 . 5 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

30 Days PV: 1

Online/Total:91/10132012
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1