• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Zhang, S. (Zhang, S..) [1] | Li, M. (Li, M..) [2] | Zhao, W. (Zhao, W..) [3] | Wang, X. (Wang, X..) [4] | Wu, Q. (Wu, Q..) [5]

Indexed by:

Scopus

Abstract:

Building type information indicates the functional properties of buildings and plays a crucial role in smart city development and urban socio-economic activities. Existing methods for classifying building types often face challenges in accurately distinguishing buildings between types while maintaining well-delineated boundaries, especially in complex urban environments. This study introduces a novel framework, i.e., CNN-Transformer cross-attention feature fusion network (CTCFNet), for building type classification from very-high-resolution remote sensing images. CTCFNet integrates CNNs and Transformers using an interactive cross-encoder fusion (ICEF) module that enhances semantic feature learning and improves classification accuracy in complex scenarios. We develop an adaptive collaboration optimization (ACO) module that applies human visual attention mechanisms to enhance the feature representation of building types and boundaries simultaneously. To address the scarcity of datasets in building type classification, we create two new datasets: the urban building type (UBT) dataset and the town building type (TBT) dataset, for model evaluation. Extensive experiments on these datasets demonstrate that CTCFNet outperforms popular CNNs, Transformers, and dual-encoder methods in identifying building types across various regions, achieving the highest MIoU of 78.20% and 77.11%, F1 scores of 86.83% and 88.22%, and OA of 95.07% and 95.73% on the UBT and TBT datasets, respectively. We conclude that CTCFNet effectively addresses the challenges of high interclass similarity and intraclass inconsistency in complex scenes, yielding results with well-delineated building boundaries and accurate building types. The codes and datasets in this article are accessible at https://github.com/zsfaff/CTCFNet.  © 2008-2012 IEEE.

Keyword:

Building type classification CNN- transformer networks cross-encoder feature interaction very-high-resolution remote sensing

Community:

  • [ 1 ] [Zhang S.]Fuzhou University, Key Lab of Spatial Data Mining Info. Sharing of Min. of Educ., Academy of Digital China (Fujian), China
  • [ 2 ] [Li M.]Fuzhou University, Key Lab of Spatial Data Mining Info. Sharing of Min. of Educ., Academy of Digital China (Fujian), China
  • [ 3 ] [Zhao W.]Hong Kong University of Science and Technology (Guangzhou), Urban Governance and Design Thrust, Society Hub, China
  • [ 4 ] [Wang X.]Fuzhou University, Key Lab of Spatial Data Mining Info. Sharing of Min. of Educ., Academy of Digital China (Fujian), China
  • [ 5 ] [Wu Q.]Fuzhou University, Key Lab of Spatial Data Mining Info. Sharing of Min. of Educ., Academy of Digital China (Fujian), China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Source :

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

ISSN: 1939-1404

Year: 2024

4 . 7 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 3

Affiliated Colleges:

Online/Total:261/9556118
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1