Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning From Very-High-Resolution Satellite Images - Details

author：

Zhang, S. (Zhang, S..) ^[1] | Li, M. (Li, M..) ^[2] | Zhao, W. (Zhao, W..) ^[3] | Wang, X. (Wang, X..) ^[4] | Wu, Q. (Wu, Q..) ^[5]

Indexed by：

Scopus

Abstract：

Building　type　information　indicates　the　functional　properties　of　buildings　and　plays　a　crucial　role　in　smart　city　development　and　urban　socio-economic　activities.　Existing　methods　for　classifying　building　types　often　face　challenges　in　accurately　distinguishing　buildings　between　types　while　maintaining　well-delineated　boundaries,　especially　in　complex　urban　environments.　This　study　introduces　a　novel　framework,　i.e.,　CNN-Transformer　cross-attention　feature　fusion　network　(CTCFNet),　for　building　type　classification　from　very-high-resolution　remote　sensing　images.　CTCFNet　integrates　CNNs　and　Transformers　using　an　interactive　cross-encoder　fusion　(ICEF)　module　that　enhances　semantic　feature　learning　and　improves　classification　accuracy　in　complex　scenarios.　We　develop　an　adaptive　collaboration　optimization　(ACO)　module　that　applies　human　visual　attention　mechanisms　to　enhance　the　feature　representation　of　building　types　and　boundaries　simultaneously.　To　address　the　scarcity　of　datasets　in　building　type　classification,　we　create　two　new　datasets:　the　urban　building　type　(UBT)　dataset　and　the　town　building　type　(TBT)　dataset,　for　model　evaluation.　Extensive　experiments　on　these　datasets　demonstrate　that　CTCFNet　outperforms　popular　CNNs,　Transformers,　and　dual-encoder　methods　in　identifying　building　types　across　various　regions,　achieving　the　highest　MIoU　of　78.20%　and　77.11%,　F1　scores　of　86.83%　and　88.22%,　and　OA　of　95.07%　and　95.73%　on　the　UBT　and　TBT　datasets,　respectively.　We　conclude　that　CTCFNet　effectively　addresses　the　challenges　of　high　interclass　similarity　and　intraclass　inconsistency　in　complex　scenes,　yielding　results　with　well-delineated　building　boundaries　and　accurate　building　types.　The　codes　and　datasets　in　this　article　are　accessible　at　https://github.com/zsfaff/CTCFNet.　　©　2008-2012　IEEE.

Keyword：

Building type classification CNN- transformer networks cross-encoder feature interaction very-high-resolution remote sensing

Community：

[ 1 ] [Zhang S.]Fuzhou University, Key Lab of Spatial Data Mining Info. Sharing of Min. of Educ., Academy of Digital China (Fujian), China
[ 2 ] [Li M.]Fuzhou University, Key Lab of Spatial Data Mining Info. Sharing of Min. of Educ., Academy of Digital China (Fujian), China
[ 3 ] [Zhao W.]Hong Kong University of Science and Technology (Guangzhou), Urban Governance and Design Thrust, Society Hub, China
[ 4 ] [Wang X.]Fuzhou University, Key Lab of Spatial Data Mining Info. Sharing of Min. of Educ., Academy of Digital China (Fujian), China
[ 5 ] [Wu Q.]Fuzhou University, Key Lab of Spatial Data Mining Info. Sharing of Min. of Educ., Academy of Digital China (Fujian), China

Reprint 's Address：

Email：

Show more details

Related Keywords：

Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning From Very High Resolution Satellite Images
2025，IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING
Mapping Tea Plantations from VHR Images Using OBIA and Convolutional Neural Networks
2020，REMOTE SENSING
Integrating Spatial Details With Long-Range Contexts for Semantic Segmentation of Very High-Resolution Remote-Sensing Images
2023，IEEE GEOSCIENCE AND REMOTE SENSING LETTERS
Extracting tobacco planting areas using LSTM from time series Sentinel-1 SAR data
2021，9th International Conference on Agro-Geoinformatics, Agro-Geoinformatics 2021
Extraction buildings from very high-resolution images with asymmetric siamese multitask networks and adversarial edge learning
2025，INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION

Source ：

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

ISSN： 1939-1404

Year： 2024

4 . 7 0 0

JCR@2023

CAS Journal Grade：3

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to