Abstract:
Establishing dense correspondences between semantically similar images is a challenging task. Cost aggregation is a crucial step in finding correct dense correspondences; its goal is to optimize the initial correlation map and thereby reduce ambiguity in the correspondences. Current approaches use transformer architectures for cost aggregation, but these lack the local priors needed to adequately capture the local information contained in the correlation map. We propose incorporating peripheral position coding into the transformer to exploit this local information when obtaining the matching set, and call the resulting model the Peripheral Transformer Matcher (PTM). This coding technique partitions the overall receptive field of the self-attention mechanism into distinct peripheral regions, each with its own set of weights. In this way, PTM acquires a specific local prior: the position coding adds an inductive bias to the transformer and makes the initial correlation map less ambiguous. In addition, a local self-attention module is used to enhance the image features and obtain an improved initial correlation map. Comparisons with baselines on public datasets demonstrate the effectiveness of the proposed PTM. © 2023 IEEE.
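The abstract does not give implementation details for peripheral position coding. As a rough sketch under assumed design choices, one way to realize "peripheral regions, each with its own set of weights" is to bucket relative offsets by distance into concentric rings and add one shared learnable bias per ring to the self-attention logits; the function names, the ring parameterization, and the 1-D simplification below are all hypothetical illustrations, not the paper's actual method.

```python
import numpy as np

def peripheral_bias(size, ring_widths, weights):
    # Hypothetical parameterization: relative distance d = |i - j| is
    # assigned to a "peripheral ring" by cumulative ring widths; every
    # position pair in the same ring shares one bias weight.
    edges = np.cumsum(ring_widths)            # outer boundary of each inner ring
    bias = np.zeros((size, size))
    for i in range(size):
        for j in range(size):
            d = abs(i - j)
            ring = np.searchsorted(edges, d, side="right")
            ring = min(ring, len(weights) - 1)  # everything beyond the last edge
            bias[i, j] = weights[ring]
    return bias

def attention_with_peripheral_bias(Q, K, V, bias):
    # Standard scaled dot-product attention with the additive peripheral
    # bias injected into the logits before the softmax.
    logits = Q @ K.T / np.sqrt(Q.shape[-1]) + bias
    p = np.exp(logits - logits.max(axis=-1, keepdims=True))
    p /= p.sum(axis=-1, keepdims=True)
    return p @ V
```

Because the bias depends only on relative distance, nearby pairs can be given larger weights than distant ones, which is one way a local prior could enter an otherwise position-agnostic transformer.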
Year: 2023
Page: 329-334
Language: English
ESI Highly Cited Papers on the List: 0