Abstract:
In this paper, we propose HSENet, a hierarchical semantic-enriched network capable of generating high-quality fused images with robust global semantic consistency and excellent local detail representation. The core innovation of HSENet lies in its hierarchical enrichment of semantic information through semantic gathering, distribution, and injection. Specifically, the network begins by balancing global information exchange via multi-scale feature aggregation and redistribution while dynamically bridging the fusion and segmentation tasks. Following this, a progressive semantic dense injection strategy is introduced, employing dense connections to first inject global semantics into highly consistent infrared features and then propagate the semantic-infrared hybrid features to the visible features. This approach effectively enhances semantic representation while minimizing high-frequency information loss. Furthermore, HSENet includes two types of feature fusion modules: one leverages cross-modal attention for more comprehensive feature fusion, and the other utilizes semantic features as a third input to further enhance the semantic representation for image fusion. These modules achieve robust and flexible feature fusion in complex scenarios by dynamically balancing global semantic consistency and fine-grained local detail representation. Our approach excels in visual perception tasks while fully preserving the texture features of the source modalities. Comparative experiments on image fusion and semantic segmentation demonstrate the superiority of HSENet in visual quality and semantic preservation. The code is available at https://github.com/Lxyklmyt/HSENet.
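The cross-modal attention mentioned in the abstract can be illustrated with a minimal numpy sketch: each modality's features query the other modality's features, and the two attended outputs are combined. This is only an illustrative sketch of the general technique, not the authors' HSENet implementation; the function names, the (N, d) token layout, and the additive fusion are assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax along the given axis
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_modal_attention(ir, vis):
    """Illustrative cross-modal attention fusion (not the paper's exact module).

    ir, vis: (N, d) feature matrices (N spatial tokens, d channels).
    Infrared tokens attend over visible tokens and vice versa;
    the two attended feature maps are summed as a simple fusion.
    """
    d = ir.shape[-1]
    # infrared queries, visible keys/values
    ir_attends_vis = softmax(ir @ vis.T / np.sqrt(d)) @ vis
    # visible queries, infrared keys/values
    vis_attends_ir = softmax(vis @ ir.T / np.sqrt(d)) @ ir
    # additive fusion of the two cross-attended streams (an assumption here)
    return ir_attends_vis + vis_attends_ir

rng = np.random.default_rng(0)
fused = cross_modal_attention(rng.standard_normal((16, 32)),
                              rng.standard_normal((16, 32)))
```

A third semantic-feature input, as in the paper's second fusion module, could be handled analogously by letting both modalities also attend over semantic tokens.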
Source: PATTERN RECOGNITION
ISSN: 0031-3203
Year: 2026
Volume: 170
Impact Factor: 7.500 (JCR@2023)