• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Wang, Xu (Wang, Xu.) [1] | He, Ziyan (He, Ziyan.) [2] | Zhang, Qiudan (Zhang, Qiudan.) [3] | Yang, You (Yang, You.) [4] | Zhao, Tiesong (Zhao, Tiesong.) [5] | Jiang, Jianmin (Jiang, Jianmin.) [6]

Indexed by:

Scopus SCIE

Abstract:

Being able to estimate monocular depth for spherical panoramas is of fundamental importance in 3D scene perception. However, spherical distortion severely limits the effectiveness of vanilla convolutions. To push the envelope of accuracy, recent approaches attempt to utilize Tangent projection (TP) to estimate the depth of 360(degrees) images. Yet, these methods still suffer from discrepancies and inconsistencies among patch-wise tangent images, as well as the lack of accurate ground truth depth maps under a supervised fashion. In this paper, we propose a geometry-aware self-supervised 360(degrees) image depth estimation methodology that explores the complementary advantages of TP and Equirectangular projection (ERP) by an asymmetric dual-domain collaborative learning strategy. Especially, we first develop a lightweight asymmetric dual-domain depth estimation network, which enables to aggregate depth-related features from a single TP domain, and then produce depth distributions of the TP and ERP domains via collaborative learning. This effectively mitigates stitching artifacts and preserves fine details in depth inference without overspending model parameters. In addition, a frequent-spatial feature concentration module is devised to simultaneously capture non-local Fourier features and local spatial features, such that facilitating the efficient exploration of monocular depth cues. Moreover, we introduce a geometric structural alignment module to further improve geometric structural consistency among tangent images. Extensive experiments illustrate that our designed approach outperforms existing self-supervised 360(degrees) depth estimation methods on three publicly available benchmark datasets.

Keyword:

360(degrees) image Accuracy depth estimation Depth measurement Distortion Estimation Feature extraction Federated learning Image reconstruction self-supervised learning tangent projection Three-dimensional displays Transformers Visualization

Community:

  • [ 1 ] [Wang, Xu]Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
  • [ 2 ] [He, Ziyan]Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
  • [ 3 ] [Zhang, Qiudan]Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
  • [ 4 ] [Jiang, Jianmin]Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
  • [ 5 ] [Yang, You]Huangzhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
  • [ 6 ] [Zhao, Tiesong]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350108, Peoples R China

Reprint 's Address:

  • [Zhang, Qiudan]Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

Show more details

Related Keywords:

Source :

IEEE TRANSACTIONS ON MULTIMEDIA

ISSN: 1520-9210

Year: 2025

Volume: 27

Page: 3224-3237

8 . 4 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 2

Online/Total:87/10829407
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1