GAN-based multi-view video coding with spatio-temporal EPI reconstruction - Details

author：

Lan, Chengdong (Lan, Chengdong.) ^[1] | Yan, Hao (Yan, Hao.) ^[2] | Luo, Cheng (Luo, Cheng.) ^[3] | Zhao, Tiesong (Zhao, Tiesong.) ^[4]

Indexed by：

Abstract：

The　introduction　of　multiple　viewpoints　in　video　scenes　inevitably　increases　the　bitrates　required　for　storage　and　transmission.　To　reduce　bitrates,　researchers　have　developed　methods　to　skip　intermediate　viewpoints　during　compression　and　delivery,　and　ultimately　reconstruct　them　using　Side　Information　(SInfo).　Typically,　depth　maps　are　used　to　construct　SInfo.　However,　these　methods　suffer　from　reconstruction　inaccuracies　and　inherently　high　bitrates.　In　this　paper,　we　propose　a　novel　multi-view　video　coding　method　that　leverages　the　image　generation　capabilities　of　Generative　Adversarial　Network　(GAN)　to　improve　the　reconstruction　accuracy　of　SInfo.　Additionally,　we　consider　incorporating　information　from　adjacent　temporal　and　spatial　viewpoints　to　further　reduce　SInfo　redundancy.　At　the　encoder,　we　construct　a　spatio-temporal　Epipolar　Plane　Image　(EPI)　and　further　utilize　a　convolutional　network　to　extract　the　latent　code　of　a　GAN　as　SInfo.　At　the　decoder,　we　combine　the　SInfo　and　adjacent　viewpoints　to　reconstruct　intermediate　views　using　the　GAN　generator.　Specifically,　we　establish　a　joint　encoder　constraint　for　reconstruction　cost　and　SInfo　entropy　to　achieve　an　optimal　trade-off　between　reconstruction　quality　and　bitrate　overhead.　Experiments　demonstrate　the　significant　improvement　in　Rate–Distortion　(RD)　performance　compared　to　state-of-the-art　methods.　©　2024

Keyword：

Convolutional codes Encoding (symbols) Generative adversarial networks Image coding Image enhancement Image reconstruction Network coding

Community：

[ 1 ] [Lan, Chengdong]Fujian Key Laboratory for Intelligent Processing and Wireless Transmission of Media Information, Fuzhou University, Fuzhou; 350108, China
[ 2 ] [Yan, Hao]Fujian Key Laboratory for Intelligent Processing and Wireless Transmission of Media Information, Fuzhou University, Fuzhou; 350108, China
[ 3 ] [Luo, Cheng]Fujian Key Laboratory for Intelligent Processing and Wireless Transmission of Media Information, Fuzhou University, Fuzhou; 350108, China
[ 4 ] [Zhao, Tiesong]Fujian Key Laboratory for Intelligent Processing and Wireless Transmission of Media Information, Fuzhou University, Fuzhou; 350108, China

Reprint 's Address：

Email：

Show more details

Related Keywords：

Distributed turbo product codes over multiple relays
2010，2010 7th IEEE Consumer Communications and Networking Conference, CCNC 2010
Analysis of PAM Mapping for Convolutional-Coded Physical-Layer Network Coding
2022，IEEE COMMUNICATIONS LETTERS
Video coding based on overcomplete motion compensated temporal
2005，Multimedia Systems and Applications VIII
On Construction of Low-Density Parity-Check Codes for Ultra-Reliable and Low Latency Communications
2024，IEEE Transactions on Communications

Source ：

Signal Processing: Image Communication

ISSN： 0923-5965

Year： 2025

Volume： 132

3 . 4 0 0

JCR@2023

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to