Abstract:
Versatile Video Coding (VVC) introduces advanced coding techniques and tools, such as the QuadTree with nested Multi-type Tree (QTMT) partition structure, and outperforms High Efficiency Video Coding (HEVC) in coding performance. However, this gain in coding performance comes at the cost of increased coding complexity. In this paper, we propose a multi-feature fusion framework that integrates rate-distortion-complexity optimization theory with deep learning to reduce the complexity of QTMT partitioning for VVC inter-prediction. First, the framework extracts luminance, motion, residual, and quantization features from video frames and fuses them through a convolutional neural network to predict the minimum partition size of Coding Units (CUs). Second, a novel rate-distortion-complexity loss function is designed to balance computational complexity against compression performance: by adjusting the relative weights of the rate, distortion, and complexity costs in this loss, we can shift the network's prediction bias and impose constraints on different block partition sizes, enabling flexible complexity adjustment. Compared with the VTM-13.0 anchor, the proposed method reduces encoding time by 10.14% to 56.62%, with the BDBR increase confined to 0.31% to 6.70%. The proposed method achieves a broader range of complexity adjustment while preserving coding performance, surpassing both traditional methods and deep learning-based methods. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
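The rate-distortion-complexity trade-off described in the abstract can be sketched as a Lagrangian-style weighted cost. The formulation below (J = D + λ·R + μ·C) and all weight values are illustrative assumptions, not the paper's actual loss function, which is not given in this record:

```python
def rd_complexity_cost(distortion, rate, complexity, lam=1.0, mu=0.5):
    """Hypothetical Lagrangian-style cost combining distortion (D),
    rate (R), and encoding complexity (C).
    lam and mu are illustrative trade-off weights, not values from the paper."""
    return distortion + lam * rate + mu * complexity

# With a large complexity weight mu, a slightly worse but much faster
# partition choice can yield the lower overall cost, biasing decisions
# toward cheaper (early-terminated) CU partitions.
fast = rd_complexity_cost(distortion=2.0, rate=1.0, complexity=0.2, mu=2.0)
slow = rd_complexity_cost(distortion=1.8, rate=1.0, complexity=1.0, mu=2.0)
assert fast < slow
```

Varying μ in this way is one plausible mechanism for the "various distributions of rate-distortion-complexity costs" the abstract mentions: a larger μ trades a small BDBR increase for larger encoding-time savings.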
Source:
Journal of Real-Time Image Processing
ISSN: 1861-8200
Year: 2024
Issue: 6
Volume: 21
Impact Factor: 2.900 (JCR@2023)
ESI Highly Cited Papers on the List: 0