• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Wei, Xiaojie (Wei, Xiaojie.) [1] | Zeng, Hongji (Zeng, Hongji.) [2] | Fang, Ying (Fang, Ying.) [3] | Lin, Liqun (Lin, Liqun.) [4] | Chen, Weiling (Chen, Weiling.) [5] | Xu, Yiwen (Xu, Yiwen.) [6]

Indexed by:

EI

Abstract:

Versatile Video Coding (VVC) introduces various advanced coding techniques and tools, such as QuadTree with nested Multi-type Tree (QTMT) partition structure, and outperforms High Efficiency Video Coding (HEVC) in terms of coding performance. However, the improvement of coding performance leads to an increase in coding complexity. In this paper, we propose a multi-feature fusion framework that integrates the rate-distortion-complexity optimization theory with deep learning techniques to reduce the complexity of QTMT partition for VVC inter-prediction. Firstly, the proposed framework extracts features of luminance, motion, residuals, and quantization information from video frames and then performs feature fusion through a convolutional neural network to predict the minimum partition size of Coding Units (CUs). Next, a novel rate-distortion-complexity loss function is designed to balance computational complexity and compression performance. Then, through this loss function, we can adjust various distributions of rate-distortion-complexity costs. This adjustment impacts the prediction bias of the network and sets constraints on different block partition sizes to facilitate complexity adjustment. Compared to anchor VTM-13.0, the proposed method saves the encoding time by 10.14% to 56.62%, with BDBR increase confined to a range of 0.31% to 6.70%. The proposed method achieves a broader range of complexity adjustments while ensuring coding performance, surpassing both traditional methods and deep learning-based methods. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.

Keyword:

Block codes Convolutional neural networks Deep learning Health risks Image coding Image compression

Community:

  • [ 1 ] [Wei, Xiaojie]Fujian Key Laboratory for Intelligent Processing and Wireless Transmission of Media Information, Fuzhou University, No. 2 North Wulong River Avenue, Fuzhou College Town, Fujian, Fuzhou, China
  • [ 2 ] [Zeng, Hongji]Fujian Key Laboratory for Intelligent Processing and Wireless Transmission of Media Information, Fuzhou University, No. 2 North Wulong River Avenue, Fuzhou College Town, Fujian, Fuzhou, China
  • [ 3 ] [Fang, Ying]Fujian Key Laboratory for Intelligent Processing and Wireless Transmission of Media Information, Fuzhou University, No. 2 North Wulong River Avenue, Fuzhou College Town, Fujian, Fuzhou, China
  • [ 4 ] [Lin, Liqun]Fujian Key Laboratory for Intelligent Processing and Wireless Transmission of Media Information, Fuzhou University, No. 2 North Wulong River Avenue, Fuzhou College Town, Fujian, Fuzhou, China
  • [ 5 ] [Chen, Weiling]Fujian Key Laboratory for Intelligent Processing and Wireless Transmission of Media Information, Fuzhou University, No. 2 North Wulong River Avenue, Fuzhou College Town, Fujian, Fuzhou, China
  • [ 6 ] [Xu, Yiwen]Fujian Key Laboratory for Intelligent Processing and Wireless Transmission of Media Information, Fuzhou University, No. 2 North Wulong River Avenue, Fuzhou College Town, Fujian, Fuzhou, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Journal of Real-Time Image Processing

ISSN: 1861-8200

Year: 2024

Issue: 6

Volume: 21

2 . 9 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 2

Affiliated Colleges:

Online/Total:382/9652087
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1