Author:

Zhao, Tiesong [1] | Huang, Yuhang [2] | Feng, Weize [3] | Xu, Yiwen (徐艺文) [4] | Kwong, Sam [5]

Indexed by:

EI Scopus SCIE

Abstract:

Ever-growing multimedia traffic has underscored the importance of effective multimedia codecs. Among them, the latest lossy video coding standard, Versatile Video Coding (VVC), has attracted the attention of the video coding community. However, the coding gain of VVC comes at the cost of significantly higher encoding complexity, which creates the need for a fast encoder with comparable Rate-Distortion (RD) performance. In this paper, we propose to reduce the complexity of VVC intra-frame prediction with a two-stage framework of deep feature fusion and probability estimation. In the first stage, we employ a deep convolutional network to extract spatial-temporal neighboring coding features. We then fuse all reference features obtained by different convolutional kernels to determine an optimal intra coding depth. In the second stage, we employ a probability-based model and spatial-temporal coherence to select the candidate partition modes within the optimal coding depth. Finally, only the selected depths and partitions are evaluated, and unnecessary computations are skipped. Experimental results on a standard database demonstrate the superiority of the proposed method, especially for High Definition (HD) and Ultra-HD (UHD) video sequences.

Keyword:

Complexity theory; Computational modeling; Convolutional neural networks; Encoding; Feature extraction; Intra coding; Kernel; Rate-distortion (RD); Versatile video coding (VVC); Video coding

Affiliation:

  • [ 1 ] [Zhao, Tiesong]Fuzhou Univ, Coll Phys & Informat Engn, Fujian Key Lab Intelligent Proc & Wireless Transm, Fuzhou 350108, Peoples R China
  • [ 2 ] [Huang, Yuhang]Fuzhou Univ, Coll Phys & Informat Engn, Fujian Key Lab Intelligent Proc & Wireless Transm, Fuzhou 350108, Peoples R China
  • [ 3 ] [Feng, Weize]Fuzhou Univ, Coll Phys & Informat Engn, Fujian Key Lab Intelligent Proc & Wireless Transm, Fuzhou 350108, Peoples R China
  • [ 4 ] [Xu, Yiwen]Fuzhou Univ, Coll Phys & Informat Engn, Fujian Key Lab Intelligent Proc & Wireless Transm, Fuzhou 350108, Peoples R China
  • [ 5 ] [Zhao, Tiesong]Peng Cheng Lab, Shenzhen 518055, Peoples R China
  • [ 6 ] [Kwong, Sam]City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA

ISSN: 1520-9210

Year: 2023

Volume: 25

Page: 6411-6421

Impact Factor (JCR@2023): 8.400

JCR Journal Grade:1

CAS Journal Grade:1

Cited Count:

WoS CC Cited Count: 13

SCOPUS Cited Count: 12

ESI Highly Cited Papers on the List: 0
