• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Cheng, Yongli (Cheng, Yongli.) [1] (Scholars:程永利) | Ma, Yan (Ma, Yan.) [2] (Scholars:马妍) | Jiang, Hong (Jiang, Hong.) [3] | Zeng, Lingfang (Zeng, Lingfang.) [4] | Wang, Fang (Wang, Fang.) [5] | Xu, Xianghao (Xu, Xianghao.) [6] | Wu, Yuhang (Wu, Yuhang.) [7]

Indexed by:

EI Scopus SCIE

Abstract:

Existing graph systems focus mainly on the execution efficiency of the graph analysis tasks, often ignoring the importance and efficiency of time-evolving graph storage. However, to effectively mine the potential application values, an efficient storage system is important for time-evolving graphs whose storage requirement scales with the increasing number of snapshots. Storage cost and snapshot access speed are the two most important performance indicators for a time-evolving graph storage system, which are challenging for designers of such systems because they are conflicting goals. In this article, we address these challenges by proposing an efficient storage scheme for the large time-evolving graphs. We first design a Snapshot-level Data Deduplication (SLDD) strategy to eliminate the large number of repeated vertices and edges among the snapshots, and then a Structure-Changing Graph Representation (SCGR) to significantly improve the snapshot access speed. We implement an efficient time-evolving graph storage system, TgStore, based on this scheme to effectively store large-scale time-evolving graphs, aiming to efficiently support the time-evolving graph analysis tasks. Experimental results show that TgStore can obtain a high compression ratio of 43.03:1 when storing 100 snapshots of Twitter, while with an average snapshot access speedup of 16x. Efficient storage scheme enables TgStore to efficiently support time-evolving graph algorithms. For example, when executing the Pagerank algorithm on the time-evolving graph of Twitter, TgStore outperforms Graphone, a state-of-the-art time-evolving graph storage system, by 15.9x in algorithm execution speed and 1.45x in memory usage.

Keyword:

Big Data Blogs Costs data deduplication data representation Market research Pandemics Social networking (online) storage system Task analysis Time-evolving graph

Community:

  • [ 1 ] [Cheng, Yongli]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350025, Peoples R China
  • [ 2 ] [Ma, Yan]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350025, Peoples R China
  • [ 3 ] [Wu, Yuhang]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350025, Peoples R China
  • [ 4 ] [Cheng, Yongli]Fuzhou Univ, Fujian Key Lab Network Comp & Intelligent Informat, Fuzhou 350025, Peoples R China
  • [ 5 ] [Cheng, Yongli]Minist Educ, Engn Res Ctr Big Data Intelligence, Fuzhou 350025, Peoples R China
  • [ 6 ] [Cheng, Yongli]Zhejiang Lab, Hangzhou 311121, Peoples R China
  • [ 7 ] [Zeng, Lingfang]Zhejiang Lab, Hangzhou 311121, Peoples R China
  • [ 8 ] [Jiang, Hong]Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
  • [ 9 ] [Wang, Fang]Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China
  • [ 10 ] [Xu, Xianghao]Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China

Reprint 's Address:

  • [Zeng, Lingfang]Zhejiang Lab, Hangzhou 311121, Peoples R China;;[Xu, Xianghao]Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China

Show more details

Related Keywords:

Source :

IEEE TRANSACTIONS ON BIG DATA

ISSN: 2332-7790

Year: 2024

Issue: 2

Volume: 10

Page: 158-173

7 . 5 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 2

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 3

Online/Total:55/10051266
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1