Indexed by:
Abstract:
In the era of big data, traditional relational databases face challenges in massive data storage. HBase is a non-relational database based on column storage which is used widely for big data storage. The writing performance of HBase is high, but the unbalanced load caused by its uneven data storage strategy is the bottleneck of its reading performance. HBase needs to access disks to get query results. Therefore, the efficiency of disk access has a great impact on the query performance of HBase. In view of the above problems, this paper proposes a storage middleware which utilizes HBase for load balancing and Redis for memory caching. Specifically, we improve HBase’s original load balancing algorithm for Regions and RegionServers and customize a Redis cache eviction algorithm according to the data’s query and update frequency. Furthermore, the coprocessor of HBase is used to synchronize data between HBase and Redis. Experiments on synthetic datasets show that the proposed storage middleware achieves better writing and reading performance than HBase. The load balancing algorithm employed in the middleware is better than HBase's original algorithm. The hit rate of the customized cache eviction algorithm is also higher than that of the LRU algorithm. © 2021, Springer Nature Singapore Pte Ltd.
Keyword:
Reprint 's Address:
Email:
Source :
ISSN: 1865-0929
Year: 2021
Volume: 1330 CCIS
Page: 364-380
Language: English
Cited Count:
WoS CC Cited Count: 0
SCOPUS Cited Count:
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 3
Affiliated Colleges: