Improving LSM-trie performance by parallel search

Cheng, Wen; Guo, Tao; Zeng, Lingfang; Wang, Yang; Nagel, Lars; Süß, Tim; Brinkmann, André

optlsmtrie_spe_r3.pdf (338.46 kB)

Improving LSM-trie performance by parallel search

journal contribution

posted on 2020-08-28, 09:21 authored by Wen Cheng, Tao Guo, Lingfang Zeng, Yang Wang, Lars NagelLars Nagel, Tim Süß, André Brinkmann

LSM-trie-based key-value (KV) store is often used to manage an ultralarge dataset in reality by introducing a number of sublevels at each level, its linear growth pattern can fairly reduce the write amplification in store operations. Although this design is effective for the write operation, the last level holds a large proportion of KV items, leading to the extreme imbalance of data distribution. Therefore, to support efficient read, we need to carefully consider this imbalance. On the other hand, to ensure that acquired data is latest, the LSM-trie needs to search the dataset at different levels one by one, and this search method may take a lot of unnecessary time. When the number of items is ultralarge, the random lookup performance may be poor due to the imbalance data distribution. To address this issue, we improve the read performance of the LSM-trie by changing its serial search to parallel search, using two threads to simultaneously search at the last level and other levels, respectively. Our experiment results show that the read performance of the LSM-trie can be improved up to 98.35% and on average 71.55%.

Funding

National Natural Science Foundation of China. Grant Number: 61472153, 61672513, 61572377.

National Science Foundation of Hubei Province. Grant Number: 2015CFB192.

Science and Technology Planning Project of Guangdong Province. Grant Number: 2019B010137002.

the Key‐Area Research and Development Program of Guangdong Province. Grant Number: 2020B010164002.

History

School

Science

Department

Computer Science

Published in

Software: Practice and Experience

Volume

50

Issue

10

Pages

1952 - 1965

Publisher

Wiley

Version

AM (Accepted Manuscript)

Rights holder

Publisher statement

This is the peer reviewed version of the following article: CHENG, W. ... et al, 2020. Improving LSM‐trie performance by parallel search. Software: Practice and Experience, 50 (10), pp.1952-1965, which has been published in final form at https://doi.org/10.1002/spe.2875. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Use of Self-Archived Versions.

Acceptance date

2020-06-05

Publication date

2020-08-03

Copyright date

2020

DOI

https://doi.org/10.1002/spe.2875

ISSN

0038-0644

eISSN

1097-024X

Publisher version

https://doi.org/10.1002/spe.2875

Language

en

Depositor

Dr Lars Nagel. Deposit date: 27 August 2020

Usage metrics

Keywords

key-value storage LSM-tree parallel search

Licence

CC BY-NC-ND 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Improving LSM-trie performance by parallel search

Funding

National Natural Science Foundation of China. Grant Number: 61472153, 61672513, 61572377.

National Science Foundation of Hubei Province. Grant Number: 2015CFB192.

Science and Technology Planning Project of Guangdong Province. Grant Number: 2019B010137002.

the Key‐Area Research and Development Program of Guangdong Province. Grant Number: 2020B010164002.

History

School

Department

Published in

Volume

Issue

Pages

Publisher

Version

Rights holder

Publisher statement

Acceptance date

Publication date

Copyright date

DOI

ISSN

eISSN

Publisher version

Language

Depositor

Usage metrics

Categories

Keywords

Licence

Exports