Journal of Shanghai Jiao Tong University, 2024, Vol. 58, Issue (4): 579-584. doi: 10.16183/j.cnki.jsjtu.2024.035

• Electronic Information and Electrical Engineering •

A Consistency Checking Method for Erasure-Coded Striped Data

XU Liangye1, SHI Lianxing2, SHAN Rongsheng3

    1. Shanghai Children’s Medical Center, Shanghai Jiao Tong University School of Medicine, Shanghai 200127, China
    2. Shanghai Xiaoyun Info Tech Co., Ltd., Shanghai 200240, China
    3. School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
  • Received: 2024-01-24 Revised: 2024-02-20 Accepted: 2024-02-22 Online: 2024-04-28 Published: 2024-04-30

Abstract:

Erasure coding is commonly used in distributed storage systems. In erasure-coded data, the stripe is the basic unit of consistency checking and comprises multiple original stripe units and verification (parity) stripe units. To reduce the read cost of consistency checks on erasure-coded striped data, and to improve the efficiency of both consistency checking and read-after-write verification, self-correction data tags (SCDTs) are added to each stripe unit when erasure-coded data is written in striping mode, and the consistency check of each stripe is then performed on the basis of these tags. The proposed method can complete the consistency check of a stripe without reading all data units in the stripe, which improves the efficiency of consistency checks by 1.7 to 2.6 times. Moreover, when the number of stripe units updated by a write is below a critical value, the method effectively reduces the number of input/output (IO) interactions required for the write. The proposed method therefore handles partial updates of striped data sets better while also improving the efficiency of consistency checks.
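The abstract does not specify the layout of an SCDT, so the following Python sketch is only an illustration of the general idea, not the authors' method. It assumes a single XOR parity unit in place of a real erasure code and a hypothetical tag of the form (stripe version, CRC32 of the payload). It contrasts a conventional check, which must read every payload in the stripe, with a check that inspects only the per-unit tags. All names (`StripeUnit`, `write_stripe`, `full_read_check`, `tag_only_check`) are invented for the example.

```python
import zlib
from dataclasses import dataclass
from functools import reduce


@dataclass
class StripeUnit:
    payload: bytes
    tag: tuple  # hypothetical tag: (stripe_version, crc32_of_payload)


def xor_parity(units):
    """Byte-wise XOR of equally sized units (stand-in for a real erasure code)."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*units))


def write_stripe(data_units, version):
    """Write data units plus one parity unit, tagging every unit (assumed layout)."""
    payloads = list(data_units) + [xor_parity(data_units)]
    return [StripeUnit(p, (version, zlib.crc32(p))) for p in payloads]


def full_read_check(stripe):
    """Conventional check: read all payloads and recompute the parity."""
    *data, parity = [unit.payload for unit in stripe]
    return xor_parity(data) == parity


def tag_only_check(stripe):
    """Tag-based check: all units must carry the same stripe version.

    Only the small tags are read, not the payloads. In this sketch the check
    detects incomplete (torn) stripe updates; it does not detect silent
    payload corruption.
    """
    return len({unit.tag[0] for unit in stripe}) == 1


stripe = write_stripe([b"AAAA", b"BBBB", b"CCCC"], version=1)
consistent_before = (full_read_check(stripe), tag_only_check(stripe))

# Simulate a partial update: one data unit is rewritten, the parity is not.
stripe[0] = StripeUnit(b"ZZZZ", (2, zlib.crc32(b"ZZZZ")))
consistent_after = (full_read_check(stripe), tag_only_check(stripe))
```

In this toy setting both checks agree before and after the simulated partial update, but the tag-based check touches only a few bytes of metadata per unit rather than the full payloads.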

Key words: distributed storage system, striped data, consistency check, data tag, erasure code

CLC Number: