J Shanghai Jiaotong Univ Sci, 2021, Vol. 26, Issue (4): 463-470. doi: 10.1007/s12204-020-2253-5


Novel Data Placement Algorithm for Distributed Storage System Based on Fault-Tolerant Domain

SHI Lianxing (石连星), WANG Zhiheng (王志恒), LI Xiaoyong * (李小勇)    

  (1. School of Cyber Science and Engineering, Shanghai Jiao Tong University, Shanghai 200240, China;
   2. School of Electronic Information Engineering, Shanghai DianJi University, Shanghai 200240, China)
  • Online: 2021-08-28 Published: 2021-06-06
  • Contact: LI Xiaoyong * (李小勇) E-mail: Xiaoyongli@sjtu.edu.cn


Abstract: The 3-replica redundancy strategy is widely used to ensure data reliability in large-scale distributed storage systems. However, its storage capacity utilization is only 33%. In this paper, a data placement algorithm based on the fault-tolerant domain (FTD) is proposed. Owing to the fine-grained design of the FTD, the data reliability of systems using two replicas is comparable to that of current mainstream systems using three replicas, while the capacity utilization is increased to 50%. Moreover, the proposed FTD provides a new concept for the design of distributed storage systems: they can take FTDs as the units for data placement, data migration, data repair, and so on. In addition, fault detection can be performed independently and concurrently within the FTDs.
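
The 33% and 50% figures follow from simple arithmetic: with n full replicas of every object, only 1/n of the raw capacity holds unique data. The sketch below is an illustration of that calculation only, not the paper's placement algorithm; the function names and the 600 TB cluster size are hypothetical.

```python
def capacity_utilization(replicas: int) -> float:
    """Fraction of raw capacity that stores unique data under n-way replication."""
    if replicas < 1:
        raise ValueError("replicas must be at least 1")
    return 1.0 / replicas


def usable_capacity_tb(raw_tb: float, replicas: int) -> float:
    """Usable capacity in TB for a cluster with raw_tb of raw storage."""
    return raw_tb / replicas


# Hypothetical 600 TB cluster, compared under 3-replica and 2-replica schemes.
for n in (3, 2):
    print(f"{n} replicas: {capacity_utilization(n):.0%} utilization, "
          f"{usable_capacity_tb(600, n):.0f} TB usable of 600 TB raw")
```

Under these assumptions, moving from three replicas to two raises usable capacity by half for the same raw storage, which is the gain the abstract refers to.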


Key words: data reliability, failure domain, fault-tolerant domain, data placement, storage system, distributed system

