基于区组设计的Piggybacking码构造(网络首发)

展开
  • 1.长安大学信息工程学院;2.西北工业大学电子信息学院

网络出版日期: 2024-10-08

基金资助

国家自然科学基金(62001059); 陕西省重点研发计划项目(2024GX-YBXM-068);

Construction of Block Design-Based Piggybacking Codes

Expand
  • (1. School of Information Engineering, Chang’an University, 710064, Xi’an, China; 2. School of Electronics and lnformation, Northwestern Polytechnical University, 710129,Xi’an, China)

Online published: 2024-10-08

摘要

现有的piggybacking编码在最大距离可分(Maximum Distance Separable, MDS)码的基础上能够有效减小信息节点的修复带宽开销,但仍存在修复度大、校验节点修复带宽高等问题,同时忽略了实际分布式存储系统中数据冷热程度不同。为此,提出了一种基于区组设计的piggybacking编码构造,对热数据节点提供更高等级的保护。具体地,采用区组设计,对冷热数据节点进行非均匀分组,将热数据符号捎带到对应校验节点中,通过一定规则生成冷、热数据校验块和斜校验块,实现节点的低修复带宽率和低修复度率。理论分析与实验仿真发现,与现有的piggybacking编码相比,基于区组设计的piggybacking码显著降低了故障节点的平均修复带宽率和平均修复度率,且相较于冷数据节点,热数据节点的修复带宽开销更低。

本文引用格式

李瞳1, 王静1, 张洁薇1, 杨红志1, 刘向阳2 . 基于区组设计的Piggybacking码构造(网络首发)[J]. 上海交通大学学报, 0 : 0 . DOI: 10.16183/j.cnki.jsjtu.2024.203

Abstract

Based on Maximum Distance Separable (MDS) codes, piggybacking codes can reduce the repair bandwidth overhead of systematic nodes effectively, but there are still problems about larger repair degree, higher repair bandwidth of parity nodes, etc. At the same time, the existing piggybacking codes ignore the differences of hot and cold data in the actual distributed storage systems. For the reason above, a construction of block design-based piggybacking codes is proposed, which can provide more protection for hot data nodes. Specifically, the block design is used to group hot and cold data nodes non-uniformly, in which the hot data symbols are piggybacked into the corresponding parity nodes. Moreover, cold data parity blocks, hot data parity blocks, and slant parity blocks are generated to achieve lower repair bandwidth rate and repair degree rate of nodes. Theoretical analyses and experimental simulations show that, compared with the existing piggybacking codes, the block design-based piggybacking codes reduce the average repair bandwidth rate and average repair degree rate of the failed nodes significantly, and the repair bandwidth overhead of hot data nodes is much lower than cold data nodes.
文章导航

/