北京邮电大学学报

  • EI-indexed core journal

北京邮电大学学报, 2013, Vol. 36, Issue (3): 30-34. doi: 10.13190/jbupt.201303.31.xiechy

• 论文

适用于云存储冗余的半随机线性网络编码

谢垂益1,2, 贾忠田3, 卿斯汉4,5, 罗守山1, 程明智6   

    1. 北京邮电大学 灾备技术国家工程实验室, 北京 100876;
    2. 韶关学院 数学与信息科学学院, 广东 韶关 512005;
    3. 山东省网络环境智能计算技术重点实验室, 济南 250022;
    4. 中国科学院 软件研究所, 北京 100190;
    5. 中国科学院 信息工程研究所, 北京 100093;
    6. 北京印刷学院 信息工程学院, 北京 102600
  • 收稿日期:2012-07-10 出版日期:2013-06-30 发布日期:2013-06-30
  • About the authors: XIE Chui-yi (born 1974), male, Ph.D. candidate, E-mail: gdxcy@163.com; QING Si-han (born 1939), male, research professor, doctoral supervisor.
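  • Supported by:

    Construction project of the National Engineering Laboratory for Disaster Backup and Recovery; National Natural Science Foundation of China (60970135, 61170282); Beijing Natural Science Foundation (4122026)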

Semi-Random Linear Network Coding for Cloud Storage Redundancy

XIE Chui-yi1,2, JIA Zhong-tian3, QING Si-han4,5, LUO Shou-shan1, CHENG Ming-zhi6   

    1. National Engineering Laboratory for Disaster Backup and Recovery, Beijing University of Posts and Telecommunications, Beijing 100876, China;
    2. School of Mathematics and Information Science, Shaoguan University, Guangdong Shaoguan 512005, China;
    3. Shandong Provincial Key Laboratory of Network Based Intelligent Computing, Jinan 250022, China;
    4. Institute of Software, Chinese Academy of Sciences, Beijing 100190, China;
    5. Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China;
    6. College of Information Engineering, Beijing Institute of Graphic Communication, Beijing 102600, China
  • Received:2012-07-10 Online:2013-06-30 Published:2013-06-30

摘要:

为了实现数据冗余备份,根据云存储的结构特点和可用性要求,提出半随机线性网络编码(SRLNC)算法,使用有限域GF(2^s)上的单位行向量和随机行向量组成编码矩阵,对数据块分片、编码后进行冗余存储,采取单位行向量优先的策略进行解码. 分析了该算法的解码方程组存在唯一解的概率,并提出衡量算法运算时间的"每字符运算次数"指标. 实验结果表明,新算法在解码过程的运算时间随结点可用性的增加以指数级减少,当冗余度≤3且结点的可用性≥0.8时,比随机线性网络编码算法的编码运算时间快33%以上,比解码运算时间快5倍以上,适合应用于"一次写多次读"的云存储系统.

关键词: 云存储, 网络编码, 单位向量, 冗余, 每字符运算次数

Abstract:

A semi-random linear network coding (SRLNC) algorithm is proposed for redundant data backup, based on the structural characteristics and availability requirements of cloud storage. Each data block is split into pieces, encoded with a coding matrix composed of unit row vectors and random row vectors over the finite field GF(2^s), and stored redundantly across data nodes. During decoding, the data block is restored with a strategy that gives priority to unit row vectors. The probability that the SRLNC decoding equations have a unique solution is analyzed, and a new index, operations per character, is defined to measure the computation time of the algorithm. Experimental results show that the decoding time of the proposed algorithm decreases exponentially as node availability increases. When the redundancy is ≤3 and the availability of each node is ≥0.8, encoding is more than 33% faster and decoding is more than 5 times faster than with the random linear network coding algorithm. The proposed algorithm is therefore suited to "write once, read many" cloud storage systems.

Key words: cloud storage, network coding, unit vector, redundancy, operations per character
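
The scheme outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions not stated in the abstract: s = 8, the field polynomial 0x11D, plain Gauss-Jordan elimination for solving the decoding equations, and the hypothetical helper names `make_matrix`, `encode`, and `decode`. The first k rows of the coding matrix are unit row vectors (so the first k stored pieces are the original pieces), and the remaining n - k rows are random.

```python
import random

# GF(2^8) log/antilog tables; the polynomial 0x11D (generator 2) is an assumed choice.
EXP, LOG = [0] * 510, [0] * 256
x = 1
for i in range(255):
    EXP[i], LOG[x] = x, i
    x <<= 1
    if x & 0x100:
        x ^= 0x11D
for i in range(255, 510):
    EXP[i] = EXP[i - 255]

def gmul(a, b):
    """Multiply two field elements."""
    return 0 if a == 0 or b == 0 else EXP[LOG[a] + LOG[b]]

def ginv(a):
    """Multiplicative inverse of a nonzero field element."""
    return EXP[255 - LOG[a]]

def make_matrix(k, n, rng):
    """k unit row vectors followed by n - k random row vectors."""
    unit = [[int(i == j) for j in range(k)] for i in range(k)]
    rand = [[rng.randrange(256) for _ in range(k)] for _ in range(n - k)]
    return unit + rand

def encode(pieces, matrix):
    """Each coded piece is a linear combination of the k source pieces."""
    coded = []
    for row in matrix:
        out = bytearray(len(pieces[0]))
        for coef, piece in zip(row, pieces):
            if coef:
                for c, byte in enumerate(piece):
                    out[c] ^= gmul(coef, byte)
        coded.append(bytes(out))
    return coded

def decode(available, matrix, k):
    """available maps row index -> coded piece; unit rows (index < k) go first."""
    rows = sorted(available, key=lambda r: (r >= k, r))
    system = [(list(matrix[r]), bytearray(available[r])) for r in rows]
    for col in range(k):
        pivot = next((i for i in range(col, len(system)) if system[i][0][col]), None)
        if pivot is None:
            raise ValueError("decoding equations have no unique solution")
        system[col], system[pivot] = system[pivot], system[col]
        coefs, data = system[col]
        inv = ginv(coefs[col])
        if inv != 1:  # unit rows already have a pivot of 1, so no scaling is needed
            coefs[:] = [gmul(inv, v) for v in coefs]
            data[:] = bytes(gmul(inv, v) for v in data)
        for i, (c2, d2) in enumerate(system):
            f = c2[col]
            if i != col and f:  # rows with a zero in this column are skipped
                for j in range(k):
                    c2[j] ^= gmul(f, coefs[j])
                for j in range(len(d2)):
                    d2[j] ^= gmul(f, data[j])
    return [bytes(system[i][1]) for i in range(k)]

# Usage: k = 4 source pieces, n = 8 stored pieces; nodes 1 and 3 are unavailable.
rng = random.Random(7)
pieces = [bytes(rng.randrange(256) for _ in range(16)) for _ in range(4)]
matrix = make_matrix(4, 8, rng)
coded = encode(pieces, matrix)
survivors = {r: coded[r] for r in (0, 2, 4, 5, 6, 7)}
restored = decode(survivors, matrix, 4)
```

In this sketch, a piece whose unit row survives needs no scaling and leaves other unit rows untouched during elimination, so the work concentrates on the pieces that must be recovered from random rows. This is consistent with the abstract's observation that decoding cost falls as node availability rises, although the sketch makes no attempt to reproduce the reported timings.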
