Treffer: On the reliability analysis and robust straggler tolerance of data-center storage with erasure coding = 利用糾刪碼打造可靠高效的數據中心 ; 利用糾刪碼打造可靠高效的數據中心 ; On the reliability analysis and robust straggler tolerance of data-center storage with erasure coding = Li yong jiu shan ma da zao ke kao gao xiao de shu ju zhong xin ; Li yong jiu shan ma da zao ke kao gao xiao de shu ju zhong xin
Chinese
Weitere Informationen
Ph.D. ; Modern data centers provide cloud computing services and big data analytics by clustering commodity machines. failures are commonplace in such an environment, which degrade data availability and durability. Erasure coding protects data storage against failures with low redundancy ratio. However, the penalty in data recovery prevents erasure coding being further adopted in practice. To guarantee reliability and boost performance of modern data centers with erasure coding, we study two problems in real-world data centers and present our systems. In this thesis, we aim to estimate the reliability of data centers with erasure coding and achieve low latencies with erasure-coded caching. ; Firstly, we study the problem of analyzing the reliability on modern data centers with erasure coding. Modern data centers adopt a hierarchical architecture in which multiple nodes are grouped into racks and the cross-rack bandwidth is typically more limited than the inner-rack bandwidth. In such a data center, there are two types of redundancy placement schemes: one is flat placement, in which erasure-coded data is distributed across distinct nodes, each of which is located in a different rack in order to maximize the fault tolerance against rack failures; another is hierarchical placement, in which erasure-coded data is distributed across fewer racks, or equivalently multiple nodes per rack, to trade rack-level fault tolerance for the reduction of cross-rack repair traffic. Thus, the choices of erasure codes and redundancy placement in erasure-coded data centers raise new reliability issues. Chapter 3 explores to conduct a comprehensive reliability analysis by taking into account the hierarchical nature of erasure-coded data centers. We build a discrete-event simulator called SimEDC, which reports various reliability metrics of an erasure-coded data center based on the configurable inputs of the data center topology, erasure codes, redundancy placement, and failure/repair patterns of different subsystems obtained from ...