Trilogy: Data placement to improve performance and robustness of cloud computing

Chin-Jung Hsu, Vincent W. Freeh, Flavio Villanustre

Research output: Contribution to journalArticle

3 Scopus citations

Abstract

cluster for each application for each workload. When the workload changes, a cluster will be either underutilized (wasting resources) or unable to meet demand (incurring opportunity costs). Consequently, efficient cluster resizing requires proper data replication and placement. Our work reveals that coarse-grain, workload-aware replication addresses over-utilization but cannot resolve under-utilization. With fine-grain partitioning of the dataset, data replication can reduce both under- and over-utilization. In our empirical studies, compared to a näive uniform data replication a coarse-grain workload-aware replication increases throughput by 81% on a highly-skewed workload. A fine-grain scheme further reaches 166% increase. Furthermore, a surprisingly small increase in granularity is sufficient to obtain most benefits. Evaluations also show that maximizing the number of unique partitions per node increases robustness to tolerate workload deviation while minimizing this number reduces storage footprint.
Original languageAmerican English
JournalIEEE Software
DOIs
StatePublished - 2017

Fingerprint Dive into the research topics of 'Trilogy: Data placement to improve performance and robustness of cloud computing'. Together they form a unique fingerprint.

  • Cite this