Cloud versus in-house cluster: Evaluating amazon cluster compute instances for running MPI applications

Yan Zhai*, Mingliang Liu, Jidong Zhai, Xiaosong Ma, Wenguang Chen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

71 Citations (Scopus)

Abstract

The emergence of cloud services brings new possibilities for constructing and using HPC platforms. However, while cloud services provide the exibility and convenience of customized, pay-as-you-go parallel computing, multiple previous studies in the past three years have indicated that cloudbased clusters need a significant performance boost to become a competitive choice, especially for tightly coupled parallel applications. In this work, we examine the feasibility of running HPC applications in clouds. This study distinguishes itself from existing investigations in several ways: 1) We carry out a comprehensive examination of issues relevant to the HPC community, including performance, cost, user experience, and range of user activities. 2) We compare an Amazon EC2-based platform built upon its newly available HPCoriented virtual machines with typical local cluster and supercomputer options, using benchmarks and applications with scale and problem size unprecedented in previous cloud HPC studies. 3) We perform detailed performance and scalability analysis to locate the chief limiting factors of the state-of-the-art cloud based clusters. 4) We present a case study on the impact of per-application parallel I/O system configuration uniquely enabled by cloud services. Our results reveal that though the scalability of EC2-based virtual clusters still lags behind traditional HPC alternatives, they are rapidly gaining in overall performance and cost-effectiveness, making them feasible candidates for performing tightly coupled scientific computing. In addition, our detailed benchmarking and profiling discloses and analyzes several problems regarding the performance and performance stability on EC2.

Original languageEnglish
Title of host publicationState of the Practice Reports, SC'11
DOIs
Publication statusPublished - 2011
Externally publishedYes
EventState of the Practice Reports, SC'11 - Seattle, WA, United States
Duration: 12 Nov 201118 Nov 2011

Publication series

NameState of the Practice Reports, SC'11

Conference

ConferenceState of the Practice Reports, SC'11
Country/TerritoryUnited States
CitySeattle, WA
Period12/11/1118/11/11

Fingerprint

Dive into the research topics of 'Cloud versus in-house cluster: Evaluating amazon cluster compute instances for running MPI applications'. Together they form a unique fingerprint.

Cite this