ACIC: Automatic cloud I/O configurator for HPC applications

Mingliang Liu, Ye Jin, Jidong Zhai, Yan Zha, Qianqian Shi, Xiaosong Ma, Wenguang Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Citations (Scopus)

Abstract

The cloud has become a promising alternative to tradi-tional HPC centers or in-house clusters. This new environ-ment highlights the I/O bottleneck problem, typically with top-of-the-line compute instances but sub-par communica-tion and I/O facilities. It has been observed that changing cloud I/O system configurations leads to significant varia-tion in the performance and cost efficiency of I/O intensive HPC applications. However, storage system configuration is tedious and error-prone to do manually, even for experts. This paper proposes ACIC, which takes a given applica-tion running on a given cloud platform, and automatically searches for optimized I/O system configurations. ACIC utilizes machine learning models to perform black-box per-formance/cost predictions. To tackle the high-dimensional parameter exploration space unique to cloud platforms, we enable affordable, reusable, and incremental training guided by Plackett and Burman Matrices. Results with four repre-sentative applications indicate that ACIC consistently iden-tiffes near-optimal configurations among a large group of candidate settings.

Original languageEnglish
Title of host publicationProceedings of SC 2013
Subtitle of host publicationThe International Conference for High Performance Computing, Networking, Storage and Analysis
PublisherIEEE Computer Society
ISBN (Print)9781450323789
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event2013 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013 - Denver, CO, United States
Duration: 17 Nov 201322 Nov 2013

Publication series

NameInternational Conference for High Performance Computing, Networking, Storage and Analysis, SC
ISSN (Print)2167-4329
ISSN (Electronic)2167-4337

Conference

Conference2013 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013
Country/TerritoryUnited States
CityDenver, CO
Period17/11/1322/11/13

Keywords

  • Cloud Computing
  • Modeling
  • Performance
  • Storage

Fingerprint

Dive into the research topics of 'ACIC: Automatic cloud I/O configurator for HPC applications'. Together they form a unique fingerprint.

Cite this