Reducing data movement costs using energy-efficient, active computation on SSD

Devesh Tiwari, Sudharshan S. Vazhkudai, Youngjae Kim, Xiaosong Ma, Simona Boboila, Peter J. Desnoyers

Research output: Contribution to conferencePaperpeer-review

33 Citations (Scopus)

Abstract

Modern scientific discovery often involves running complex application simulations on supercomputers, followed by a sequence of data analysis tasks on smaller clusters. This offline approach suffers from significant data movement costs such as redundant I/O, storage bandwidth bottleneck, and wasted CPU cycles, all of which contribute to increased energy consumption and delayed end-to-end performance. Technology projections for an exascale machine indicate that energy-efficiency will become the primary design metric. It is estimated that the energy cost of data movement will soon rival the cost of computation. Consequently, we can no longer ignore the data movement costs in data analysis. To address these challenges, we advocate executing data analysis tasks on emerging storage devices, such as SSDs. Typically, in extreme-scale systems, SSDs serve only as a temporary storage system for the simulation output data. In our approach, Active Flash, we propose to conduct in-situ data analysis on the SSD controller without degrading the performance of the simulation job. By migrating analysis tasks closer to where the data resides, it helps reduce the data movement cost. We present detailed energy and performance models for both active flash and offline strategies, and study them using extreme-scale application simulations, commonly used data analytics kernels, and supercomputer system configurations. Our evaluation suggests that active flash is a promising approach to alleviate the storage bandwidth bottleneck, reduce the data movement cost, and improve the overall energy efficiency.

Original languageEnglish
Publication statusPublished - 2012
Externally publishedYes
Event2012 Workshop on Power-Aware Computing Systems, HotPower 2012 - Hollywood, United States
Duration: 7 Oct 2012 → …

Conference

Conference2012 Workshop on Power-Aware Computing Systems, HotPower 2012
Country/TerritoryUnited States
CityHollywood
Period7/10/12 → …

Fingerprint

Dive into the research topics of 'Reducing data movement costs using energy-efficient, active computation on SSD'. Together they form a unique fingerprint.

Cite this