Characterization and Modeling of PIDX Parallel I/O for Performance Optimization

Sidharth Kumar, Avishek Saha, Venkatram Vishwanath, Philip Carns, John A. Schmidt, Giorgio Scorzelli, Hemanth Kolla, Ray Grout, Robert Latham, Robert Ross, Michael E. Papka, Jacqueline Chen, Valerio Pascucci

Research output: Contribution to conferencePaperpeer-review

19 Scopus Citations

Abstract

Parallel I/O library performance can vary greatly in re-sponse to user-tunable parameter values such as aggrega-tor count, file count, and aggregation strategy. Unfortu-nately, manual selection of these values is time consuming and dependent on characteristics of the target machine, the underlying file system, and the dataset itself. Some charac-teristics, such as the amount of memory per core, can also impose hard constraints on the range of viable parameter values. In this work we address these problems by using machine learning techniques to model the performance of the PIDX parallel I/O library and select appropriate tun-able parameter values. We characterize both the network and I/O phases of PIDX on a Cray XE6 as well as an IBM Blue Gene/P system. We use the results of this study to develop a machine learning model for parameter space ex-ploration and performance prediction.

Original languageAmerican English
DOIs
StatePublished - 2013
Event2013 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013 - Denver, CO, United States
Duration: 17 Nov 201322 Nov 2013

Conference

Conference2013 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013
Country/TerritoryUnited States
CityDenver, CO
Period17/11/1322/11/13

NREL Publication Number

  • NREL/CP-2C00-62250

Other Report Number

  • Article No. 67

Keywords

  • I/O & Network Characterization
  • Performance Modeling

Fingerprint

Dive into the research topics of 'Characterization and Modeling of PIDX Parallel I/O for Performance Optimization'. Together they form a unique fingerprint.

Cite this