Abstract
Parallel I/O library performance can vary greatly in re-sponse to user-tunable parameter values such as aggrega-tor count, file count, and aggregation strategy. Unfortu-nately, manual selection of these values is time consuming and dependent on characteristics of the target machine, the underlying file system, and the dataset itself. Some charac-teristics, such as the amount of memory per core, can also impose hard constraints on the range of viable parameter values. In this work we address these problems by using machine learning techniques to model the performance of the PIDX parallel I/O library and select appropriate tun-able parameter values. We characterize both the network and I/O phases of PIDX on a Cray XE6 as well as an IBM Blue Gene/P system. We use the results of this study to develop a machine learning model for parameter space ex-ploration and performance prediction.
Original language | American English |
---|---|
DOIs | |
State | Published - 2013 |
Event | 2013 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013 - Denver, CO, United States Duration: 17 Nov 2013 → 22 Nov 2013 |
Conference
Conference | 2013 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013 |
---|---|
Country/Territory | United States |
City | Denver, CO |
Period | 17/11/13 → 22/11/13 |
NREL Publication Number
- NREL/CP-2C00-62250
Other Report Number
- Article No. 67
Keywords
- I/O & Network Characterization
- Performance Modeling