Abstract
We present a new software, HYPPO, that enables the automatic tuning of hyperparameters of various deep learning (DL) models. Unlike other hyperparameter optimization (HPO) methods, HYPPO uses adaptive surrogate models and directly accounts for uncertainty in model predictions to find accurate and reliable models that make robust predictions. Using asynchronous nested parallelism, we are able to significantly alleviate the computational burden of training complex architectures and quantifying the uncertainty. HYPPO is implemented in Python and can be used with both TensorFlow and PyTorch libraries. We demonstrate various software features on time-series prediction and image classification problems as well as a scientific application in computed tomography image reconstruction. Finally, we show that (1) we can reduce by an order of magnitude the number of evaluations necessary to find the most optimal region in the hyperparameter space and (2) we can reduce by two orders of magnitude the throughput for such HPO process to complete.
Original language | American English |
---|---|
Pages | 81-93 |
Number of pages | 13 |
DOIs | |
State | Published - 2021 |
Event | 7th IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, MLHPC 2021 - St. Louis, United States Duration: 15 Nov 2021 → … |
Conference
Conference | 7th IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, MLHPC 2021 |
---|---|
Country/Territory | United States |
City | St. Louis |
Period | 15/11/21 → … |
Bibliographical note
Publisher Copyright:© 2021 IEEE.
NREL Publication Number
- NREL/CP-2C00-82274
Keywords
- adaptation models
- computational modeling
- parallel processing
- predictive models
- stochastic processes
- training
- uncertainty