Machine Learning Prediction of Enzyme Optimum pH

Japheth Gado, Matthew Knotts, Ada Shaw, Debora Marks, Nicholas Gauthier, Chris Sander, Gregg Beckham

Research output: Contribution to journalArticlepeer-review

1 Scopus Citations

Abstract

The relationship between pH and enzyme catalytic activity, especially the optimal pH (pHopt) at which enzymes function, is critical for biotechnological applications. Hence, computational methods to predict pHopt will enhance enzyme discovery and design by facilitating accurate identification of enzymes that function optimally at specific pH levels, and by elucidating sequence-function relationships. Here we proposed and evaluated various machine learning methods for predicting pHopt, conducting extensive hyperparameter optimization and training over 11,000 model instances. Our results demonstrate that models utilizing language model embeddings markedly outperform other methods in predicting pHopt. We present EpHod, the best-performing model, to predict pHopt, making it publicly available to researchers. From sequence data, EpHod directly learns structural and biophysical features that relate to pHopt, including proximity of residues to the catalytic centre and the accessibility of solvent molecules. Overall, EpHod presents a promising advancement in pHopt prediction and will potentially speed up the development of enzyme technologies.
Original languageAmerican English
Pages (from-to)716-729
Number of pages14
JournalNature Machine Intelligence
Volume7
DOIs
StatePublished - 2025

NREL Publication Number

  • NREL/JA-2A00-86608

Keywords

  • enzyme function
  • enzymes

Fingerprint

Dive into the research topics of 'Machine Learning Prediction of Enzyme Optimum pH'. Together they form a unique fingerprint.

Cite this