TY - GEN
T1 - Large Language Models (LLMs) for Energy Systems Research
AU - Buster, Grant
PY - 2023
Y1 - 2023
N2 - The integration of Large Language Models (LLMs) in energy systems research promises transformative results, as demonstrated in this work, particularly in the realms of information retrieval and legal document analysis. We have developed a chat-based interface, specifically designed to query an extensive corpus of technical reports from the National Renewable Energy Laboratory (NREL). This interface capitalizes on the natural language processing capabilities of LLMs, providing future consumers of NREL research with a user-friendly platform to access and extract valuable information from technical documents, thus enhancing the dissemination of research to the public. In addition to information retrieval, we have employed LLMs to extract renewable energy siting ordinances from a variety of legal documents, a task traditionally driven by significant human labor. This automated extraction not only supports the ongoing development of the high-impact NREL siting ordinance database but also ensures the database's accuracy and comprehensiveness. Crucially, we have augmented the performance of LLMs through the integration of a decision tree framework, resulting in a substantial improvement in extraction accuracy. Comparative analysis with manual efforts has shown that this approach not only rivals but also significantly surpasses human accuracy, heralding increased reliability in legal document analysis for energy systems research. To democratize access to these advancements and foster collaborative research, we introduce the "Energy Language Model" (ELM), an open-source software package. ELM encapsulates the methodologies and tools developed in this work, providing researchers and practitioners with a robust toolkit to conduct similar analyses within their respective domains. Through these contributions, this work underscores the immense potential of LLMs in revolutionizing energy systems research, improving accuracy, efficiency, and accessibility in the field.
AB - The integration of Large Language Models (LLMs) in energy systems research promises transformative results, as demonstrated in this work, particularly in the realms of information retrieval and legal document analysis. We have developed a chat-based interface, specifically designed to query an extensive corpus of technical reports from the National Renewable Energy Laboratory (NREL). This interface capitalizes on the natural language processing capabilities of LLMs, providing future consumers of NREL research with a user-friendly platform to access and extract valuable information from technical documents, thus enhancing the dissemination of research to the public. In addition to information retrieval, we have employed LLMs to extract renewable energy siting ordinances from a variety of legal documents, a task traditionally driven by significant human labor. This automated extraction not only supports the ongoing development of the high-impact NREL siting ordinance database but also ensures the database's accuracy and comprehensiveness. Crucially, we have augmented the performance of LLMs through the integration of a decision tree framework, resulting in a substantial improvement in extraction accuracy. Comparative analysis with manual efforts has shown that this approach not only rivals but also significantly surpasses human accuracy, heralding increased reliability in legal document analysis for energy systems research. To democratize access to these advancements and foster collaborative research, we introduce the "Energy Language Model" (ELM), an open-source software package. ELM encapsulates the methodologies and tools developed in this work, providing researchers and practitioners with a robust toolkit to conduct similar analyses within their respective domains. Through these contributions, this work underscores the immense potential of LLMs in revolutionizing energy systems research, improving accuracy, efficiency, and accessibility in the field.
KW - automation
KW - chatgpt
KW - large language model (LLM)
KW - legal documents
KW - renewable energy
KW - siting ordinances
KW - text parsing
M3 - Presentation
T3 - Presented at the Solar Applications of Artificial Intelligence and Machine Learning Workshop, 31 October - 1 November 2023, Alexandria, Virginia
ER -