Abstract
PV power and irradiance sensor-based measurements are prone to error, resulting in issues such as abrupt time series data shifts. These shifts, which are usually unintentional, may be caused by software or hardware configuration changes on a PV system, and do not reflect an actual change in overall system performance. Locating these shifts and segmenting the associated time series aids in more accurate future PV analysis. In this research, an offline changepoint detection (CPD) algorithm that automatically detects these abrupt data shifts in sensor-based time series is introduced. Data shift periods in 101 daily PV power and irradiance time series were labeled manually by two solar experts. These data streams represent sensor-based measurements, and display a variety of data shift behaviors. A changepoint detection algorithm was tuned using the 101 labeled data streams, with each model configuration's ability to detect labeled changepoints benchmarked using metrics such as F1-score, recall, and Rand Index. Best performing models on seasonality-corrected data streams include the Pruned Exact Linear (PELT) method, the Binary Segmentation method, and the Bottom-Up method, all scoring an average F1-score of 0.76 or greater at detecting labeled changepoints within a 30-day window for the labeled data sets. To promote further research in this space, we are releasing the labeled data shift sets on U.S. Department of Energy's (DOE) DuraMAT Data Hub, and the associated algorithm in the Python PVAnalytics package.
Original language | American English |
---|---|
Number of pages | 8 |
State | Published - 2022 |
Event | 49th IEEE Photovoltaic Specialists Conference (PVSC 49) - Philadelphia, Pennsylvania Duration: 5 Jun 2022 → 10 Jun 2022 |
Conference
Conference | 49th IEEE Photovoltaic Specialists Conference (PVSC 49) |
---|---|
City | Philadelphia, Pennsylvania |
Period | 5/06/22 → 10/06/22 |
NREL Publication Number
- NREL/CP-5K00-83062
Keywords
- changepoint detection
- data shifts
- solar
- time series