TY - JOUR
T1 - Hybrid machine learning system based on multivariate data decomposition and feature selection for improved multitemporal evapotranspiration forecasting
AU - Lee, Jinwook
AU - Bateni, Sayed M.
AU - Jun, Changhyun
AU - Heggy, Essam
AU - Jamei, Mehdi
AU - Kim, Dongkyun
AU - Ghafouri, Hamid Reza
AU - Deenik, Jonathan L.
N1 - Publisher Copyright:
© 2024 Elsevier Ltd
PY - 2024/9
Y1 - 2024/9
N2 - Evapotranspiration is an essential component of the hydrological cycle. Forecasting the reference crop evapotranspiration (ETo) using a reliable and generalized framework is crucial for agricultural operations, especially irrigation. This study was aimed at evaluating the performance of a hybrid system including the K-Best selection (KBest), multivariate variational mode decomposition (MVMD), and Machine learning (ML) models for 1-, 3-, 7-, and 10-day-ahead forecasting of the daily ETo in twelve stations of California. The analysis covered a span of 20 years, from 2003 to 2022. Three stand-alone ML models, namely Cascade Forward Neural Network (CFNN), Extreme Learning Machine (ELM), and Bagging Regression Tree (BRT) are used and were integrated with various preprocessing techniques to construct three hybrid models, i.e., MVMD-KBest-CFNN, MVMD-KBest-ELM, and MVMD-KBest-BRT. According to the results obtained in the testing phase, averaged across all stations, all three stand-alone models (CFNN, ELM, and BRT) yielded similar outcomes. In contrast, the hybrid models exhibited significantly enhanced performances compared with the standalone models, and MVMD-KBest-CFNN and MVMD-KBest-ELM models outperformed MVMD-KBest-BRT model. The BRT-based models were vulnerable to overfitting. The performance of the best models is superior compared to similar existing studies. Examining the variations across stations, it was found that the stations located further from the coast and in arid regions could be susceptible to prediction errors and necessitate more attention.
AB - Evapotranspiration is an essential component of the hydrological cycle. Forecasting the reference crop evapotranspiration (ETo) using a reliable and generalized framework is crucial for agricultural operations, especially irrigation. This study was aimed at evaluating the performance of a hybrid system including the K-Best selection (KBest), multivariate variational mode decomposition (MVMD), and Machine learning (ML) models for 1-, 3-, 7-, and 10-day-ahead forecasting of the daily ETo in twelve stations of California. The analysis covered a span of 20 years, from 2003 to 2022. Three stand-alone ML models, namely Cascade Forward Neural Network (CFNN), Extreme Learning Machine (ELM), and Bagging Regression Tree (BRT) are used and were integrated with various preprocessing techniques to construct three hybrid models, i.e., MVMD-KBest-CFNN, MVMD-KBest-ELM, and MVMD-KBest-BRT. According to the results obtained in the testing phase, averaged across all stations, all three stand-alone models (CFNN, ELM, and BRT) yielded similar outcomes. In contrast, the hybrid models exhibited significantly enhanced performances compared with the standalone models, and MVMD-KBest-CFNN and MVMD-KBest-ELM models outperformed MVMD-KBest-BRT model. The BRT-based models were vulnerable to overfitting. The performance of the best models is superior compared to similar existing studies. Examining the variations across stations, it was found that the stations located further from the coast and in arid regions could be susceptible to prediction errors and necessitate more attention.
KW - Hybrid system
KW - KBest selection
KW - Multitemporal forecasting
KW - Multivariate variational mode decomposition
KW - Reference evapotranspiration
UR - http://www.scopus.com/inward/record.url?scp=85194966527&partnerID=8YFLogxK
U2 - 10.1016/j.engappai.2024.108744
DO - 10.1016/j.engappai.2024.108744
M3 - Article
AN - SCOPUS:85194966527
SN - 0952-1976
VL - 135
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
M1 - 108744
ER -