Improving Reliability of Hydrological Flow Estimation using Hydroinformatics Approach

Publication Type:
Thesis
Issue Date:
2019
Full metadata record
Application of hydroinformatics tools in water resources has been very common in water industry due to the rapid advancement of digital computer. Over the last few decades, there are several tools have been developed and applied with success. The most commonly used Artificial Intelligence (AI) based hydroinformatics tools in hydrology are Genetic Programming (GP), Artificial Neural Network (ANN), Fuzzy Logic (FL), Standard Chaos Technique, Inverse Approach, Support Vector Machine (SVM) and Evolutionary Computation (Genetic Algorithm (GA), Shuffled Complex Evolution (SCE), Particle Swarm Optimization (PSO), Ant Colony Optimization Algorithm (ACOA)) based AI techniques including SVM (EC-SVM). These tools including Genetic Programming (GP) have been proven to be efficient in prediction of flows from event based rainfalls series. The driving factor behind the application of hydroinformatics tools was to ease the complex numerical modelling process. In principal, both conceptual and physically based distributed models require a large number of parameters such as catchment characteristics, losses, flow paths, meteorological and flow data. The values of some of these parameters are evaluated through calibration. The calibration process of complex models may be cumbersome and requires considerable effort and experience particularly when the number of the calibration parameters is large. Even though the model is calibrated, the application of the parameters is catchment specific. Model parameters from one catchment may not be representative for the other catchment. In this case, hydroinformatics tools like GP and/or ANN can be used where no parameters associated with catchment and soil characteristic are necessary. GP has been successfully applied for calibration of numerous event based rainfall and runoff models. However, application of GP for the prediction of long term time series is limited. The application of GP for long term runoff prediction from a dam catchment is demonstrated. The model is developed and calibrated for a dam catchment located in New South Wales, Australia. The calibration shows excellent agreement between the observed and simulated flows recorded over thirty years and the results are better than traditional Sacramento model and ANN. GP is also linked to MIKE11-NAM to build a hybrid model. The concept of this hybrid model is to fill the data gaps and generate long term (100 years) predictions. The calibrated GP model is then applied for the assessment of two future rainfall scenarios where future hundred year flows are predicted using rainfall input generated from different assumed climatic conditions. The analysis results provide some basis for making future water management plans including water supply from alternative sources. While the application was successful and produced better results, it was found that GP suffered from computational overhead in the learning process from input data. To improve the prediction accuracy, relatively new AI technique, called Extreme Learning Machine (ELM) is proposed. ELM is applied to partly overcome the slow learning problems of GP and ANN and to predict the hydrological time-series very quickly. ELM, which is also called single-hidden layer feed-forward neural networks (SLFNs), is able to well generalize the performance for extremely complex problems. ELM randomly chooses a single hidden layer and analytically determines the weights to predict the output. The ELM method was applied to predict hydrological flow series for the Tryggevælde Catchment, Denmark and for the Mississippi River at Vicksburg, USA. The results confirmed that ELM’s performance was similar or better in terms of Root Mean Square Error (RMSE) and Normalized Root Mean Square Error (NRMSE) compared to ANN and other previously published techniques, namely Evolutionary Computation based Support Vector Machine (EC-SVM), Standard Chaotic Approach and Inverse Approach. In this analysis, the sensitivity of ELM’s input parameters on the prediction accuracy were not investigated. The influence of input parameters was then analysed to further improve the model results. The robustness of ELM’s performances based on number of lagged input variables, the number of hidden nodes in ELM, higher lead days prediction and extrapolation capability using four goodness-of-fit measures is demonstrated. The results show that (1) ELM yields reasonable results with all combinations of lagged input variables (flows) for 1-day lead prediction. The minimum errors were obtained when 4-day lagged flows were applied as input variables; (2) ELM produced satisfactory results very rapidly for any number of hidden nodes ranging from ten to six thousand in the hidden layer. The time required to train ELM varies from less than a second to two minutes as only single iteration is required. A larger number of hidden nodes generally gives slightly better results; (3) ELM generated reasonable results for higher number of lead days (second and third) predictions; (4) ELM was able to extrapolate when the highest magnitude of input variables were excluded from training dataset; (5) ELM was shown to be computationally much faster and capable of producing better results compared with GP and EC-SVM for prediction of flow series from the same catchment. This demonstrates ELM potential for forecasting real-time hydrological time-series. This analysis was based on node based ELM (NELM) method. The performance of ELM is further improved by introducing Kernel function (KELM) in the learning process in the subsequent analysis. In addition to node based ELM, Kernel based ELM (KELM) is also applied. The performance of KELM was also compared against hidden node based ELM (NELM). The predictive capabilities of both NELM and KELM were investigated using data from three different catchments located in three different climatic regions (Tryggevælde catchment, Denmark, Mississippi River at Vicksburg, USA and Duckmaloi Weir catchment, Australia). The results were compared with those obtained with Genetic Programming (GP) and evolutionary computation based Support Vector Machine (EC-SVM), the later obtained from literature. The results show that KELM predictions were better than NELM, GP and EC-SVM. KELM ran faster than any other model. ELM’s fast learning capability from a training dataset for the prediction of hydrological flows means that it would be more suitable for on-line and real-time applications where quick processing time is important or vital. The study demonstrates ELM’s ability for rapid prediction and has potential application in real-time forecasting and in water resources planning and management.
Please use this identifier to cite or link to this item: