Data Reconstruction for Groundwater Wells Proximal to Lakes: A Quantitative Assessment for Hydrological Data Imputation

Loading...
Publication Logo

Date

2025

Authors

Murat Can
Babak Vaheddoost
Mir Jafar Sadegh Safari

Journal Title

Journal ISSN

Volume Title

Publisher

MDPI

Open Access Color

GOLD

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

The reconstruction of missing groundwater level data is of great importance in hydrogeological and environmental studies. This study provides a comprehensive and sequential approach for the reconstruction of groundwater level data near Lake Uluabat in Bursa Turkey. This study addresses missing data reconstruction for both past and future events using the Gradient Boosting Regression (GBR) model. The reconstruction process is evaluated through model calibration metrics and changes in the statistical properties of the observed and reconstructed time series. To achieve this goal the groundwater time series from two observational wells and lake water levels during the January 2004 to September 2019 period are used. The lake water level the definition of the four seasons via the application of three dummy variables and time are used as inputs in the prediction of groundwater levels in observation wells. The optimal GBR model calibration is achieved by training the dataset selected based on data gaps in the time series while test-past and test-future datasets are used for model validation. Afterward the GBR models are used in reconstructing the missing data both in the pre- and post-training data sets and the performance of the models are evaluated via the Nash-Sutcliffe efficiency (NSE) Root Mean Square Percentage Error (RMSPE) and Performance Index (PI). The statistical properties of the time series including the probability distribution maxima minima quartiles (Q1-Q3) standard error (SE) coefficient of variation (CV) entropy (H) and error propagation are also measured. It was concluded that GBR provides a good base for missing data reconstruction (the best performance was as high as NSE: 0.99 RMSPE: 0.36 and PI: 1.002). In particular the standard error and the entropy of the system in one case respectively experienced a 53% and 35% rise which was found to be tolerable and negligible.

Description

Keywords

distribution changes, entropy, gradient boosting regression, groundwater level, Lake Uluabat, ULUABAT, SERIES, distribution changes, Lake Uluabat, gradient boosting regression, groundwater level, entropy

Fields of Science

Citation

WoS Q

Scopus Q

OpenCitations Logo
OpenCitations Citation Count
3

Source

Water

Volume

17

Issue

Start Page

718

End Page

PlumX Metrics
Citations

Scopus : 2

Captures

Mendeley Readers : 2

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
3.4098

Sustainable Development Goals

SDG data is not available