Data Reconstruction for Groundwater Wells Proximal to Lakes: A Quantitative Assessment for Hydrological Data Imputation

dc.contributor.author Murat Can
dc.contributor.author Babak Vaheddoost
dc.contributor.author Mir Jafar Sadegh Safari
dc.date MAR
dc.date.accessioned 2025-10-06T16:23:04Z
dc.date.issued 2025
dc.description.abstract The reconstruction of missing groundwater level data is of great importance in hydrogeological and environmental studies. This study provides a comprehensive and sequential approach for the reconstruction of groundwater level data near Lake Uluabat in Bursa Turkey. This study addresses missing data reconstruction for both past and future events using the Gradient Boosting Regression (GBR) model. The reconstruction process is evaluated through model calibration metrics and changes in the statistical properties of the observed and reconstructed time series. To achieve this goal the groundwater time series from two observational wells and lake water levels during the January 2004 to September 2019 period are used. The lake water level the definition of the four seasons via the application of three dummy variables and time are used as inputs in the prediction of groundwater levels in observation wells. The optimal GBR model calibration is achieved by training the dataset selected based on data gaps in the time series while test-past and test-future datasets are used for model validation. Afterward the GBR models are used in reconstructing the missing data both in the pre- and post-training data sets and the performance of the models are evaluated via the Nash-Sutcliffe efficiency (NSE) Root Mean Square Percentage Error (RMSPE) and Performance Index (PI). The statistical properties of the time series including the probability distribution maxima minima quartiles (Q1-Q3) standard error (SE) coefficient of variation (CV) entropy (H) and error propagation are also measured. It was concluded that GBR provides a good base for missing data reconstruction (the best performance was as high as NSE: 0.99 RMSPE: 0.36 and PI: 1.002). In particular the standard error and the entropy of the system in one case respectively experienced a 53% and 35% rise which was found to be tolerable and negligible.
dc.identifier.doi 10.3390/w17050718
dc.identifier.issn 2073-4441
dc.identifier.uri http://dx.doi.org/10.3390/w17050718
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/7680
dc.language.iso English
dc.publisher MDPI
dc.relation.ispartof Water
dc.source WATER
dc.subject distribution changes, entropy, gradient boosting regression, groundwater level, Lake Uluabat
dc.subject ULUABAT, SERIES
dc.title Data Reconstruction for Groundwater Wells Proximal to Lakes: A Quantitative Assessment for Hydrological Data Imputation
dc.type Article
dspace.entity.type Publication
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.startpage 718
gdc.description.volume 17
gdc.identifier.openalex W4408091417
gdc.index.type WoS
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 3.0
gdc.oaire.influence 2.4661302E-9
gdc.oaire.isgreen false
gdc.oaire.keywords distribution changes
gdc.oaire.keywords Lake Uluabat
gdc.oaire.keywords gradient boosting regression
gdc.oaire.keywords groundwater level
gdc.oaire.keywords entropy
gdc.oaire.popularity 4.7471036E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration International
gdc.openalex.fwci 3.4098
gdc.openalex.normalizedpercentile 0.91
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 3
gdc.plumx.mendeley 2
gdc.plumx.scopuscites 2
gdc.virtual.author Safari, Mir Jafar Sadegh
person.identifier.orcid Vaheddoost- Babak/0000-0002-4767-6660, Safari- Mir Jafar Sadegh/0000-0003-0559-5261
publicationissue.issueNumber 5
publicationvolume.volumeNumber 17
relation.isAuthorOfPublication 08e59673-4869-4344-94da-1823665e239d
relation.isAuthorOfPublication.latestForDiscovery 08e59673-4869-4344-94da-1823665e239d
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files