Advancements in water quality prediction: a practical review of machine learning and deep learning approaches

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computación
dc.contributor.authorHelaly, Marwah A.
dc.contributor.authorRady, Sherine
dc.contributor.authorMabrouk, Mohamed
dc.contributor.authorAref, Mostafa M.
dc.contributor.authorVillarroya Fernández, Sebastián
dc.contributor.authorCotos Yáñez, José Manuel
dc.contributor.authorMera Pérez, David
dc.date.accessioned2025-10-21T09:28:52Z
dc.date.available2025-10-21T09:28:52Z
dc.date.issued2025-08-30
dc.description.abstractWater quality plays a pivotal role in ensuring the safety and sustainability of water resources, with significant implications for environmental protection, public health, and various industrial applications. This paper presents both a review of related state-of-the-art works and an implementation and application of adapted versions of these related works for predicting water quality parameters on a new water dataset from Galicia, Spain. The reviewed studies encompass a range of predictive models applied to diverse water quality parameters, including dissolved oxygen levels, pH levels, and other complex water parameters. These models include various machine learning and deep learning methods such as Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM) networks, and Bidirectional LSTMs. This research contributes by implementing various models on the dataset and experimentally demonstrating the impact of key factors on model performance. These factors include model sophistication, imputation techniques, recurrent architectures, and customized approaches for water quality prediction using deep learning. Notably, K-Nearest Neighbors (KNN) imputation enhances performance by preserving local data relationships, while noise filtering further improves predictive accuracy. Additionally, we observe that smaller batch sizes and learning rates lead to better generalization in sparse datasets, outperforming traditional approaches. The conclusions are guided by comparing the performance of all models on the Galician dataset using the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Coefficient of Determination (R2 ). This paper provides the first DL-based water quality analysis for Galicia, emphasizing the need for regional model adaptation. Our results guide future research directions, including the exploration of Transformer-based architectures for time-series data, more sophisticated feature selection techniques, and neural-network-based imputation strategies to enhance data completeness.
dc.description.peerreviewedSI
dc.description.sponsorshipOpen access funding provided by The Science, Technology & Innovation Funding Authority (STDF) in cooperation with The Egyptian Knowledge Bank (EKB).
dc.identifier.citationA. Helaly, M., Rady, S., Mabrouk, M. et al. Advancements in water quality prediction: a practical review of machine learning and deep learning approaches. Cluster Comput 28, 598 (2025). https://doi.org/10.1007/s10586-025-05221-3
dc.identifier.doi10.1007/s10586-025-05221-3
dc.identifier.essn1386-7857
dc.identifier.issn1386-7857
dc.identifier.urihttps://hdl.handle.net/10347/43312
dc.issue.number598
dc.journal.titleCluster Computing
dc.language.isoeng
dc.publisherSpringer
dc.relation.publisherversionhttps://doi.org/10.1007/s10586-025-05221-3
dc.rights©The Author(s) 2025. This article is licensed under a Creative Commons Attribution 4.0 International License
dc.rightsAttribution 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectWater quality prediction
dc.subjectDeep learning
dc.subjectMachine learning
dc.subjectData imputation
dc.subjectConvolutional neural network
dc.subjectRecurrent neural network
dc.titleAdvancements in water quality prediction: a practical review of machine learning and deep learning approaches
dc.typejournal article
dc.type.hasVersionVoR
dc.volume.number28
dspace.entity.typePublication
relation.isAuthorOfPublication8473f69a-64ab-4e16-8d71-ce1edce20b04
relation.isAuthorOfPublicationdf8d5480-a8c8-43ec-8e3b-cf5a939ad831
relation.isAuthorOfPublicatione2ae3872-d4dd-42a0-9c4e-9f6c03f499a8
relation.isAuthorOfPublication.latestForDiscoverydf8d5480-a8c8-43ec-8e3b-cf5a939ad831

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2025_ClustComp_Helaly_Advancements
Size:
1.79 MB
Format:
Adobe Portable Document Format