Volume 61, Issue 4 p. 510-516
Research Paper/

Susceptibility Assessment of Groundwater Nitrate Contamination Using an Ensemble Machine Learning Approach

Farzaneh Sajedi Hosseini

Corresponding Author

Farzaneh Sajedi Hosseini

Corresponding author: Reclamation of Arid and Mountainous Regions Department, Faculty of Natural Resources, University of Tehran, Karaj, Iran; [email protected]

Search for more papers by this author
Bahram Choubin

Bahram Choubin

Soil Conservation and Watershed Management Research Department, West Azarbaijan Agricultural and Natural Resources Research and Education Center, AREEO, Urmia, Iran

Search for more papers by this author
Mehdi Bagheri-Gavkosh

Mehdi Bagheri-Gavkosh

Irrigation and Reclamation Engineering Department, University of Tehran, P.O. Box 31587-77871 Karaj, Iran

Search for more papers by this author
Omid Karimi

Omid Karimi

Department of Irrigation and Drainage, College of Aburaihan, University of Tehran, Tehran, Iran

Search for more papers by this author
Fereshteh Taromideh

Fereshteh Taromideh

Department of Irrigation, Sari Agricultural Sciences and Natural Resources University, Sari, Iran

Search for more papers by this author
Csaba Mako

Csaba Mako

Institute of Information Society, University of Public Service, 1083 Budapest, Hungary

Search for more papers by this author
First published: 20 September 2022
Citations: 1

Article impact statement: Machine learning modeling is a useful tool and effective method for accurate groundwater pollution mapping.

Abstract

Groundwater pollution susceptibility mapping using parsimonious approaches with limited data is of utmost importance for water resource and health planning, especially in data-scarce regions. Current research assesses groundwater nitrate susceptibility by considering the various combination of explanatory variables. In this study, the novel machine learning models of weighted subspace random forest (WSRF) and generalized additive model using LOESS (GAMLOESS) are applied, and the results are compared with well-known machine learning models of K-nearest neighbors (KKNN) and random forest (RF). The optimum combination of inputs for groundwater nitrate susceptibility mapping is identified using the k-fold cross-validation methodology. Results indicated that the combination of variables of precipitation, groundwater level, and lithology had the best performance among the 16 combinations. Modeling performance using the optimum combination demonstrated that the new ensemble approach, the WSRF model, had superior performance according to the evaluation metrics of accuracy (0.87), kappa (0.73), precision (0.92), false alarm ratio (0.08), and critical success index (0.75). The susceptibility assessment results of this paper can be a useful tool in developing strategies for the prevention and protection of groundwater pollution.