A case study using support vector machines, neural networks and logistic regression in a GIS to predict wells contaminated with nitrate-N.

Research output: Contribution to journalArticlepeer-review

Abstract

Accurate and inexpensive identification of potentially contaminated wells is critical for water resources protection and management. The objectives of this study are to 1) assess the suitability of approximation tools such as neural networks (NN) and support vector machines (SVM) integrated in a geographic information system (GIS) for identifying contaminated wells and 2) use logistic regression and feature selection methods to identify significant variables for transporting contaminants in and through the soil profile to the groundwater. Fourteen GIS derived soil hydrogeologic and landuse parameters were used as initial inputs in this study. Well water quality data (nitrate-N) from 6,917 wells provided by Florida Department of Environmental Protection (USA) were used as an output target class. The use of the logistic regression and feature selection methods reduced the number of input variables to nine. Receiver operating characteristics (ROC) curves were used for evaluation of these approximation tools. Results showed superior performance with the NN as compared to SVM especially on training data while testing results were comparable. Feature selection did not improve accuracy; however, it helped increase the sensitivity or true positive rate (TPR). Thus, a higher TPR was obtainable with fewer variables.

Original languageAmerican English
JournalDefault journal
StatePublished - Jan 1 2009

Keywords

  • Geographic Information Systems (GIS)
  • Vulnerability mapping
  • Groundwater protection
  • Nitrate
  • Waste water technology

Disciplines

  • Environmental Sciences

Cite this