Groundwater arsenic data and ASCII grids for predicting elevated arsenic in northwestern and central Minnesota using boosted regression tree methods
This data release contains: (1) ASCII grids of the predicted probability of elevated arsenic in groundwater for the Northwest and Central Minnesota regions, (2) input arsenic and predictor variable data used in model development and calculation of predictions, and (3) ASCII files used to predict the probability of elevated arsenic across the two study regions. The probability of elevated arsenic was predicted with Boosted Regression Tree (BRT) modeling methods, using the gbm package in R Studio version 3.4.2. The response variable was the presence or absence of arsenic at concentrations greater than 10 micrograms per liter (µg/L), the U.S. Environmental Protection Agency's maximum contaminant level for arsenic, in 3,283 wells located throughout both study regions (1,363 in the Northwest region and 1,920 in the Central region). The original database used to develop the BRT model consisted of 127 predictor variables, which included well characteristics, land use, soil properties, aquifer properties, depth to water table, and predicted nitrate. After optimization steps, a final database of 33 predictor variables was used to predict the occurrence of elevated arsenic across the two study regions.
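The binary response variable described above can be illustrated with a minimal sketch. This is not the release's code (the modeling itself was done with the gbm package in R); the function name, constant name, and sample concentrations below are hypothetical and chosen only for illustration. The threshold and the well counts are taken from the description.

```python
# U.S. EPA maximum contaminant level for arsenic, in micrograms per liter,
# as stated in the data release description.
ARSENIC_MCL_UG_PER_L = 10.0


def is_elevated(concentration_ug_per_l: float) -> int:
    """Return 1 if arsenic is strictly greater than the MCL, else 0."""
    return int(concentration_ug_per_l > ARSENIC_MCL_UG_PER_L)


# Hypothetical concentrations (not taken from the release), in micrograms per liter.
sample_concentrations = [0.5, 10.0, 10.1, 42.0, 3.2]
labels = [is_elevated(c) for c in sample_concentrations]

# Well counts per region as reported in the description.
wells_by_region = {"Northwest": 1363, "Central": 1920}
total_wells = sum(wells_by_region.values())

print(labels)
print(total_wells)
```

Note that a concentration of exactly 10 µg/L is coded as absence, because the description defines elevated arsenic as strictly greater than 10 µg/L.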
|Sarah M Elliott, Catherine A Christenson
|USGS Digital Object Identifier Catalog
|Upper Midwest Water Science Center