U.S. flag

An official website of the United States government

icon-dot-gov

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

icon-https

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Ascii grids of predicted pH in depth zones used by domestic and public drinking water supply depths, Central Valley, California

The ascii grids associated with this data release are predicted distributions of continuous pH at the drinking water depth zones in the groundwater of Central Valley, California. The two prediction grids produced in this work represent predicted pH at the domestic supply and public supply drinking water depths, respectively and are bound by the alluvial boundary that defines the Central Valley. A depth of 46 m was used to stratify wells into the shallow and deep aquifer and were derived from depth percentiles associated with domestic and public supply in previous work by Burow et al. (2013). In this work, the median well depth categorized as domestic supply was 30 meters below land surface and the median well depth categorized as public supply is 100 meters below land surface. Prediction grids were created using prediction modeling methods, specifically Boosted Regression Trees (BRT) with a gaussian error distribution within a statistical learning framework within R's computing framework (http://www.r-project.org/). The statistical learning framework seeks to maximize the predictive performance of machine learning methods through model tuning by cross validation. The response variable was measured pH from 1337 wells, and was compiled from two sources: US Geological Survey (USGS) National Water Information System (NWIS) Database (all data are publicly available from the USGS: http://waterdata.usgs.gov/ca/nwis/nwis) and the California State Water Resources Control Board Division of Drinking Water (SWRCB-DDW) database (water quality data are publicly available from the SWRCB: http://www.waterboards.ca.gov/gama/geotracker_gama.shtml). Only wells with measured pH and well depth data were selected, and for wells with multiple records, only the most recent sample in the period 1993-2014 was used. A total of 1003 wells (training dataset) were used to train the BRT model and 334 wells (hold-out dataset) were used to validate the prediction model. The training r-squared was 0.70 and the RMSE in standard pH units was were 0.26. The holdout r-squared was 0.43 and RMSE in standard pH units was 0.37. Predictor variables consisting of more than 60 variables from 7 sources (see metadata) were assembled to develop a model that incorporates regional-scale soil properties, soil chemistry, land use, aquifer textures, and aquifer hydrology. Previously developed Central Valley model outputs of textures (Central Valley Textural Model, CVTM; Faunt et al. 2010) and MODFLOW-simulated vertical water fluxes and predicted depth to water table (Central Valley Hydrologic Model, CVHM; Faunt, 2009) were used to represent aquifer textures and groundwater hydraulics, respectively. In this work, wells were attributed to predictor variable values in ArcGIS using a 500-m buffer. Results of the predictor variable influence as defined by Friedman (2001) for variables used in the final BRT model used for mapping can be downloaded from this landing page (see file named PredictorVariableInfluence_CentralValley_pH_BRT.csv).

Get Data and Metadata
Author(s) Celia Z Rosecrans, JoAnn M. Gronberg orcid, Bernard T. Nolan orcid
Publication Date 2017
Beginning Date of Data 1993
Ending Date of Data 2014
Data Contact
DOI https://doi.org/10.5066/F7FX77K4
Citation Rosecrans, C.Z., Gronberg, J.M., and Nolan, B.T., 2017, Ascii grids of predicted pH in depth zones used by domestic and public drinking water supply depths, Central Valley, California: U.S. Geological Survey data release, https://doi.org/10.5066/F7FX77K4.
Metadata Contact
Metadata Date 2020-08-12
Related Publication
Citations of these data

Loading https://doi.org/10.3133/sim3377

Access public
License http://www.usa.gov/publicdomain/label/1.0/
Loading...
Harvest Source: ScienceBase
Harvest Date: 2021-11-19T04:42:53.907Z