<?xml version='1.0' encoding='UTF-8'?>
<metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <idinfo>
    <citation>
      <citeinfo>
        <origin>Mark S. Pleasants</origin>
        <pubdate>2025</pubdate>
        <title>Application of a prior-based model discrimination framework using synthetic MODFLOW 6 groundwater models</title>
        <geoform>groundwater model</geoform>
        <pubinfo>
          <pubplace>Reston, VA, USA</pubplace>
          <publish>U.S. Geological Survey</publish>
        </pubinfo>
        <onlink>https://doi.org/10.5066/P1HH3X5W</onlink>
        <lworkcit>
          <citeinfo>
            <origin>Mark S. Pleasants</origin>
            <origin>Michael N. Fienen</origin>
            <origin>Hedeff I. Essaid</origin>
            <origin>Joel D. Blomquist</origin>
            <origin>Jing Yang</origin>
            <origin>Ming Ye</origin>
            <pubdate>2025</pubdate>
            <title>Toward a new framework to evaluate process-based model configurations and quantify data worth prior to calibration</title>
            <geoform>publication</geoform>
            <serinfo>
              <sername>Water Resources Research</sername>
              <issue>vol. 61</issue>
            </serinfo>
            <pubinfo>
              <pubplace>n/a</pubplace>
              <publish>American Geophysical Union (AGU)</publish>
            </pubinfo>
            <onlink>https://doi.org/10.1029/2025WR040323</onlink>
          </citeinfo>
        </lworkcit>
      </citeinfo>
    </citation>
    <descript>
      <abstract>A synthetic one-dimensional groundwater flow model was developed using MODFLOW 6 as a test application of the prior-based model discrimination method described in the associated publication (https://doi.org/10.1029/2025WR040323). The model represents a steady-state unconfined aquifer where water flows horizontally between two constant head boundaries (a lake at the lefthand side and river at the righthand side) and precipitation recharges the aquifer. Process module uncertainty is considered in the representations of groundwater recharge, geologic structure, and snowmelt that controls the river stage constant head boundary. Two process modules are considered to represent each of the three uncertain system processes. This yielded eight candidate configurations of the groundwater model. This USGS data release contains all of the input and output files and ancillary Python scripts for the simulations and results described in the associated publication (https://doi.org/10.1029/2025WR040323).</abstract>
      <purpose>The synthetic models in this archive were developed as a test application of the prior-based model discrimination method described in the associated publication (https://doi.org/10.1029/2025WR040323). A network of synthetic data are created using a version of the 1-D groundwater model that is distinct from the eight candidate model configurations (referred to as the "true" model). These synthetic data are used during prior-based model discrimination to determine more-adequate model configurations prior to model calibration (model calibration is not performed here). A new metric, Mahalanobis distance deviation (M_D), is calculated to quantify the similarity of distributions of prior model outputs to the distributions of synthetic data. A new metric of data worth termed "discernment value" is then calculated to quantify the value of the synthetic data network for discerning more-adequate from less-adequate models prior to calibration. The development of the model input and output files included in this data release are documented in the associated publication (https://doi.org/10.1029/2025WR040323).</purpose>
      <supplinf>Support is provided for correcting errors in the data release and clarification of the modeling conducted by the U.S. Geological Survey. Users are encouraged to review the model documentation report (https://doi.org/10.1029/2025WR040323) to understand the purpose, construction, and limitations of this model. The model will run successfully only if the original directory structure is correctly restored. The model archive is broken into several pieces to reduce the likelihood of download timeouts. Instructions for reconstructing the original directory structure and running the model included in this data release and described in the model documentation report can be found in the README.txt ASCII file which can be downloaded as part of this data release.

All data and models in this archive are synthetic. These models and data are not based on a real location or time period, nor were the models calibrated (see associated publication for details: https://doi.org/10.1029/2025WR040323). For this reason, the time period information contained in this metadata file ("timeinfo") is listed as "Unknown" and the spatial domain bounding coordinates in this metadata file ("spdom") are listed as "-180.000, 180.0000, -90.0000, 90.0000". Please refer to the "Thumbnail.jpg" file in this archive for information on the model design. 

Files in this data release include:
-README.txt: ASCII text file that describes the model data release. This file also includes instructions on how to run the models contained in this data release.

-Thumbnail.jpg: JPEG image showing the synthetic model domains for the candidate and true models.

-batch.zip:  Zipped directory containing a csv file that defines the parameter realizations that are to be run using the candidate models.

-environment.zip:  Zipped directory containing an environment.yml file that can be used to create a conda environment to run the Python scripts contained in this archive.

-mf6_htc_sims.zip:  Zipped directory containing MODFLOW 6 input and output files of the candidate models.

-mf6_true_sims.zip:  Zipped directory containing MODFLOW 6 input and output files of the synthetic "true" model.

-reals.zip: Zipped directory containing model parameter realizations.

-results.zip: Zipped directory containing all model inputs/outputs and post-processed results.

-scripts.zip: Zipped directory containing Python and bash scripts to run the workflow.</supplinf>
    </descript>
    <timeperd>
      <timeinfo>
        <sngdate>
          <caldate>Unknown</caldate>
        </sngdate>
      </timeinfo>
      <current>See Supplemental Info</current>
    </timeperd>
    <status>
      <progress>Complete</progress>
      <update>Not planned</update>
    </status>
    <spdom>
      <bounding>
        <westbc>-180.0000</westbc>
        <eastbc>180.0000</eastbc>
        <northbc>90.0000</northbc>
        <southbc>-90.0000</southbc>
      </bounding>
    </spdom>
    <keywords>
      <theme>
        <themekt>ISO 19115 Topic Category</themekt>
        <themekey>geoscientificInformation</themekey>
      </theme>
      <theme>
        <themekt>USGS Thesaurus</themekt>
        <themekey>groundwater</themekey>
        <themekey>modeling</themekey>
      </theme>
      <theme>
        <themekt>none</themekt>
        <themekey>usgsgroundwatermodel</themekey>
        <themekey>MODFLOW 6</themekey>
        <themekey>Groundwater model</themekey>
        <themekey>Model discrimination</themekey>
        <themekey>Prior-data conflict</themekey>
        <themekey>Model structural uncertainty</themekey>
      </theme>
      <theme>
        <themekt>USGS Metadata Identifier</themekt>
        <themekey>USGS:67b8eb45d34e1a2e835b8476</themekey>
      </theme>
    </keywords>
    <accconst>None. Acknowledgement of the U.S. Geological Survey would be appreciated in products derived from this data release.</accconst>
    <useconst>These groundwater model input and output files and ancillary Python scripts are provided to support the analyses documented in the associated publication (https://doi.org/10.1029/2025WR040323).  Although the information contained in this model archive may be useful for other purposes, it is incumbent on the user to understand the purpose, construction, and limitations of these models. Data have been checked to ensure consistency with the accompanying report. If any errors are detected, please notify the originating office.</useconst>
    <ptcontac>
      <cntinfo>
        <cntperp>
          <cntper>Mark Pleasants</cntper>
          <cntorg>U.S. Geological Survey</cntorg>
        </cntperp>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>521 Progress Circle, Suite 6</address>
          <city>Cheyenne</city>
          <state>WY</state>
          <postal>82007</postal>
          <country>USA</country>
        </cntaddr>
        <cntvoice>1-307-778-2931</cntvoice>
        <cntemail>mpleasants@usgs.gov</cntemail>
      </cntinfo>
    </ptcontac>
    <browse>
      <browsen>https://www.sciencebase.gov/catalog/file/get/67b8eb45d34e1a2e835b8476?name=Thumbnail.jpg</browsen>
      <browsed>Image of the synthetic model domains. (a) Groundwater model domain used to define the eight candidate models and (b) groundwater model domain used to create the synthetic data, i.e., the true model.</browsed>
      <browset>jpg</browset>
    </browse>
    <datacred>U.S. Geological Survey Water Availability and Use Science Program.</datacred>
    <secinfo>
      <secsys>None</secsys>
      <secclass>Unclassified</secclass>
      <sechandl>None</sechandl>
    </secinfo>
    <native>-All model simulations were run on an Amazon Web Services (AWS) cloud computing cluster with HTCondor. The mf6 files in this archive are the MODFLOW 6.5.0 version for Linux released on May 23, 2024 in the Github commit 879f911 (https://github.com/MODFLOW-USGS/modflow6/releases/tag/6.5.0).

-The code was also tested on a personal computer with a 64-bit Windows 11 OS in the Windows Subsystem for Linux 2 (WSL2) after creating the environment given in the environment.yml file in the /environment folder. Using the parameter ensemble included in the /reals folder requires 6,000,000 simulations of the candidate MODFLOW 6 models. On the tested personal computer, this required approximately 12 hours of wall time running the models using 8 batches in parallel. To run the workflow using a smaller parameter ensemble, the variable N = 500 in params.py can be reduced. The total number of candidate model simulations is determined from the equation 3 * 8 * N * N. 

-Using the synthetic data set size of 500 given by the variable N = 500 in sims_true.py, the true model was simulated 250,000 times. On the tested personal computer, this required approximately 3 hours of wall time running all simulations in serial. To run the workflow using a smaller synthetic data set size, the variable N = 500 in sims_true.py can be reduced. The total number of true model simulations is determined from the equation N * N.</native>
    <crossref>
      <citeinfo>
        <origin>Christian D. Langevin</origin>
        <origin>Joseph D. Hughes</origin>
        <origin>Edward R. Banta</origin>
        <origin>Richard G. Niswonger</origin>
        <origin>Sorab Panday</origin>
        <origin>Alden M. Provost</origin>
        <pubdate>2017</pubdate>
        <title>Documentation for the MODFLOW 6 Groundwater Flow Model</title>
        <geoform>publication</geoform>
        <serinfo>
          <sername>Techniques and Methods</sername>
          <issue>6-A55</issue>
        </serinfo>
        <pubinfo>
          <pubplace>Reston, VA, USA</pubplace>
          <publish>US Geological Survey</publish>
        </pubinfo>
        <othercit>Detailed description of the model input and output files included in this data release can be found in this model code documentation report.</othercit>
        <onlink>https://doi.org/10.3133/tm6A55</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>Jing Yang</origin>
        <origin>Ming Ye</origin>
        <origin>Xingyuan Chen</origin>
        <origin>Heng Dai</origin>
        <origin>Anthony P. Walker</origin>
        <pubdate>20220302</pubdate>
        <title>Process Interactions Can Change Process Ranking in a Coupled Complex System Under Process Model and Parametric Uncertainty</title>
        <geoform>publication</geoform>
        <serinfo>
          <sername>Water Resources Research</sername>
          <issue>vol. 58, issue 3</issue>
        </serinfo>
        <pubinfo>
          <pubplace>n/a</pubplace>
          <publish>American Geophysical Union (AGU)</publish>
        </pubinfo>
        <othercit>The MODFLOW 6 model is based on the analytical groundwater model used in this reference.</othercit>
        <onlink>https://doi.org/10.1029/2021WR029812</onlink>
      </citeinfo>
    </crossref>
  </idinfo>
  <dataqual>
    <attracc>
      <attraccr>All data in this archive are synthetic and models were not calibrated.</attraccr>
    </attracc>
    <logic>No formal logical accuracy tests were conducted.</logic>
    <complete>Data set is considered complete for the information presented, as described in the abstract. Users are advised to read the rest of the metadata record and the associated model documentation report (https://doi.org/10.1029/2025WR040323) for additional details.</complete>
    <posacc>
      <horizpa>
        <horizpar>No formal positional accuracy tests were conducted.</horizpar>
      </horizpa>
      <vertacc>
        <vertaccr>No formal positional accuracy tests were conducted.</vertaccr>
      </vertacc>
    </posacc>
    <lineage>
      <procstep>
        <procdesc>The process used to develop and apply the model is fully described in the associated publication (https://doi.org/10.1029/2025WR040323).</procdesc>
        <procdate>2025</procdate>
      </procstep>
    </lineage>
  </dataqual>
  <eainfo>
    <overview>
      <eaover>This model archive data release contains all the model input and output files needed to replicate the simulations described in the associated publication (https://doi.org/10.1029/2025WR040323).</eaover>
      <eadetcit>https://doi.org/10.1029/2025WR040323</eadetcit>
    </overview>
    <overview>
      <eaover>Detailed descriptions of the model input and output files can be found in the MODFLOW 6 model documentation (https://doi.org/10.3133/tm6A55) and online at: https://water.usgs.gov/water-resources/software/MODFLOW-6/mf6io_6.5.0.pdf accessed February 24, 2025.</eaover>
      <eadetcit>https://doi.org/10.3133/tm6A55</eadetcit>
    </overview>
  </eainfo>
  <distinfo>
    <distrib>
      <cntinfo>
        <cntperp>
          <cntper>GS ScienceBase</cntper>
          <cntorg>U.S. Geological Survey</cntorg>
        </cntperp>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>Denver Federal Center, Building 810, Mail Stop 302</address>
          <city>Denver</city>
          <state>CO</state>
          <postal>80225</postal>
          <country>United States</country>
        </cntaddr>
        <cntvoice>1-888-275-8747</cntvoice>
        <cntemail>sciencebase@usgs.gov</cntemail>
      </cntinfo>
    </distrib>
    <distliab>Unless otherwise stated, all data, metadata and related materials are considered to satisfy the quality standards relative to the purpose for which the data were collected. Although the data, software, and related material have been processed successfully on a computer system at the U.S. Geological Survey (USGS), reviewed for accuracy and completeness, and approved for release by the USGS, no warranty expressed or implied is made regarding the display or utility of the data for other purposes, nor on all computer systems, nor shall the act of distribution constitute any such warranty. Although the data have been subjected to rigorous review and are substantially complete, the USGS reserves the right to revise the data pursuant to further analysis and review. Furthermore, the data are released on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from authorized or unauthorized use. The USGS or the U.S. Government shall not be held liable for improper or incorrect use of the data described and/or contained herein. Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government.</distliab>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>Digital datasets</formname>
          <formvern>None</formvern>
          <transize>4310.194</transize>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://doi.org/10.5066/P1HH3X5W</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None. No fees are applicable for obtaining the dataset.</fees>
    </stdorder>
  </distinfo>
  <metainfo>
    <metd>20250903</metd>
    <metc>
      <cntinfo>
        <cntperp>
          <cntper>Mark S. Pleasants</cntper>
          <cntorg>U.S. Geological Survey</cntorg>
        </cntperp>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>521 Progress Circle, Suite 6</address>
          <city>Cheyenne</city>
          <state>WY</state>
          <postal>82007</postal>
          <country>USA</country>
        </cntaddr>
        <cntvoice>1-307-778-2931</cntvoice>
        <cntemail>mpleasants@usgs.gov</cntemail>
      </cntinfo>
    </metc>
    <metstdn>Content Standard for Digital Geospatial Metadata</metstdn>
    <metstdv>FGDC-STD-001-1998</metstdv>
  </metainfo>
</metadata>
