<?xml version='1.0' encoding='UTF-8'?>
<metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <idinfo>
    <citation>
      <citeinfo>
        <origin>Fienen, Michael N.</origin>
        <origin>Long, Andrew J.</origin>
        <origin>Markovich, Katherine H.</origin>
        <origin>Haj, Adel E.</origin>
        <origin>Barker, Matthew I.</origin>
        <pubdate>2025</pubdate>
        <title>Workflow and data supporting ensemble-based history matching and uncertainty quantification for selected watersheds from the National Hydrologic Model</title>
        <geoform>publication</geoform>
        <onlink>https://doi.org/10.5066/P13KUGYI</onlink>
      </citeinfo>
    </citation>
    <descript>
      <abstract>History matching of large hydrologic models is challenging due to data sparsity and non-unique process combinations (and associated parameters) that can produce similar model predictions. We developed an ensemble-based history matching and uncertainty quantification approach using an iterative ensemble smoother (iES) method for three cutouts of the National Hydrologic Model (NHM) and qualitatively compared the results and performance to the stepwise history matching approach. In the latter approach, subsets of parameters and observations were sequentially calibrated to a diverse range of observations to mitigate non-uniqueness and local minima. In iES, localization simulates the same causal connections between parameters and observations without the need (and computational cost) of sequential history matching steps. iES uses a weighted sum-of-squared errors objective function which allows differential weighting of multiple data sources. Formal adoption of range observation also pushes results to within ranges of observation values rather than discrete values. Overall, the ensemble approach performs similarly to the stepwise approach. Both approaches performed poorly for the cutout representing a snowmelt-dominated watershed, indicating a structural issue in the process representation of the model. The main advantage of iES is quantification of uncertainty in both the history matching and the predictions of interest.

Plain language summary: Parameter estimation, or history matching, for large hydrologic models to best align with real-world observations is difficult because there are often limited data and many ways to adjust the parameters that can produce similar results. This study tests a statistical approach, called the iterative ensemble smoother (iES), in three National Hydrologic Model cutouts. The iES method is compared with a traditional step-by-step parameterization approach. Both methods produced similar results overall, though both struggled in a watershed where snowmelt plays a major role, suggesting the model needs improvement there. The main advantage of the iES approach is that it provides an estimate of uncertainty, which helps scientists better understand how confident they can be in both the history matching and its predictions.</abstract>
      <purpose>Workflow for connecting PEST++ with the pywatershed implementation of the National Hydrologic Model for parameter estimation and uncertainty analysis.</purpose>
    </descript>
    <timeperd>
      <timeinfo>
        <rngdates>
          <begdate>19991001</begdate>
          <enddate>20090930</enddate>
        </rngdates>
      </timeinfo>
      <current>Simulation Time</current>
    </timeperd>
    <status>
      <progress>Complete</progress>
      <update>None planned</update>
    </status>
    <spdom>
      <bounding>
        <westbc>-125.0000</westbc>
        <eastbc>-66.9000</eastbc>
        <northbc>49.0000</northbc>
        <southbc>24.5000</southbc>
      </bounding>
    </spdom>
    <keywords>
      <theme>
        <themekt>None</themekt>
        <themekey>uncertainty analysis</themekey>
        <themekey>python</themekey>
        <themekey>PEST++</themekey>
        <themekey>parameter estimation</themekey>
        <themekey>uncertainty analysis</themekey>
        <themekey>pywatershed</themekey>
        <themekey>iES</themekey>
        <themekey>National Hydrologic Model</themekey>
        <themekey>NHM</themekey>
      </theme>
      <theme>
        <themekt>USGS Metadata Identifier</themekt>
        <themekey>USGS:6931dc50d4be024058c06064</themekey>
      </theme>
      <place>
        <placekt>None</placekt>
        <placekey>Conterminous United States</placekey>
        <placekey>Turtle Creek in Wisconsin</placekey>
        <placekey>Perkiomen Creek in Wisconsin</placekey>
        <placekey>East River in Colorado</placekey>
      </place>
    </keywords>
    <accconst>No access constraints. Please see 'Distribution Information' for details.</accconst>
    <useconst>These data are marked with a Creative Commons Zero v1.0 Universal (CC0-1.0) public domain dedication and do not have any use constraints. Users are advised to read the dataset's metadata thoroughly to understand appropriate use and data limitations.</useconst>
    <ptcontac>
      <cntinfo>
        <cntperp>
          <cntper>Michael N. Fienen</cntper>
          <cntorg>U.S. Geological Survey, MIDCONTINENT REGION</cntorg>
        </cntperp>
        <cntpos>Research Hydrologist</cntpos>
        <cntaddr>
          <addrtype>mailing address</addrtype>
          <address>Mail Stop Office 164, One Gifford Pinchot Drive</address>
          <city>Madison</city>
          <state>WI</state>
          <postal>53726</postal>
          <country>US</country>
        </cntaddr>
        <cntvoice>608-821-3894</cntvoice>
        <cntemail>mnfienen@usgs.gov</cntemail>
      </cntinfo>
    </ptcontac>
  </idinfo>
  <dataqual>
    <attracc>
      <attraccr>Formal history matching was performed to estimate model parameters that result in adequate correspondence between model outputs and observations of streamflow and ranges of other water budget components collocated in space and time.</attraccr>
    </attracc>
    <logic>This data release was reviewed to make sure that the output files herein produce the outputs described in the report, and that the files in this archive can be run to produce the same output as was used in the report</logic>
    <complete>This data release presents all the inputs needed to run the PEST++ workflow to generate files needed to run PEST++. Parallel runs must be run on a high-throughput or high-performance computing resource. Ultimate PEST++ outputs are also provided for comparison with reported results.</complete>
    <posacc>
      <horizpa>
        <horizpar>A formal accuracy assessment of the horizontal positional information in the data set has not been conducted.</horizpar>
      </horizpa>
      <vertacc>
        <vertaccr>A formal accuracy assessment of the vertical positional information in the data set has not been conducted.</vertaccr>
      </vertacc>
    </posacc>
    <lineage>
      <procstep>
        <procdesc>Model input files were generated using custom python codes that are included in this data release. 
This workflow consists of:
1) a series of scripts to run the pywatershed model and to extract and post-process results from the model to align with
observations.
2) a series of notebooks to connect the pywatershed model to iES.
3) a combination of scripts and notebooks to post-process and visualize results from iES.

View included README for detailed instructions.</procdesc>
        <procdate>2024</procdate>
      </procstep>
    </lineage>
  </dataqual>
  <distinfo>
    <distrib>
      <cntinfo>
        <cntorgp>
          <cntorg>U.S. Geological Survey</cntorg>
          <cntper>GS ScienceBase</cntper>
        </cntorgp>
        <cntaddr>
          <addrtype>mailing address</addrtype>
          <address>Denver Federal Center, Building 810, Mail Stop 302</address>
          <city>Denver</city>
          <state>CO</state>
          <postal>80225</postal>
          <country>United States</country>
        </cntaddr>
        <cntvoice>1-888-275-8747</cntvoice>
        <cntemail>sciencebase@usgs.gov</cntemail>
      </cntinfo>
    </distrib>
    <distliab>The data have been approved for release by the U.S. Geological Survey (USGS). Although the data have been subjected to rigorous review and are substantially complete, the USGS reserves the right to revise the data pursuant to further analysis and review. Furthermore, the data are released on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from authorized or unauthorized use. Although the data, software, and related material have been processed successfully on a computer system at the USGS, no warranty expressed or implied is made regarding the display or utility of the data for other purposes, nor on all computer systems, nor shall the act of distribution constitute any such warranty. The USGS or the U.S. Government shall not be held liable for improper or incorrect use of the data described and/or contained herein. Any use of trade, product, or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government.</distliab>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>figures.zip</formname>
          <formvern>none</formvern>
          <formspec>zipfile</formspec>
          <formcont>Additional plots of all history matching results in PDF form.</formcont>
          <transize>41.5</transize>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://www.sciencebase.gov/catalog/item/6931dc50d4be024058c06064?name=figures.zip</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>scripts.zip</formname>
          <formvern>none</formvern>
          <formspec>zipfile</formspec>
          <formcont>Supporting Python scripts used for both pre- and post-processing files for PEST.</formcont>
          <transize>0.03549</transize>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://www.sciencebase.gov/catalog/item/6931dc50d4be024058c06064?name=scripts.zip</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>notebooks.zip</formname>
          <formvern>none</formvern>
          <formspec>zipfile</formspec>
          <formcont>Six jupyter notebooks used to create input files for PEST++.</formcont>
          <transize>0.05881</transize>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://www.sciencebase.gov/catalog/item/6931dc50d4be024058c06064?name=notebooks.zip</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>NHM_extractions.zip</formname>
          <formvern>none</formvern>
          <formspec>zipfile</formspec>
          <formcont>Starting cutout models extracted from the NHM.</formcont>
          <transize>10.5</transize>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://www.sciencebase.gov/catalog/item/6931dc50d4be024058c06064?name=NHM_extractions.zip</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>NHM_extractions_PST_reference.zip</formname>
          <formvern>none</formvern>
          <formspec>zipfile</formspec>
          <formcont>PST files generated from the notebooks, meant to be run with pestpp-ies.</formcont>
          <transize>4.27</transize>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://www.sciencebase.gov/catalog/item/6931dc50d4be024058c06064?name=NHM_extractions_PST_reference.zip</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>Supporting_information.zip</formname>
          <formvern>none</formvern>
          <formspec>zipfile</formspec>
          <formcont>Six supporting files used by the notebooks that contain tabular information.</formcont>
          <transize>0.00364</transize>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://www.sciencebase.gov/catalog/item/6931dc50d4be024058c06064?name=Supporting_information.zip</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>results.zip</formname>
          <formvern>none</formvern>
          <formspec>zipfile</formspec>
          <formcont>Result files from pest-ies for local processing.</formcont>
          <transize>896.08</transize>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://www.sciencebase.gov/catalog/item/6931dc50d4be024058c06064?name=results.zip</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>README.md</formname>
          <formvern>none</formvern>
          <formspec>markdown</formspec>
          <formcont>README file outlining contents of this data release.</formcont>
          <transize>0.0012</transize>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://www.sciencebase.gov/catalog/item/6931dc50d4be024058c06064?name=README.md</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
  </distinfo>
  <metainfo>
    <metd>20251222</metd>
    <metc>
      <cntinfo>
        <cntperp>
          <cntper>Michael N. Fienen</cntper>
          <cntorg>U.S. Geological Survey, MIDCONTINENT REGION</cntorg>
        </cntperp>
        <cntpos>Research Hydrologist</cntpos>
        <cntaddr>
          <addrtype>mailing address</addrtype>
          <address>Mail Stop Office 164, One Gifford Pinchot Drive</address>
          <city>Madison</city>
          <state>WI</state>
          <postal>53726</postal>
          <country>US</country>
        </cntaddr>
        <cntvoice>608-821-3894</cntvoice>
        <cntemail>mnfienen@usgs.gov</cntemail>
      </cntinfo>
    </metc>
    <metstdn>FGDC Content Standard for Digital Geospatial Metadata</metstdn>
    <metstdv>FGDC-STD-001-1998</metstdv>
  </metainfo>
</metadata>
