<?xml version='1.0' encoding='UTF-8'?>
<metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <idinfo>
    <citation>
      <citeinfo>
        <origin>Matthew J. Cashman</origin>
        <origin>Taylor E. Woods</origin>
        <origin>Kelly O. Maloney</origin>
        <origin>Daren M. Carlisle</origin>
        <origin>Lindsey J. Boyle</origin>
        <pubdate>20260302</pubdate>
        <title>Ecological Water Quality Benchmarks for the Conterminous United States, Attributed to the NHDPlus version 2.1</title>
        <geoform>tabular digital data</geoform>
        <pubinfo>
          <pubplace>Sciencebase</pubplace>
          <publish>US Geological Survey</publish>
        </pubinfo>
        <onlink>https://doi.org/10.5066/P1FBQTVT</onlink>
      </citeinfo>
    </citation>
    <descript>
      <abstract>This data release provides a variety of water quality benchmarks, compiled from multiple publications and attributed to the NHDPlus v2.1 hydrography, for the conterminous United States (CONUS) for use in integrated water availability studies for evaluating altered water conditions and potential ecological impacts. 

To avoid ambiguity in terms, for the purpose of this data release, we use the following definitions. Benchmark is used as a generic term to refer to any value used to evaluate or compare condition against. A variety of benchmark sub-categories exist, which are not mutually exclusive. Criteria are a type of benchmark that are used or suggested for regulatory purposes, including enforceable and nonenforceable standards. Thresholds are a class of benchmark that indicate a significant, pre-defined quantifiable change in condition. Background reference values are a class of benchmark that indicate conditions representative of no to minimal amounts of human activity or disturbance. Presumptive standards are a class of benchmark that lack a clear and quantifiable degree of impact and where negative effects are merely presumed to occur.

Specifically, this data release primarily focuses on ecologically relevant water quality benchmarks for nutrients (nitrogen, phosphorus), salinity as specific conductance, and temperature. Additional benchmarks that exist in the source datasets for other factors (e.g., chlorophyll-a, turbidity, and fish and macroinvertebrate multimetric indices) are also provided for completeness and value-added utility, even if they were not the primary focus of this compilation. 

This dataset includes a compilation of ecologically relevant benchmarks from the following sources:
1. Environmental Protection Agency National Rivers and Streams Assessment. Benchmarks used in the 2018-2019 assessment cycle are provided for total nitrogen, total phosphorus, specific conductance, and fish and macroinvertebrate multimetric indices. Specifically, the background reference value cutoff of Good/Fair and the Fair/Poor benchmark used as a presumptive standard to evaluate potential ecosystem impacts are included for each water quality parameter and biological multimetric index. These benchmarks are spatially variable based on the specific National Rivers and Streams Assessment Ecoregion Level III aggregation (9 units).
2. Environmental Protection Agency Recommended Ecoregional Nutrient Criteria for Rivers and Streams, developed in the National Nutrient Strategy.  Although not directly used for regulatory purposes, these criteria are intended as a recommended starting place to support development of jurisdiction-specific regulatory criteria. Criteria are provided for total nitrogen, total phosphorus, turbidity, and chlorophyll-a. These data are variable based on a specific National Nutrient Ecoregion Level III aggregation (14 units).
3. Specific Conductance benchmark of a natural background reference value from Olson and Cormier (2019). The natural background benchmark was derived from a machine-learning model from data at least-impacted sites across the conterminous United States. These data are spatially variable at each NHDPlus v2.1 COMID.
4. Newly calculated fish community temperature thresholds, combining previously published fish species distribution models and laboratory-derived maximum critical thermal limits (CTmax) for each species. Available for individual species or via summary metrics of CTmax thresholds for the fish community in each local stream reach. These data are spatially variable for each NHDPlus v2.1 COMID. 

As the various benchmarks originate from different methods, were derived for comparison against different condition levels, represent different time-scales of condition, and/or may have different quantified levels of impact, a thorough reading of the metadata is highly encouraged, especially Attribute Definitions, Logical Consistency, and Purpose. This may have profound implications for use in water availability and ecological condition studies.

This data release contains four primary tabular datasets. The two primary national-scale datasets are provided in Parquet format (.parquet), an open-source, efficient, cloud-optimized tabular format for big which can be easily read in R, Python, and other open-source languages. Alternative .csv formats of these two conus datasets, with otherwise identical content, are provided for accessibility purposes and are compressed in a .zip format for long-term storage.  In addition, two other files are provided exclusively in .csv format, including a crosswalk of taxonomic names used to reconcile across source datasets, and a data dictionary for the primary files. The files are:
1. water_quality_benchmarks.parquet - The main ecological benchmark dataset for all CONUS NHDPlus v2.1 network flowlines.
2. Temperature_CTmax_taxa_NHD_CONUS.parquet - The taxa-level temperature thresholds for all fish taxa predicted to be present in all CONUS NHDPlus v2.1 network flowlines. In tall format, with one row per unique taxa per unique flowline identifier. Used to produce the community level summaries contained in water_quality_benchmarks.parquet. 
3. conus_benchmarks_csv_archive.zip - A compressed archive containing identical copies of the main CONUS ecological benchmark datasets listed above (files #1 and #2), but provided in .csv format for accessibility purposes. Files are contained in .zip compressed archives to minimize size for long-term storage purposes.
4. unique_tsn_crosswalk.csv - A dataset that lists the unique taxonomic identifiers and taxonomic serial numbers (TSN) used in processing the temperature threshold data and align taxa names used in source thermal benchmark data and in the fish species distribution models.
5. variable_lookup.csv - A data dictionary containing entity and attribute information about the variable names, descriptions, units, and data sources contained in the main water quality benchmark files provided at the CONUS scale (files #1 and #2). File also contains sciencebase identifiers to facilitate optional programmatic access to these data through cloud-based parquet retrieval tools.</abstract>
      <purpose>This data release was created to facilitate the analysis and comparison of water quality observation and model output data at national and large regional scales. Specifically, this is a compilation of nationally-consistent and spatially allocated benchmarks for evaluating potential departure from natural background near-reference conditions and potential levels of ecological impact. As none of these benchmarks are explicit, enforceable regulatory criteria, these should not be used to make regulatory evaluations. In addition, these benchmarks were created for use in large national and regional evaluations. More relevant, locally-derived benchmarks and thresholds are likely available elsewhere for addressing local scale questions of water availability and ecological condition.  

While many of the data sources that we have compiled were previously available in various formats, this compilation was created to overcome the following main challenges: 1) many of these data were published in tabular form based on different spatial regional aggregations in report documents, requiring manual translation; 2) allocation of stream reaches to various spatial regional aggregations requires additional GIS spatial processing and attribution, which can be unwieldy at a national scale for many users; 3) one dataset is available only through a web-hosted interactive map viewer which is not easy to programmatically access or download for use in subsequent scientific model applications. 

These data are primarily recommended for use in direct comparisons with observational data or modeled water quality conditions. Specific use will vary based on data source, temporal scale of analysis, and the specific type of benchmark or a threshold's quantified level of impact, which can vary across the dataset. Example of their use is as follows: 1) background reference value conditions can be used to create ratios of alteration from background conditions; 2) benchmark(s) used for potential ecological impact can be used to flag areas above or below the benchmark(s), or used as a ratio to indicate the degree of alteration beyond the potential impact benchmark(s); 3) acute limits, such as the fish community thermal thresholds, can be used as an index of thermal stress, sometimes called the thermal buffer, that is calculated as the magnitude of difference between current water conditions and thermal thresholds.</purpose>
    </descript>
    <timeperd>
      <timeinfo>
        <sngdate>
          <caldate>20260227</caldate>
        </sngdate>
      </timeinfo>
      <current>publication date</current>
    </timeperd>
    <status>
      <progress>Complete</progress>
      <update>None planned</update>
    </status>
    <spdom>
      <bounding>
        <westbc>-125.1563</westbc>
        <eastbc>-66.6211</eastbc>
        <northbc>49.6107</northbc>
        <southbc>24.3671</southbc>
      </bounding>
    </spdom>
    <keywords>
      <theme>
        <themekt>ISO 19115 Topic Category</themekt>
        <themekey>biota</themekey>
        <themekey>inlandWaters</themekey>
      </theme>
      <theme>
        <themekt>USGS Thesaurus</themekt>
        <themekey>salinity</themekey>
        <themekey>surface water quality</themekey>
        <themekey>water chemistry</themekey>
        <themekey>water temperature</themekey>
        <themekey>nutrient content (water)</themekey>
        <themekey>biota</themekey>
        <themekey>aquatic biology</themekey>
        <themekey>freshwater ecosystems</themekey>
      </theme>
      <theme>
        <themekt>USGS Metadata Identifier</themekt>
        <themekey>USGS:6733c24dd34ee3976592da4e</themekey>
      </theme>
      <place>
        <placekt>Common Geographic Areas</placekt>
        <placekey>United States of America</placekey>
      </place>
    </keywords>
    <accconst>None.  Please see 'Distribution Info' for details.</accconst>
    <useconst>None.  Users are advised to read the dataset's metadata thoroughly to understand appropriate use and data limitations.</useconst>
    <ptcontac>
      <cntinfo>
        <cntperp>
          <cntper>Matthew J Cashman</cntper>
          <cntorg>USGS - WATER</cntorg>
        </cntperp>
        <cntpos>Physical Scientist</cntpos>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>MD/DE/DC WSC UMBC Campus, UMBC Research Park</address>
          <city>Catonsville</city>
          <state>MD</state>
          <postal>21228</postal>
          <country>USA</country>
        </cntaddr>
        <cntvoice>443-498-5511</cntvoice>
        <cntemail>mcashman@usgs.gov</cntemail>
      </cntinfo>
    </ptcontac>
    <datacred>This data release is a compilation, spatial allocation, and/or dataset join and summarization of previous datasets compiled by other groups. Please see processing notes and source reference credits.</datacred>
    <native>Windows 11 Enterprise, R version 4.4.1, and the R packages tidyverse (2.0.0), arrow (19.0.1), duckdb (1.2.1), dbplyr (2.5.0), duckplyr (1.0.1), DBI (1.2.3)</native>
    <crossref>
      <citeinfo>
        <origin>Hao Yu</origin>
        <origin>Arthur Cooper</origin>
        <origin>Dana M. Infante</origin>
        <origin>Jared A. Ross</origin>
        <pubdate>20220520</pubdate>
        <title>Fluvial Fish Native Distributions for the Conterminous United States using the NHDPlusV2.1 and Boosted Regression Tree (BRT) Models</title>
        <geoform>Dataset</geoform>
        <pubinfo>
          <pubplace>https://www.sciencebase.gov</pubplace>
          <publish>U.S. Geological Survey</publish>
        </pubinfo>
        <onlink>https://doi.org/10.5066/p9yx3ex6</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>Lise Comte</origin>
        <origin>Julian D. Olden</origin>
        <pubdate>20170911</pubdate>
        <title>Climatic vulnerability of the world’s freshwater and marine fishes</title>
        <geoform>publication</geoform>
        <pubinfo>
          <pubplace>https://www.nature.com/</pubplace>
          <publish>Nature Climate Change</publish>
        </pubinfo>
        <othercit>Nature Climate Change</othercit>
        <onlink>https://doi.org/10.1038/nclimate3382</onlink>
        <onlink>https://www.nature.com/articles/nclimate3382</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>Jonathan Chang</origin>
        <origin>Daniel L. Rabosky</origin>
        <origin>Stephen A. Smith</origin>
        <origin>Michael E. Alfaro</origin>
        <pubdate>20190328</pubdate>
        <title>An r package and online resource for macroevolutionary studies using the ray-finned fish tree of life</title>
        <geoform>publication</geoform>
        <onlink>https://doi.org/10.1111/2041-210X.13182</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>John R. Olson</origin>
        <origin>Charles P Hawkins</origin>
        <pubdate>20130604</pubdate>
        <title>Developing site-specific nutrient criteria from empirical models</title>
        <geoform>publication</geoform>
        <pubinfo>
          <pubplace>Freshwater Science</pubplace>
          <publish>BioOne</publish>
        </pubinfo>
        <onlink>https://doi.org/10.1899/12-113.1</onlink>
        <onlink>https://bioone.org/journals/freshwater-science/volume-32/issue-3/12-113.1/Developing-site-specific-nutrient-criteria-from-empirical-models/10.1899/12-113.1.short</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>James M. Omernik</origin>
        <origin>Glenn E. Griffith</origin>
        <pubdate>20140916</pubdate>
        <title>Ecoregions of the Conterminous United States: Evolution of a Hierarchical Spatial Framework</title>
        <geoform>publication</geoform>
        <pubinfo>
          <pubplace>Environmental Management</pubplace>
          <publish>Springer</publish>
        </pubinfo>
        <onlink>https://doi.org/10.1007/s00267-014-0364-1</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>ITIS</origin>
        <origin>US Geological Survey</origin>
        <origin>Smithsonian Institution</origin>
        <pubdate>20250130</pubdate>
        <title>Integrated Taxonomic Information System (ITIS)</title>
        <geoform>application/service</geoform>
        <onlink>www.itis.gov</onlink>
        <onlink>https://doi.org/10.5066/F7KH0KBK</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>Alan T. Herlihy</origin>
        <origin>Steven G. Paulsen</origin>
        <origin>John Van Sickle</origin>
        <origin>John L. Stoddard</origin>
        <origin>Charles P. Hawkins</origin>
        <origin>Lester L. Yuan</origin>
        <pubdate>200812</pubdate>
        <title>Striving for consistency in a national assessment: the challenges of applying a reference-condition approach at a continental scale</title>
        <geoform>publication</geoform>
        <serinfo>
          <sername>Journal of the North American Benthological Society</sername>
          <issue>vol. 27, issue 4</issue>
        </serinfo>
        <pubinfo>
          <pubplace>n/a</pubplace>
          <publish>University of Chicago Press</publish>
        </pubinfo>
        <othercit>ppg. 860-877</othercit>
        <onlink>https://doi.org/10.1899/08-081.1</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>Alan T. Herlihy</origin>
        <origin>Jean C. Sifneos</origin>
        <pubdate>200812</pubdate>
        <title>Developing nutrient criteria and classification schemes for wadeable streams in the conterminous US</title>
        <geoform>publication</geoform>
        <serinfo>
          <sername>Journal of the North American Benthological Society</sername>
          <issue>vol. 27, issue 4</issue>
        </serinfo>
        <pubinfo>
          <pubplace>n/a</pubplace>
          <publish>University of Chicago Press</publish>
        </pubinfo>
        <othercit>ppg. 932-948</othercit>
        <onlink>https://doi.org/10.1899/08-041.1</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>John W. Clune</origin>
        <origin>J. Kent Crawford</origin>
        <origin>Elizabeth W. Boyer</origin>
        <pubdate>20201217</pubdate>
        <title>Nitrogen and Phosphorus Concentration Thresholds toward Establishing Water Quality Criteria for Pennsylvania, USA</title>
        <geoform>publication</geoform>
        <serinfo>
          <sername>Water</sername>
          <issue>vol. 12, issue 12</issue>
        </serinfo>
        <pubinfo>
          <pubplace>n/a</pubplace>
          <publish>MDPI AG</publish>
        </pubinfo>
        <othercit>ppg. 3550</othercit>
        <onlink>https://doi.org/10.3390/w12123550</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>Raymond B. Cowles</origin>
        <origin>Charles M. Bogert</origin>
        <pubdate>19440913</pubdate>
        <title>A preliminary study of the thermal requirements of desert reptiles</title>
        <geoform>publication</geoform>
        <serinfo>
          <sername>Bulletin of the American Museum of Natural History</sername>
          <issue>Volume 83, Article 5</issue>
        </serinfo>
        <pubinfo>
          <pubplace>New York</pubplace>
          <publish>American Museum of Natural History</publish>
        </pubinfo>
        <onlink>https://digitallibrary.amnh.org/bitstreams/f73745e6-81a7-4c50-821a-e4c38453118a/download</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>John R. Olson</origin>
        <origin>Susan M. Cormier</origin>
        <pubdate>20190312</pubdate>
        <title>Modeling Spatial and Temporal Variation in Natural Background Specific Conductivity</title>
        <geoform>publication</geoform>
        <serinfo>
          <sername>Environmental Science &amp; Technology</sername>
          <issue>vol. 53, issue 8</issue>
        </serinfo>
        <pubinfo>
          <pubplace>n/a</pubplace>
          <publish>American Chemical Society (ACS)</publish>
        </pubinfo>
        <othercit>ppg. 4316-4325</othercit>
        <onlink>https://doi.org/10.1021/acs.est.8b06777</onlink>
      </citeinfo>
    </crossref>
    <crossref>
      <citeinfo>
        <origin>U.S. Environmental Protection Agency</origin>
        <pubdate>2024</pubdate>
        <title>National Rivers and Streams Assessment: The Third Collaborative Survey 2018-2019</title>
        <geoform>publication</geoform>
        <serinfo>
          <sername>EPA</sername>
          <issue>841-R-22-004</issue>
        </serinfo>
        <pubinfo>
          <pubplace>Washington, DC</pubplace>
          <publish>U.S. Environmental Protection Agency, Office of Water and Office of Research and Development.</publish>
        </pubinfo>
        <onlink>https://riverstreamassessment.epa.gov/webreport</onlink>
      </citeinfo>
    </crossref>
  </idinfo>
  <dataqual>
    <attracc>
      <attraccr>These benchmarks were compiled for use in large national and regional evaluations. More accurate, locally derived thresholds are likely to be available elsewhere for addressing local scale questions of water availability and ecological condition. 

Validity of each benchmark is based on the accuracy of the original source dataset, its assumptions, and methods used to derive the values. Taxa-specific critical thermal limits (CTmax) derived from lab-studies have uncertainties reported in these datasets. Benchmarks derived from statistical modeling studies have uncertainty evaluated and reported in the original study publications. Other benchmarks based on distributions of sample observations have their own assumptions about landscape representativeness, degree of human impact, and ecological relevance. Please see source publications for more description on limitations and evaluated accuracies, but general concerns are reproduced below.

A variety of spatial units are used to derive the benchmarks compiled in this dataset. EPA ecoregions are frequently used as the spatial units for developing benchmarks, such as those aggregated version of the EPA Ecoregions Level III that are used by the EPA National Rivers and Streams Assessment (EPA NRSA) and the EPA National Nutrient Strategy (EPA NNS). EPA NRSA created benchmarks based on an aggregation of Ecoregion Level III units into 9 aggregated regions. EPA NNS created benchmarks based on a different aggregation of Ecoregion Level III units into 14 aggregated regions. Due to high amount of natural variation within regions, a single value per region is typically insufficient, and more-spatially variable and site-specific benchmarks are recommended (Olson and Hawkins 2013). Along these lines, both the temperature benchmarks and background salinity reference value by Olson and Cormier (2019) compiled in this dataset are spatially variable at the NHDPlus v2.1 comid stream reach scale, although additional attribute fields make it possible to aggregate or summarize these benchmark values to the aggregated ecoregions used by the EPA NRSA and EPA NNS. 

Presumptive standard benchmarks derived from percentiles of found observational data, such as recommended criteria by the EPA National Nutrient Strategy, have known limitations (e.g. Clune and others 2020, Herlihy and Sifneos 2008). Notably, found observational data often are biased towards problematic areas not representative of the region. Random, probabilistic sampling of a region, such as by the EPA National Rivers and Streams Assessment, may therefore provide a better representation of conditions across a region (Herlihy and Sifneos 2008). In addition, using near-reference sites, such as done by the EPA National Rivers and Streams Assessment, has been argued to be a superior method of defining region-based benchmarks that capture reference value conditions better than observed found data (Herlihy and Sifneos 2008). However, even using near-reference sites confronts the issue that quality reference sites may not equally exist in all regions (Herlihy and others 2008). When available, modeled estimates of natural background condition are recommended for use due to the ability to better capture in-region variability and better represent potential background conditions (Herlihy and Sifneos 2008; Olson and Hawkins 2013).

In addition, benchmarks suggesting ecological impact, especially when based on a distribution of observations, are presumptive standards and do not directly quantify ecological impact in their benchmark creation. Within EPA NRSA, exceeding the Poor presumptive standard was statistically evaluated post-hoc in NRSA reports, and was found to be associated with varying level of ecological impacts among regions when quantified. 

The model created by Olson and Cormier (2019) to predict the background conductivity reference value benchmark was designed for streams with natural background specific conductance (SC) &lt; 2000 microSiemens per centimeter. Above this level, the model estimates may be less reliable. In addition, freshwater and marine interfaces, natural mineral springs, and salt deposits which may affect groundwater and streams are not included in the model, and thus may result in underestimated values in areas where these conditions are prevalent (Olson and Cormier 2019). As a result, values in arid areas may be less reliable, and the use of local knowledge is suggested for assessing the differences between predicted and measured background SC as provided in the original publication (Olson and Cormier 2019) and explorable in interactive format through the Freshwater Explorer (Cormier and others 2021). More details on model validation and accuracy can be found in the source publication at Olson and Cormier (2019).

Lastly, all reported benchmarks, with the exception of the temperature thresholds, are chronic benchmarks, analogous to longer, time-averaged conditions. Specifically, NNS benchmarks are created through averaging found data collected across seasons. NRSA benchmarks were created using samples collected typically at baseflow and during spring and/or summer months corresponding with biological collections. In contrast, the temperature thresholds are indicated by acute, short-term lab studies that result in severe effects to individual fishes.</attraccr>
    </attracc>
    <logic>The benchmarks contained in this data release are not functionally equivalent as to their implication for water availability or ecological condition, even for the same water quality constituent. Values may indicate different conditions as indicated by their benchmark class, from background reference value conditions, thresholds indicating explicit defined ecological impacts, or a presumptive standard of a presumed impact. To note, NRSA and NNS benchmarks were created using distributions of observed data, rather than based on a defined ecological level of impact; the Poor presumptive standard for water quality constituents used in the NRSA is associated with different magnitudes of increased probability of poor ecological condition in different ecoregions (USEPA 2024). Some indicate a general community stress level, and others indicate conditions leading to die-off of taxa quickly due to acute thresholds. Please see information about the individual entity and attributes (and source citations) for full information about the methods used to derive each benchmark. 

Water quality benchmarks are often developed and presented for discrete areas, such as the aggregated Level III Ecoregions used by the EPA NRSA and EPA NNS. There are 181 Level III Ecoregions for North America (EPA Ecoregions Level III), which represent similar ecosystem types and are used as a framework for research, assessment, management, and monitoring of ecosystems. More detail on the Ecoregion framework can be found in Omernik and Griffith 2014. Level III Ecoregion boundaries provided by the EPA (EPA Ecoregions Level III) were downloaded and manually and visually compared to maps of the aggregated ecoregions used in the EPA NNS and presented in Rohm and others (2007) and to maps of the aggregated ecoregions used in the EPA NRSA and presented in Herlihy (2008). Notably, in Herlihy (2008) there were missing Ecoregions Level III in the aggregation table (Table 3), and so the figure of mapped aggregated units for CONUS (Figure 3) was used to fill in the corresponding aggregated ecoregion for the missing Ecoregion Level III regions. In Rohm and others (2007), mapped boundaries for Level III Ecoregions did not match the current GIS data for Ecoregions Level III currently available by EPA (EPA Ecoregions Level III). Specifically, this causes differences in the mapped extent of Nutrient Aggregated Region IV in Figure 1 of Rohm and others (2007), specifically the locations of Ecoregion Level III numeric classes 26 to 30 in Texas. The current Ecoregion Level III boundaries provided by EPA (EPA Ecoregions Level III) are used in this project to crosswalk to NNS aggregated regions, rather than the maps of aggregated units in the original publications.</logic>
    <complete>Data are based off the extent of the conterminous United States in the NHDPlus v2.1 and represent condition for flowlines and streams and rivers. While Level III Ecoregions exist for all of North America, only Ecoregion Level III for the extent of the conterminous United States were used, and no data are provided for rivers and streams falling outside the conterminous United States. Similarly, as this data release focuses on flowlines for rivers and streams, benchmark data may be omitted for flowlines associated with coastal open water lacking catchment polygons. For example, WQ benchmarks are missing for open water areas of the Delaware River estuary or for many flowlines near or along the border with Canada. In addition, the EPA National Nutrient Strategy does not provide any recommended ecoregional benchmarks for the Southern Florida Coastal Plain region, NNS aggregated region XIII, and thus those benchmark values are NA in that region.</complete>
    <posacc>
      <horizpa>
        <horizpar>Stream catchments can intersect with multiple ecoregion boundaries based on exact horizontal positioning of the source Ecoregion Level III spatial dataset provided by EPA (EPA Ecoregions Level III). A single Ecoregion Level III attribution for each catchment is based off the dominant areal-based overlap for each Ecoregion Level III that intersects with each catchment. Ambiguity in exact Ecoregional boundaries may result in slight mis-attribution of catchments around the boundaries of multiple Ecoregions.</horizpar>
      </horizpa>
      <vertacc>
        <vertaccr>N/A</vertaccr>
      </vertacc>
    </posacc>
    <lineage>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>McKay, L.</origin>
            <origin>Bondelid, T.</origin>
            <origin>Dewald, T</origin>
            <origin>Johnston, J.</origin>
            <origin>Moore, R.</origin>
            <origin>Rea, A.</origin>
            <pubdate>2012</pubdate>
            <title>NHDPlus Version 2.1</title>
            <geoform>Hydrologic Network</geoform>
            <pubinfo>
              <pubplace>Washington, DC</pubplace>
              <publish>United States EPA</publish>
            </pubinfo>
            <onlink>https://www.epa.gov/waterdata/get-nhdplus-national-hydrography-dataset-plus-data</onlink>
          </citeinfo>
        </srccite>
        <srcscale>100000</srcscale>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2012</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>NHDPlus v2.1</srccitea>
        <srccontr>Served as spatial framework for benchmark criteria association</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>U.S. Environmental Protection Agency</origin>
            <pubdate>202409</pubdate>
            <title>National Rivers and Streams Assessment 2018-2019 Technical Support Document</title>
            <geoform>publication</geoform>
            <pubinfo>
              <pubplace>Washington, D.C. 20460</pubplace>
              <publish>Office of Water: Office of Wetlands, Oceans and Watersheds; and Office of Research and Development</publish>
            </pubinfo>
            <onlink>https://www.epa.gov/system/files/documents/2024-12/nrsa-2018-19-tsd-final-11252024.pdf</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <rngdates>
              <begdate>2018</begdate>
              <enddate>2019</enddate>
            </rngdates>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>EPA NRSA</srccitea>
        <srccontr>Source for ecoregion benchmark values for fish and benthic macroinvertebrate assemblage condition and stream nutrient concentrations</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>U.S. Environmental Protection Agency</origin>
            <pubdate>2001</pubdate>
            <title>National Nutrient Strategy: Summary Table for the Rivers and Streams Ecoregional Nutrient Criteria Documents</title>
            <geoform>publication</geoform>
            <onlink>https://www.epa.gov/system/files/documents/2021-07/ecoregion-table-rivers-streams.pdf</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <rngdates>
              <begdate>2000</begdate>
              <enddate>2001</enddate>
            </rngdates>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>EPA NNS summary table</srccitea>
        <srccontr>Source for National Nutrient Strategy recommended ecoregional criteria for TP, TN, Chlorophyll a, and FTU/NTU</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Susan M. Cormier</origin>
            <origin>Christopher L. Wharton</origin>
            <origin>John O. Olson</origin>
            <pubdate>20210727</pubdate>
            <title>EPA Freshwater Explorer: Predicted Background Conductivity Data</title>
            <geoform>vector digital data</geoform>
            <pubinfo>
              <pubplace>Washington DC</pubplace>
              <publish>U.S. Environmental Protection Agency</publish>
            </pubinfo>
            <onlink>https://epa.maps.arcgis.com/home/item.html?id=85c2000098e446cb979af577fd95e821</onlink>
            <onlink>https://arcg.is/KHb9S</onlink>
          </citeinfo>
        </srccite>
        <srcscale>100000</srcscale>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>20210727</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>EPA Freshwater Explorer</srccitea>
        <srccontr>Source of predicted background reference SC values for NHDPlusv2.1 COMIDs in the conterminous United States</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Lise Comte</origin>
            <origin>J. D. Olden</origin>
            <pubdate>2017</pubdate>
            <title>Climatic vulnerability of the world’s freshwater and marine fishes.</title>
            <geoform>publication</geoform>
            <pubinfo>
              <pubplace>Nature Climate Change</pubplace>
              <publish>Springer Nature</publish>
            </pubinfo>
            <onlink>https://doi.org/10.1038/nclimate3382</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2017</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>Comte and Olden (2017)</srccitea>
        <srccontr>Source of thermal data for fish</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Johnathan Chang</origin>
            <origin>Daniel L. Rabosky</origin>
            <origin>Stephen A. Smith</origin>
            <origin>Michael E. Alfaro</origin>
            <pubdate>2019</pubdate>
            <title>An r package and online resource for macroevolutionary studies using the ray-finned fish tree of life</title>
            <geoform>publication</geoform>
            <onlink>https://doi.org/10.1111/2041-210X.13182</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2019</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>Chang and others (2019)</srccitea>
        <srccontr>Name of the phylogenetic tree from the Fish Tree of Life, which is considered in the imputation procedure</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Emmanual A. Frimpong</origin>
            <origin>Paul L. Angermeier</origin>
            <pubdate>2009</pubdate>
            <title>Fish Traits: A Database of Ecological and Life‐history Traits of Freshwater Fishes of the United States</title>
            <geoform>publication</geoform>
            <onlink>https://doi.org/10.1577/1548-8446-34.10.487</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2009</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>Frimpong and Angermeir (2009)</srccitea>
        <srccontr>Source of monthly probability weight of spawning occurring for fish taxa</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Daniel J Wieferich</origin>
            <origin>Madison B Most</origin>
            <origin>Hao Yu</origin>
            <origin>Arthur Cooper</origin>
            <origin>Dana M. Infante</origin>
            <origin>Jared A. Ross</origin>
            <pubdate>2024</pubdate>
            <title>Fluvial Fish Native Distributions for the Conterminous United States using the NHDPlusV2.1 and Boosted Regression Tree (BRT) Models (ver. 2.0, December 2024)</title>
            <geoform>dataset</geoform>
            <pubinfo>
              <pubplace>https://www.sciencebase.gov</pubplace>
              <publish>U.S. Geological Survey</publish>
            </pubinfo>
            <onlink>https://doi.org/10.5066/p1uv25fw</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2024</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>Yu and others (2022)</srccitea>
        <srccontr>Source for fish species distribution models for 419 species across CONUS</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Michael E Wieczorek</origin>
            <origin>Shannon E. Jackson</origin>
            <origin>Gregory E Schwarz</origin>
            <origin>Andrew J Sekellick</origin>
            <origin>Leah E Staub</origin>
            <pubdate>20181002</pubdate>
            <title>Select Attributes for NHDPlus Version 2.1 Reach Catchments and Modified Network Routed Upstream Watersheds for the Conterminous United States (ver. 5.0, June 2025)</title>
            <geoform>Dataset</geoform>
            <pubinfo>
              <pubplace>https://www.sciencebase.gov</pubplace>
              <publish>U.S. Geological Survey</publish>
            </pubinfo>
            <onlink>https://doi.org/10.5066/f7765d7v</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>202506</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>Wieczorek and others (2018)</srccitea>
        <srccontr>Contributes the percentage overlap between EPA Ecoregion Level III and each catchment COMID in CONUS</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Alan T. Herlihy</origin>
            <origin>Steven G. Paulsen</origin>
            <origin>John Van Sickle</origin>
            <origin>John L. Stoddard</origin>
            <origin>Charles P. Hawkins</origin>
            <origin>Lester L. Yuan</origin>
            <pubdate>200812</pubdate>
            <title>Striving for consistency in a national assessment: the challenges of applying a reference-condition approach at a continental scale</title>
            <geoform>publication</geoform>
            <pubinfo>
              <pubplace>Journal of the North American Benthological Society</pubplace>
              <publish>The University of Chicago Press Journals</publish>
            </pubinfo>
            <onlink>https://doi.org/10.1899/08-081.1</onlink>
            <onlink>https://www.journals.uchicago.edu/doi/abs/10.1899/08-081.1</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>200812</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>Herlihy and others (2008)</srccitea>
        <srccontr>Crosswalk from Ecoregion Level III to aggregated ecoegions used by EPA National Rivers and Streams Assessment (NRSA)</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Christina M. Rohm</origin>
            <origin>James M. Omernik</origin>
            <origin>Alan J. Woods</origin>
            <origin>John L. Stoddard</origin>
            <pubdate>20070708</pubdate>
            <title>REGIONAL CHARACTERISTICS OF NUTRIENT CONCENTRATIONS IN STREAMS AND THEIR APPLICATION TO NUTRIENT CRITERIA DEVELOPMENT</title>
            <geoform>publication</geoform>
            <pubinfo>
              <pubplace>Journal of the American Water Resources Association</pubplace>
              <publish>Wiley</publish>
            </pubinfo>
            <onlink>https://doi.org/10.1111/j.1752-1688.2002.tb01547.x</onlink>
            <onlink>https://onlinelibrary.wiley.com/doi/10.1111/j.1752-1688.2002.tb01547.x</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>20070708</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>Rohm and others (2007)</srccitea>
        <srccontr>Crosswalk from Ecoregion Level III to Aggregated Ecoregions used by EPA National Nutrient Strategy (NNS)</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>S. Chamberlain</origin>
            <pubdate>2021</pubdate>
            <title>ritis: Integrated Taxonomic Information System Client</title>
            <geoform>application/service</geoform>
            <othercit>R Application to the Integrated Taxonomic Information System available at: https://www.itis.gov/</othercit>
            <onlink>https://doi.org/10.32614/CRAN.package.ritis</onlink>
            <onlink>https://www.itis.gov/</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>20250703</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>observed</srccurr>
        </srctime>
        <srccitea>ritis</srccitea>
        <srccontr>Taxonomic Names Validation</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>U.S. Environmental Protection Agency</origin>
            <pubdate>20251105</pubdate>
            <title>Aggregations of Level III Ecoregions for the National Nutrient Strategy</title>
            <geoform>publication</geoform>
            <onlink>https://www.epa.gov/nutrientpollution/ecoregional-nutrient-criteria-rivers-and-streams</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2025</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>EPA NNS website</srccitea>
        <srccontr>Figure 1 of the Ecoregion Level III boundaries and the NNS aggregations were used to visually compare with the aggregation table by Rohm and others (2007).</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>John R. Olson</origin>
            <origin>Susan M. Cormier</origin>
            <pubdate>20190312</pubdate>
            <title>Modeling Spatial and Temporal Variation in Natural Background Specific Conductivity</title>
            <geoform>publication</geoform>
            <serinfo>
              <sername>Environmental Science &amp;amp; Technology</sername>
              <issue>vol. 53, issue 8</issue>
            </serinfo>
            <pubinfo>
              <pubplace>n/a</pubplace>
              <publish>American Chemical Society (ACS)</publish>
            </pubinfo>
            <othercit>ppg. 4316-4325</othercit>
            <onlink>https://doi.org/10.1021/acs.est.8b06777</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <rngdates>
              <begdate>2000</begdate>
              <enddate>2015</enddate>
            </rngdates>
          </timeinfo>
          <srccurr>ground condition</srccurr>
        </srctime>
        <srccitea>Olson and Cormier (2019)</srccitea>
        <srccontr>Original study that produced the natural background specific conductance values that are ultimately retrieved from EPA Freshwater Explorer</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>U.S. Environmental Protection Agency</origin>
            <pubdate>20100501</pubdate>
            <title>Level III Ecoregions of North America Shapefile</title>
            <geoform>vector digital data</geoform>
            <onlink>https://www.epa.gov/eco-research/ecoregions-north-america</onlink>
            <onlink>https://dmap-prod-oms-edc.s3.us-east-1.amazonaws.com/ORD/Ecoregions/cec_na/NA_CEC_Eco_Level3.zip</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>20100501</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>EPA Ecoregions Level III</srccitea>
        <srccontr>Spatial data for the boundaries of the EPA Ecoregions Level III used in Wieczorek and others (2018) and for visual validation of regional aggregations within this study.</srccontr>
      </srcinfo>
      <procstep>
        <procdesc>The U.S. EPA's National Rivers and Streams Assessment (EPA NRSA) is a collaborative survey through the U.S. Environmental Protection Agency. Data are collected from flowing rivers and streams across the United States. The EPA uses these data to develop benchmark values for ecoregions denoting "poor", "fair", or "good" condition for metrics of stream health such as benthic macroinvertebrate assemblage condition and stream nutrient concentrations. For NRSA, the benchmark value associated with each condition category is typically based on a deviation from a least-disturbed condition, inferred from a set of reference sites. Please see more details in individual attributes or the NRSA technical support document (EPA NRSA) for more details. For nutrients, the value at and below the 75th percentile of the reference-site distribution for each ecoregion was used to define the good-fair boundary, and is analogous to a background reference value. The 95th percentile and above of the reference-site distribution was used to define the most disturbed fair-poor boundary and is used as a presumptive standard for ecological impact, which is later used by NRSA to quantify increased odds ratios of fish and macroinvertebrate conditions also being in poor condition. 

The percent overlap of each NHDPlus V2.1 catchment with each EPA Ecoregion Level III polygon was obtained from Wieczorek and others (2018). Catchments were preprocessed to find the single dominant Ecoregion Level III, which were then crosswalked to the EPA NRSA aggregated ecoregions using Table 3 from Herlihy and others (2008). Some ecoregions were missing from the aggregation list in Table 3, but were indicated on the aggregated region map for CONUS provided in Figure 3, which matched aggregated maps provided by  EPA NRSA reports . These missing ecoregions were then filled in manually based on spatial positioning and aggregated region indicated in Figure 3 from Herlihy (2008).  

Lastly, the benchmark cutoffs for good and poor condition for benthic macroinvertebrate, fish, total nitrogen, total phosphorus, and salinity from the EPA NRSA 2018/2019 technical support document (EPA NRSA) were joined to the EPA NRSA aggregated ecoregions associated with each NHDPlus v2.1 catchment.

The dataset at this step contained each NHDPlus V2.1 catchment with associated EPA NRSA aggregated ecoregion and the NRSA benchmark for that aggregated ecoregion.</procdesc>
        <srcused>NHDPlus v2.1</srcused>
        <srcused>EPA NRSA</srcused>
        <srcused>Herlihy and others (2008)</srcused>
        <srcused>Wieczorek and others (2018)</srcused>
        <srcused>EPA Ecoregions Level III</srcused>
        <procdate>20240419</procdate>
      </procstep>
      <procstep>
        <procdesc>As part of the U.S. EPA's National Nutrient Strategy (EPA NNS), the U.S. EPA uses found historical and recent nutrient observed data and reference site data to develop total phosphorus, total nitrogen, Chlorophyll a and turbidity benchmark criteria for aggregated Ecoregion level III across the United States. These benchmarks were obtained by ecoregion from the EPA NNS summary table. Additional ecoregion specific documents are available which document how each benchmark was derived.

The percent overlap of each NHDPlus v2.1 catchment with each Ecoregion Level III was obtained from Wieczorek (2018). Catchments were preprocessed to find the single dominant Ecoregion Level III, which were then crosswalked to the EPA National Nutrient Strategy aggregated ecoregions using Rohm and others (2007). Crosswalk was visually compared with Figure 1 provided on the EPA NNS website that visually shows Ecoregion Level III boundaries and the associated NNS aggregation numbers. 

Lastly, the benchmark cutoffs used by the National Nutrient Strategy Ecoregional recommended nutrient criteria were then joined to the EPA NNS aggregated ecoregions associated with each NHDPlus v2.1 catchment. 

The dataset produced by this step contained each NHDPlus v2.1 catchment with associated NNS aggregated ecoregion and the NNS benchmark criteria for that aggregated ecoregion. These data were then joined to the output of Step 1 by catchment COMID.</procdesc>
        <srcused>NHDPlus v2.1</srcused>
        <srcused>EPA NNS summary table</srcused>
        <srcused>Rohm and others (2007)</srcused>
        <srcused>Wieczorek and others (2018)</srcused>
        <srcused>EPA NNS website</srcused>
        <procdate>20240419</procdate>
      </procstep>
      <procstep>
        <procdesc>The EPA Freshwater Explorer is an interactive web-based mapping tool developed by the U.S. EPA  for displaying water quality data (Cormier and others, 2021). This web tool contains visualizations of predicted background salinity for all NHDPlus v2.1 catchments in the continental U.S., as originally developed using machine learning methods by Olson and Cormier (2019) . 

Values from background reference value modeling efforts were downloaded from the EPA Freshwater Explorer landing page for Predicted Background Conductivity Data (Cormier and others, 2021). Predictions were downloaded as a shapefile and then exported from ArcGIS Pro as a data table. Predictions were added to the water_quality_benchmark dataset by joining by NHDPlusv2.1 COMID.

More information about the development of the predicted background specific conductance can be found in Olson and Cormier (2019).</procdesc>
        <srcused>EPA Freshwater Explorer</srcused>
        <procdate>20240419</procdate>
        <proccont>
          <cntinfo>
            <cntperp>
              <cntper>John R Olson</cntper>
              <cntorg>California State University Monterey Bay</cntorg>
            </cntperp>
            <cntaddr>
              <addrtype>mailing and physical</addrtype>
              <address>100 Campus Center</address>
              <city>Seaside</city>
              <state>California</state>
              <postal>93955</postal>
              <country>United States</country>
            </cntaddr>
            <cntvoice>831-582-3873</cntvoice>
            <cntemail>joolson@csumb.edu</cntemail>
          </cntinfo>
        </proccont>
      </procstep>
      <procstep>
        <procdesc>Taxonomic thermal thresholds were produced by combining several distinct data sources. Specifically, we accessed data on fish community diversity across CONUS by accessing species distribution models at the NHDPlus v2.1 for 419 species across CONUS from Yu and others (2022). We accessed taxa-specific CTmax thermal tolerance data from Comte and Olden (2017), and ancillary fish trait data on spawning months from Fimpong and Angermeir (2008). 

All taxonomic data were harmonized through the Information Taxonomic Information System (ITIS) using the ritis package in R.  Thermal tolerances were imputed for taxa missing tolerance data using phylogenetic imputation using the fishtree package in R (Chang and others 2019). This created a tall table of all fish taxa predicted to be present for each COMID in CONUS, along with their CTmax values (contained in the dataset CTmax_taxa_NHD_CONUS.parquet in this data release). A summary dataset was then created to represent community-level metrics calculated from the individual-taxon data compiled in the previous step, including the minimum,was then created off the taxa-specific raw data in each COMID via calculating the minima, mean, and quantiles (5%, 10%, 25% and 50%/median) for the CTmax of all fish taxa present in a COMID, as well as the number of taxa predicted in that COMID. The summarized community-level metrics were appended to the compiled benchmark dataset in water_quality_benchmarks.parquet.</procdesc>
        <srcused>Comte and Olden (2017)</srcused>
        <srcused>Chang and others (2019)</srcused>
        <srcused>ritis</srcused>
        <srcused>Yu and others (2022)</srcused>
        <srcused>Frimpong and Angermeir (2009)</srcused>
        <procdate>20250130</procdate>
        <proccont>
          <cntinfo>
            <cntperp>
              <cntper>Matthew J Cashman</cntper>
              <cntorg>USGS - WATER</cntorg>
            </cntperp>
            <cntpos>Supervisory Hydrologist</cntpos>
            <cntaddr>
              <addrtype>mailing and physical</addrtype>
              <address>MD/DE/DC WSC UMBC Campus, UMBC Research Park</address>
              <city>Catonsville</city>
              <state>MD</state>
              <postal>21228</postal>
            </cntaddr>
            <cntvoice>443-498-5511</cntvoice>
            <cntemail>mcashman@usgs.gov</cntemail>
          </cntinfo>
        </proccont>
      </procstep>
    </lineage>
  </dataqual>
  <eainfo>
    <detailed>
      <enttyp>
        <enttypl>water_quality_benchmarks.csv</enttypl>
        <enttypd>Comma Separated Value (CSV) file containing data on ecologically relevant benchmarks across CONUS. Each row is one COMID and each column contains a benchmark, threshold, or related ancillary information. COMIDs are not included where no benchmarks are available.</enttypd>
        <enttypds>Producer Defined</enttypds>
      </enttyp>
      <attr>
        <attrlabl>comid</attrlabl>
        <attrdef>NHDPlus V2.1 COMID, a unique identifier for each catchment in the NHDPlus V2 hydrographic network</attrdef>
        <attrdefs>NHDPlus v2.1</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>-504212</rdommin>
            <rdommax>948100740</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ECO3_id</attrlabl>
        <attrdef>EPA Ecoregion level III numeric identification code (1-85)</attrdef>
        <attrdefs>EPA Ecoregion Level III</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>NA values for this field typically indicate that the flowlines and/or catchments do not intersect the Ecoregion Level III boundaries used in this study. For example, this often would occur in areas of open water, such as flowlines associated with the open water areas of the Delaware River, or where the COMID was outside of the conterminuous US, which included many areas near or along the border with Canada.</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <codesetd>
            <codesetn>EPA Ecoregion level III Number</codesetn>
            <codesets>EPA Ecoregion Level III</codesets>
          </codesetd>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ECO3_name</attrlabl>
        <attrdef>EPA Ecoregion level III text name</attrdef>
        <attrdefs>EPA Ecoregion Level III</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <udom>The text name for each of the Ecoregion Level III used by EPA in EPA Ecoregion Level III</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ECO3_NRSA_agg_name</attrlabl>
        <attrdef>Text name of the EPA National Rivers and Streams Assessment aggregated version of Ecoregion level III</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <udom>The full text name of the aggregated Ecoregion Level III regions that are used by the EPA National Rivers and Streams Assessment and first defined in Herlihy and others (2008)</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ECO3_NRSA_agg_abrv</attrlabl>
        <attrdef>Abbreviated form of the EPA National Rivers and Streams Assessment aggregated version of Ecoregion level III</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <udom>An abbreviated version of full text name of the aggregated Ecoregion Level III regions that are used by the EPA National Rivers and Streams Assessment and first defined in Herlihy and others (2008)</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NRSA_macro_MMI_good</attrlabl>
        <attrdef>Benchmark boundary between Good (above) and Fair condition (below) for the macroinvertebrate multi-metric index defined by the EPA National Rivers and Streams Assessment. Benchmark was calculated via a process of benchmark regression adjustment modeling, as detailed in Herlihy and others (2008), which uses a PCA disturbance gradient to reduce the effects of disturbance on benchmark values within the reference site population. The adjusted benchmarks are intended to be analogous to the 5th (Poor) and 25th (Good) percentile of reference sites in each region based on the slope of the MMI-disturbance relationship in each region. Samples used in this determination were collected during spring and summer low-flow conditions.</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>35.5</rdommin>
            <rdommax>57</rdommax>
            <attrunit>Unitless</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NRSA_macro_MMI_poor</attrlabl>
        <attrdef>Benchmark boundary between Poor (below) and Fair condition (above) for the macroinvertebrate multi-metric index defined by the EPA National Rivers and Streams Assessment. Benchmark was calculated via a process of benchmark regression adjustment modeling, as detailed in Herlihy and others (2008), which uses a PCA disturbance gradient to reduce the effects of disturbance on benchmark values within the reference site population. The adjusted benchmarks are intended to be analogous to the 5th (Poor) and 25th (Good) percentile of reference sites in each region based on the slope of the MMI-disturbance relationship in each region. Samples used in this determination were collected during spring and summer low-flow conditions.</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>21.3</rdommin>
            <rdommax>42.8</rdommax>
            <attrunit>Unitless</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NRSA_fish_MMI_good</attrlabl>
        <attrdef>Benchmark boundary between Good (above) and Fair condition (below) for the fish multi-metric index defined by the EPA National Rivers and Streams Assessment. Benchmark was calculated via a process of benchmark regression adjustment modeling, as detailed in Herlihy and others (2008), which uses a PCA disturbance gradient to reduce the effects of disturbance on benchmark values within the reference site population. The adjusted benchmarks are intended to be analogous to the 5th (Poor) and 25th (Good) percentile of reference sites in each region based on the slope of the MMI-disturbance relationship in each region. Samples used in this determination were collected during spring and summer low-flow conditions.</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>39.8</rdommin>
            <rdommax>76.8</rdommax>
            <attrunit>Unitless</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NRSA_fish_MMI_poor</attrlabl>
        <attrdef>Benchmark boundary between Fair (above) and Poor condition (below) for the fish multi-metric index defined by the EPA National Rivers and Streams Assessment. Benchmark was calculated via a process of benchmark regression adjustment modeling, as detailed in Herlihy and others (2008), which uses a PCA disturbance gradient to reduce the effects of disturbance on benchmark values within the reference site population. The adjusted benchmarks are intended to be analogous to the 5th (Poor) and 25th (Good) percentile of reference sites in each region based on the slope of the MMI-disturbance relationship in each region. Samples used in this determination were collected during spring and summer low-flow conditions.</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>29.3</rdommin>
            <rdommax>65.4</rdommax>
            <attrunit>Unitless</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NRSA_TN_ugL_good</attrlabl>
        <attrdef>Benchmark boundary between Good (below) and Fair condition (above) for total nitrogen as defined by the EPA National Rivers and Streams Assessment. Benchmark calculated by taking the 75th percentile of samples collected in least-disturbed, reference sites within the ecoregion. Samples were collected during spring and summer low-flow conditions.</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>139</rdommin>
            <rdommax>700</rdommax>
            <attrunit>Microgram per liter</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NRSA_TN_ugL_poor</attrlabl>
        <attrdef>Benchmark boundary between Fair (below) and Poor condition (above) for total nitrogen as defined by the EPA National Rivers and Streams Assessment. Benchmark calculated by taking the 95th percentile of samples collected in least-disturbed, reference sites within the ecoregion. Samples were collected during spring and summer low-flow conditions.</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>249</rdommin>
            <rdommax>1274</rdommax>
            <attrunit>Microgram per liter</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NRSA_TP_ugL_good</attrlabl>
        <attrdef>Benchmark boundary between Good (below) and Fair condition (above) for total phosphorus as defined by the EPA National Rivers and Streams Assessment. Benchmark calculated by taking the 75th percentile of samples collected in least-disturbed, reference sites within the ecoregion. Samples were collected during spring and summer low-flow conditions.</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>14.8</rdommin>
            <rdommax>88.6</rdommax>
            <attrunit>Microgram per liter</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NRSA_TP_ugL_poor</attrlabl>
        <attrdef>National Rivers and Streams assessment aggregated ecoregion-specific benchmark for stream total phosphorus concentration above which stream total phosphorus concentration is considered in poor condition. Benchmark boundary between Fair (below) and Poor condition (above) for total phosphorus as defined by the EPA National Rivers and Streams Assessment. Benchmark calculated by taking the 95th percentile of samples collected in least-disturbed, reference sites within the ecoregion. Samples were collected during spring and summer low-flow conditions.</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>24.4</rdommin>
            <rdommax>143</rdommax>
            <attrunit>Microgram per liter</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NRSA_salinity_SC_uScm_good</attrlabl>
        <attrdef>Benchmark boundary between Good (below) and Fair condition (above) for specific conductance as salinity as defined by the EPA National Rivers and Streams Assessment. Benchmark was not defined by field collected samples, as per nitrogen and phosphorus benchmarks, but instead given an ecoregional value without justification.</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>500</rdommin>
            <rdommax>1000</rdommax>
            <attrunit>microSiemens per centimeter</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NRSA_salinity_SC_uScm_poor</attrlabl>
        <attrdef>Benchmark boundary between Fair (below) and Poor condition (above) for specific conductance as salinity as defined by the EPA National Rivers and Streams Assessment. Benchmark was not defined by field collected samples at reference sites, as per nitrogen and phosphorus benchmarks, but instead given an ecoregional value without justification.</attrdef>
        <attrdefs>EPA NRSA</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>1000</rdommin>
            <rdommax>2000</rdommax>
            <attrunit>microSiemens per centimeter</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ECO3_NNS_agg_name</attrlabl>
        <attrdef>Text name of the EPA National Nutrient Strategy aggregated version of Ecoregion level III</attrdef>
        <attrdefs>EPA NNS</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <udom>The full text name of the aggregated Ecoregion Level III regions that are used by the National Nutrient Strategy and first defined in Rohm and others (2007)</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ECO3_NNS_agg_id</attrlabl>
        <attrdef>Roman numeral identity code of the EPA National Nutrient Strategy aggregated version of Ecoregion level III, as used in source publications.</attrdef>
        <attrdefs>EPA NNS</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <udom>The roman numeral identity code values for the aggregated Ecoregion Level III regions that are used by the National Nutrient Strategy and first defined in  Rohm and others (2007)</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ECO3_NNS_agg_abrv</attrlabl>
        <attrdef>Abbreviated form of the EPA National Nutrient Strategy aggregated version of Ecoregion level III</attrdef>
        <attrdefs>EPA NNS</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <udom>The abbreviated form of the full next names of the aggregated Ecoregion Level III regions that are used by the National Nutrient Strategy and first defined in  Rohm and others (2007)</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NNS_TN_mgL</attrlabl>
        <attrdef>EPA recommended regional criteria for total nitrogen concentration for rivers and streams as presented in the National Nutrient Strategy. Criterion was developed from the median value across the 25th percentile of the distribution of found data for data categorized by season. Please see original source data for more details about calculations within each specific ecoregion.</attrdef>
        <attrdefs>EPA NNS</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>0.12</rdommin>
            <rdommax>2.18</rdommax>
            <attrunit>Milligrams per liter</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NNS_TP_ugL</attrlabl>
        <attrdef>EPA recommended regional criteria for total phosphorus concentration for rivers and streams as presented in the National Nutrient Strategy. Criterion was developed from the median value across the 25th percentile of the distribution of found data for data categorized by season. Please see original source data for more details about calculations within each specific ecoregion.</attrdef>
        <attrdefs>EPA NNS</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>10</rdommin>
            <rdommax>128</rdommax>
            <attrunit>Micrograms per liter</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NNS_TP_notes</attrlabl>
        <attrdef>Specific notes column from source dataset that indicated suspect and/or outlier values for investigation.</attrdef>
        <attrdefs>EPA NNS</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <udom>Specific notes that indicated suspect and/or outlier values for investigation.</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NNS_turb</attrlabl>
        <attrdef>EPA recommended regional criteria for turbidity for rivers and streams as presented in the National Nutrient Strategy. Criterion was developed from the median value across the 25th percentile of the distribution of found data for data categorized by season. Please see original source data for more details about calculations within each specific ecoregion.</attrdef>
        <attrdefs>EPA NNS</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>1.3</rdommin>
            <rdommax>17.5</rdommax>
            <attrunit>Variable, see NNS_turb_units column</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NNS_turb_units</attrlabl>
        <attrdef>Turbidity units used to develop the turbidity benchmark threshold within the ecoregion, based on the availability of data at the time of creation.</attrdef>
        <attrdefs>EPA NNS</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>FTU</edomv>
            <edomvd>Units = Formazin Turbidity Unit</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>NTU</edomv>
            <edomvd>Units = Nephelometric Turbidity Units</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NNS_chla_ugL</attrlabl>
        <attrdef>EPA recommended regional criteria for chlorophyll-a for rivers and streams as presented in the National Nutrient Strategy. Criterion was developed from the median value across the 25th percentile of the distribution of found data for data categorized by season. Please see original source data for more details about calculations within each specific ecoregion.</attrdef>
        <attrdefs>EPA NNS</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>0.4</rdommin>
            <rdommax>3.75</rdommax>
            <attrunit>Micrograms per liter</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>NNS_chla_method</attrlabl>
        <attrdef>Specific method used to determine chlorophyll-a concentration thresholds for the region. Can be determined from flourometric or spectrophotometric methods, which are not distinctly interchangeable</attrdef>
        <attrdefs>EPA NNS</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Fluorometric</edomv>
            <edomvd>Method measures the flourescence of chlorophyll when excited by a specific wavelength of light using a flourometer</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Spectrophotometric</edomv>
            <edomvd>Method measures the absorbance of specific wavelengths of light by chlorophyll using a spectrophotometer.</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>OlsonCormier_background_SC</attrlabl>
        <attrdef>Reference value of predicted natural background specific conductance by each unique stream reach. Predicted background specific conductance is the average specific conductance from a model using reference sites modeled monthly between 2000 and 2015.</attrdef>
        <attrdefs>Olson and others 2021</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>10.2933701657459</rdommin>
            <rdommax>1703.74806629834</rdommax>
            <attrunit>microSiemens per centimeter</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>community_CTmax_mean</attrlabl>
        <attrdef>Threshold corresponding to the average critical thermal maximum (CTmax) of all fish taxa predicted to be present in the stream reach, according to the modeled species distributions by Yu and others, 2022</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data. This is due to none of the species from Yu and others (2022) predicted to be present in this COMID and/or the fish present do not have CTmax values.</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>25.83</rdommin>
            <rdommax>40.29</rdommax>
            <attrunit>Degrees Celsius</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>community_CTmax_min</attrlabl>
        <attrdef>Threshold corresponding to the minimal critical thermal maximum (CTmax) of all fish taxa predicted to be present in the stream reach, according to the modeled species distributions by Yu and others, 2022</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data. This is due to none of the species from Yu and others (2022) predicted to be present in this COMID and/or the fish present do not have CTmax values.</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>25.83</rdommin>
            <rdommax>40.29</rdommax>
            <attrunit>Degrees Celsius</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>community_CTmax_q5</attrlabl>
        <attrdef>Threshold corresponding to the critical thermal maximum (CTmax) of the 5th percentile of all fish taxa predicted to be present in the stream reach, according to the modeled species distributions by Yu and others, 2022</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data. This is due to none of the species from Yu and others (2022) predicted to be present in this COMID and/or the fish present do not have CTmax values.</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>25.83</rdommin>
            <rdommax>40.29</rdommax>
            <attrunit>Degrees Celsius</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>community_CTmax_q10</attrlabl>
        <attrdef>Threshold corresponding to the critical thermal maximum (CTmax) of the 10th percentile of all fish taxa predicted to be present in the stream reach, according to the modeled species distributions by Yu and others, 2022</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data. This is due to none of the species from Yu and others (2022) predicted to be present in this COMID and/or the fish present do not have CTmax values.</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>25.83</rdommin>
            <rdommax>40.29</rdommax>
            <attrunit>Degrees Celsius</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>community_CTmax_q25</attrlabl>
        <attrdef>Threshold corresponding to the critical thermal maximum (CTmax) of the 25th percentile of all fish taxa predicted to be present in the stream reach, according to the modeled species distributions by Yu and others, 2022</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data. This is due to none of the species from Yu and others (2022) predicted to be present in this COMID and/or the fish present do not have CTmax values.</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>25.83</rdommin>
            <rdommax>40.29</rdommax>
            <attrunit>Degrees Celsius</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>community_CTmax_q50</attrlabl>
        <attrdef>Threshold corresponding to the critical thermal maximum (CTmax) of the median (50th percentile) of all fish taxa predicted to be present in the stream reach, according to the modeled species distributions by Yu and others, 2022</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data. This is due to none of the species from Yu and others (2022) predicted to be present in this COMID and/or the fish present do not have CTmax values.</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>25.83</rdommin>
            <rdommax>40.29</rdommax>
            <attrunit>Degrees Celsius</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>community_CTmax_num_taxa</attrlabl>
        <attrdef>Total number of fish taxa predicted for this stream reach, out of the total number of taxa modeled in the species distribution models of Yu and others, 2022</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No Data. This is due to none of the species from Yu and others (2022) predicted to be present in this COMID and/or the COMID was outside of the modeling domain by Yu and others (2022).</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>79</rdommax>
            <attrunit>Count</attrunit>
          </rdom>
        </attrdomv>
      </attr>
    </detailed>
    <detailed>
      <enttyp>
        <enttypl>Temperature_CTmax_taxa_NHD_CONUS.parquet</enttypl>
        <enttypd>A parquet file containing data on species-specific fish thermal thresholds across CONUS. Each row is the unique combination of fish taxa predicted to be present by each COMID in CONUS. Columns are information about the fish taxa, thermal threshold, information on how that was derived, and related spawning probability trait information. COMIDs are not included where no fish taxa predicted by the species distribution models of Yu et al (2022)</enttypd>
        <enttypds>Producer Defined</enttypds>
      </enttyp>
      <attr>
        <attrlabl>comid</attrlabl>
        <attrdef>NHDPlus V2.1 COMID, a unique identifier for each catchment in the NHDPlus V2 hydrographic network</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>101</rdommin>
            <rdommax>948100736</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>accepted_tsn</attrlabl>
        <attrdef>The unique Integrated Taxonomic Information System's (ITIS's) Taxonomic Serial Number (TSN) for each taxon. This attribute matches the accepted_tsn attribute present in the unique_tsn_crosswalk.csv dataset.</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <codesetd>
            <codesetn>ITIS Taxonomic Serial Number</codesetn>
            <codesets>Integrated Taxonomic Information System (ITIS)</codesets>
          </codesetd>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>accepted_name</attrlabl>
        <attrdef>Scientific name returned by ITIS</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <udom>Scientific names of the various taxa in this study, as returned and updated by ITIS</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>CTmax</attrlabl>
        <attrdef>Critical thermal limit maxima, the upper thermal tolerance, for the fish species from lab-controlled experiments. Historically, CTmax has been defined as 'the thermal point at which locomotory activity becomes disorganized and the animal loses its ability to escape from conditions that will promptly lead to its death' (Cowles &amp; Bogert, 1944).</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No CTmax data were in the source dataset or able to be imputed for this species</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>25.83</rdommin>
            <rdommax>40.29</rdommax>
            <attrunit>degrees Celsius</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>CTmax_SD</attrlabl>
        <attrdef>Standard deviation of the CTmax as derived from lab-experiments. Not available when threshold was phylogenetically interpolated</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>Not available when threshold was phylogenetically interpolated or when standard deviation was not available for the lab-experiment as reported in the source dataset.</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>0.14</rdommin>
            <rdommax>4.22</rdommax>
            <attrunit>degrees Celsius</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>imputed</attrlabl>
        <attrdef>True/False whether thermal tolerance threshold was imputed for this fish taxa for this study</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NA</edomv>
            <edomvd>No CTmax data were in the source dataset or able to be imputed for this species</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <rdom>
            <rdommin>0</rdommin>
            <rdommax>1</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>phy_tree_name</attrlabl>
        <attrdef>Name of the phylogenetic tree from the Fish Tree of Life, which is considered in the imputation procedure</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <edom>
            <edomv>Polyodon_spathula</edomv>
            <edomvd>Phylogenetic tree associated with Polyodon spathula was used in imputation</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Lepisosteus_osseus</edomv>
            <edomvd>Phylogenetic tree associated with Lepisosteus osseus was used in imputation</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Lepisosteus_oculatus</edomv>
            <edomvd>Phylogenetic tree associated with Lepisosteus oculatus was used in imputation</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Amia_calva</edomv>
            <edomvd>Phylogenetic tree associated with Amia calva was used in imputation</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Dorosoma_cepedianum</edomv>
            <edomvd>Phylogenetic tree associated with Dorosoma cepedianum was used in imputation</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Esox_americanus_americanus</edomv>
            <edomvd>Phylogenetic tree associated with Esox americanus americanus was used in imputation</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Notemigonus_crysoleucas</edomv>
            <edomvd>Phylogenetic tree associated with Notemigonus crysoleucas was used in imputation</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Notropis_buchanani</edomv>
            <edomvd>Phylogenetic tree associated with Notropis buchanani was used in imputation</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Notropis_texanus</edomv>
            <edomvd>Phylogenetic tree associated with Notropis texanus was used in imputation</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Notropis_volucellus</edomv>
            <edomvd>Phylogenetic tree associated with Notropis volucellus was used in imputation</edomvd>
            <edomvds>This study</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>jan</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in January</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>feb</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in Febuary</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>mar</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in March</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>apr</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in April</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>may</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in May</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>jun</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in June</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>jul</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in July</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>aug</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in August</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>sep</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in September</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>oct</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in October</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>nov</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in November</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>dec</attrlabl>
        <attrdef>Probability weight of spawning occurring for this taxon in December</attrdef>
        <attrdefs>This study</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.0</rdommin>
            <rdommax>1.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
    </detailed>
    <detailed>
      <enttyp>
        <enttypl>unique_tsn_crosswalk.csv</enttypl>
        <enttypd>Comma Separated Value (CSV) file containing data indicating the crosswalk of names used by Yu and others (2022) and those used for this study after updates from The Integrated Taxonomic Information System (ITIS).</enttypd>
        <enttypds>Producer Defined</enttypds>
      </enttyp>
      <attr>
        <attrlabl>agap_itis_tsn</attrlabl>
        <attrdef>The unique Integrated Taxonomic Information System's (ITIS's) Taxonomic Serial Number (TSN) for each taxon, as originally returned, in the AGAP modeling done by Yu and others (2022)</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>159705</rdommin>
            <rdommax>913996</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>agap_scientific_name</attrlabl>
        <attrdef>Scientific name returned by ITIS and used in the AGAP modeling done by Yu and others (2022)</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>Scientific names of various taxa in this study, as originally reported by Yu and others (2022)</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>agap_common_name</attrlabl>
        <attrdef>Common names used in the AGAP modeling project as done by Yu and others (2022)</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>Common names of various taxa in this study, as originally reported by Yu and others (2022)</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>agap_accepted_tsn</attrlabl>
        <attrdef>The unique Integrated Taxonomic Information System's (ITIS's) Taxonomic Serial Number (TSN) for each taxon, as accepted and used in the AGAP modeling done by Yu and others (2022)</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>159705</rdommin>
            <rdommax>1199065</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>accepted_tsn</attrlabl>
        <attrdef>The unique Integrated Taxonomic Information System's (ITIS's) Taxonomic Serial Number (TSN) for each taxon, used in this study. This attribute matches the accepted_tsn attribute present in the Temperature_CTmax_taxa_NHD_CONUS.parquet dataset.</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>159705</rdommin>
            <rdommax>1199065</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>accepted_name</attrlabl>
        <attrdef>Scientific name returned by ITIS and used in this study</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>Scientific names of the various taxa in this study, as returned and updated by ITIS</udom>
        </attrdomv>
      </attr>
    </detailed>
    <detailed>
      <enttyp>
        <enttypl>variable_lookup.csv</enttypl>
        <enttypd>Data dictionary of variables contained in other files of this data release. Also used to facilitate (optional) remote data querying capabilities of the parquet files when hosted in the cloud.</enttypd>
        <enttypds>Producer Defined</enttypds>
      </enttyp>
      <attr>
        <attrlabl>variable</attrlabl>
        <attrdef>Variable names of columns contained in other files of this dataset.</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>Variable names</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>description</attrlabl>
        <attrdef>Variable description for columns contained in other files of this dataset. Should match the attribute description for the respective entities in this metadata record</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>Custom variable descriptions</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>units</attrlabl>
        <attrdef>Reporting units for the benchmarks contained in this dataset</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>&lt;&lt; empty cell &gt;&gt;</edomv>
            <edomvd>Not applicable for this variable</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>mg/L</edomv>
            <edomvd>milligrams per liter</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>μg/L</edomv>
            <edomvd>microgram per liter</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Multiple units, please see turb_units column</edomv>
            <edomvd>Multiple units of this variable exist in the source data. Please see the turb_units column for specific unit used in this region</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>μS/cm</edomv>
            <edomvd>microSiemens per centimeter</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>degrees celsius</edomv>
            <edomvd>degrees celsius</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Count</edomv>
            <edomvd>count of the total number of species reported to be present</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Unitless</edomv>
            <edomvd>Does not have an explicit unit</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>benchmark_class</attrlabl>
        <attrdef>Sub-category of benchmark corresponding to each benchmark.</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>&lt;&lt; empty cell &gt;&gt;</edomv>
            <edomvd>This variable is not an explicit benchmark, and thus has no class</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>criterion</edomv>
            <edomvd>A criterion is a type of benchmark that is used or suggested for regulatory purposes, including enforceable and nonenforceable standards.</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>background reference value</edomv>
            <edomvd>Background reference values are a class of benchmark that indicate conditions representative of no to minimal amounts of human activity or disturbance</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>presumptive standard</edomv>
            <edomvd>Presumptive standards are a class of benchmark that lack a clear and quantifiable degree of impact and where negative effects are merely presumed to occur.</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>threshold</edomv>
            <edomvd>A threshold indicate a class of benchmark that indicate a significant, pre-defined quantifiable change in condition.</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>category</attrlabl>
        <attrdef>Type of water quality constituent the benchmark relates to, or category of related non-constituent metadata.</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>Region</edomv>
            <edomvd>This variable contains information on the regional aggregation used in the source dataset</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Nitrogen</edomv>
            <edomvd>This variable is related to a nitrogen benchmark</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Phosphorus</edomv>
            <edomvd>This variable is related to a phosphorus benchmark</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Sediment</edomv>
            <edomvd>This variable is related to a sediment benchmark</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Chlorophyll</edomv>
            <edomvd>This variable is related to a chlorophyll benchmark</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Biotic</edomv>
            <edomvd>This variable is related to a non-chlorophyll biological benchmark</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Salinity</edomv>
            <edomvd>This variable is related to a salinity benchmark</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Temperature</edomv>
            <edomvd>This variable is related to a temperature benchmark</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Traits</edomv>
            <edomvd>This variable is related to life history or functional traits of an aquatic species</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>source</attrlabl>
        <attrdef>The original source that the benchmark data were compiled from Frimpong and Angermeier 2009.</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>EPA National Nutrient Strategy Ecoregional Nutrient Criteria for rivers and streams; https://www.epa.gov/system/files/documents/2021-07/ecoregion-table-rivers-streams.pdf</edomv>
            <edomvd>The EPA National Nutrient Strategy recommended regional criteria</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>EPA 2013-2014 National Rivers and Streams Assessment; https://www.epa.gov/system/files/documents/2024-10/nrsa-2013-14-final-tsd-version-1.1-september-2024_.pdf</edomv>
            <edomvd>EPA National Rivers and Streams Assessment 2013-2014 cycle, taken from the Technical document</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Olson and Cormier 2019: https://pubs.acs.org/doi/10.1021/acs.est.8b06777; Data accessed from the EPA Freshwater Explorer: https://arcg.is/KHb9S</edomv>
            <edomvd>Manuscript published by Olson and Cormier 2019 but data visualized and accessed through the EPA Freshwater Explorer online data visualization platform</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>This study, derived from Yu and others, 2022; https://doi.org/10.5066/P1UV25FW</edomv>
            <edomvd>This benchmark was created for the purposes of this data release by the project team. Even though the benchmark uses other source datasets (see Processing Steps) these values did not originate from another source.</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>ITIS; https://www.itis.gov</edomv>
            <edomvd>Value taken from the Integrated Taxonomic Information System available at https://www.itis.gov</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Comte and Olden 2017; https://doi.org/10.1038/nclimate3382</edomv>
            <edomvd>Value taken from Comte and Olden's 2017 in Nature Climate Change, "Climactic vulnerability of the world's freshwater and marine fishes"</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Chang and others, 2019; https://doi.org/10.1111/2041-210X.13182</edomv>
            <edomvd>Phylogenetic tree used was obtained from the R package fishtree which accesses the Fish Tree of Life website</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Frimpong and Angermeier, 2011; https://doi.org/10.1577/1548-8446-34.10.487</edomv>
            <edomvd>Values obtained from the Fish Traits database by Frimpong and Angermeier 2009.</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>dataset_file</attrlabl>
        <attrdef>Which data file contains this variable and can be used to access the variable programatically in a cloud-hosted environment</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>water_quality_benchmarks.parquet</edomv>
            <edomvd>The primary dataset containing the water quality benchmarks for CONUS</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>Temperature_CTmax_taxa_NHD_CONUS.parquet</edomv>
            <edomvd>The dataset containing all the individual taxa-by-taxa records of fishes and their critical thermal maxima by COMID across CONUS</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>sb_item_name</attrlabl>
        <attrdef>The name of the Sciencebase Item containing this variable</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>Ecological Water Quality Benchmarks for the Conterminous United States, Attributed to the NHDPlus version 2.1</edomv>
            <edomvd>The title of this data release</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>sb_item_url</attrlabl>
        <attrdef>The url to get to the Sciencebase item that contains this variable</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>Website url</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>sb_id</attrlabl>
        <attrdef>The unique Sciencebase identifier that is used for the item containing this variable</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <codesetd>
            <codesetn>Sciencebase ID</codesetn>
            <codesets>Sciencebase</codesets>
          </codesetd>
        </attrdomv>
      </attr>
    </detailed>
    <overview>
      <eaover>An additional data dictionary table, called variable_lookup.csv is also provided which provides information on entity and attributes in a tabular format, as well as additional metadata to  to facilitate (optional) programmatic access and data querying capabilities of the parquet files when hosted in the cloud and using the USGS ScienceBase application programming interface (API).</eaover>
      <eadetcit>Matthew J. Cashman, Taylor E. Woods, Kelly O. Maloney, Daren M. Carlisle, and Lindsey J. Boyle, 20250901, Ecological Water Quality Benchmarks for the Conterminous United States, Attributed to the NHDPlus version 2.1: U.S. Geological Survey, https://doi.org/10.5066/P1FBQTVT.</eadetcit>
    </overview>
  </eainfo>
  <distinfo>
    <distrib>
      <cntinfo>
        <cntperp>
          <cntper>GS ScienceBase</cntper>
          <cntorg>U.S. Geological Survey</cntorg>
        </cntperp>
        <cntaddr>
          <addrtype>mailing address</addrtype>
          <address>Denver Federal Center, Building 810, Mail Stop 302</address>
          <city>Denver</city>
          <state>CO</state>
          <postal>80225</postal>
          <country>United States</country>
        </cntaddr>
        <cntvoice>1-888-275-8747</cntvoice>
        <cntemail>sciencebase@usgs.gov</cntemail>
      </cntinfo>
    </distrib>
    <distliab>Unless otherwise stated, all data, metadata and related materials are considered to satisfy the quality standards relative to the purpose for which the data were collected. Although these data and associated metadata have been reviewed for accuracy and completeness and approved for release by the U.S. Geological Survey (USGS), no warranty expressed or implied is made regarding the display or utility of the data for other purposes, nor on all computer systems, nor shall the act of distribution constitute any such warranty.</distliab>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>Digital Data</formname>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://doi.org/10.5066/P1FBQTVT</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
  </distinfo>
  <metainfo>
    <metd>20260302</metd>
    <metc>
      <cntinfo>
        <cntperp>
          <cntper>Matthew J Cashman</cntper>
          <cntorg>USGS - WATER</cntorg>
        </cntperp>
        <cntpos>Physical Scientist</cntpos>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>MD/DE/DC WSC UMBC Campus, UMBC Research Park</address>
          <city>Catonsville</city>
          <state>MD</state>
          <postal>21228</postal>
        </cntaddr>
        <cntvoice>443-498-5511</cntvoice>
        <cntemail>mcashman@usgs.gov</cntemail>
      </cntinfo>
    </metc>
    <metstdn>FGDC Content Standard for Digital Geospatial Metadata</metstdn>
    <metstdv>FGDC-STD-001-1998</metstdv>
  </metainfo>
</metadata>
