<?xml version='1.0' encoding='UTF-8'?>
<metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <idinfo>
    <citation>
      <citeinfo>
        <origin>Tyler V. King</origin>
        <pubdate>20260226</pubdate>
        <title>Crosswalk of Waterbody Identifiers between National Hydrography Dataset Plus High Resolution, National Hydrography Dataset Plus V2, and Hydrolakes Datasets for the Contiguous United States</title>
        <geoform>tabular digital data</geoform>
        <pubinfo>
          <pubplace>USGS ScienceBase</pubplace>
          <publish>USGS ScienceBase</publish>
        </pubinfo>
        <onlink>https://doi.org/10.5066/P13N85SW</onlink>
      </citeinfo>
    </citation>
    <descript>
      <abstract>There are multiple systems for mapping waterbodies within the conterminous United States. Prominent among them are the "High Resolution" and "Version 2" editions of the National Hydrography Dataset Plus. The National Hydrography Dataset Plus High Resolution (NHDPlusHR, Moore and others, 2025) uses "Permanent Identifiers" to identify waterbodies, while the National Hydrography Dataset Plus Version 2 (NHDPlusV2, McKay and others, 2012) uses "COMID" for the same purpose. While waterbodies may exist in both datasets, their spatial representation can vary, and there can be many-to-one relationships in both directions. One waterbody polygon in the NHDPlusHR may overlap with multiple waterbody polygons in the NHDPlusV2 and vice verse. The Hydrolakes dataset (Messager and others, 2016) is another system for mapping waterbodies. This system uses "Hylake Identifiers" to differentiate spatially explicit representations of waterbodies.

This datarelease provides a simple method to look up waterbody identifiers across these three datasets. In each datafile overlapping waterbodies were identified and the respective waterbodies identifiers are listed. The datafiles also include the surface area for each data source and the surface area that is shared between data sources.  Three of the files represent pairs of data sources. The fourth file represents the areas that are shared across all three datasets.

There is also a data dictionary file that contains definitions for every column in the datafiles.

Files:
- NHDPlusHR_NHDPlusV2.csv: Data file of lookup table between Permanent Identifier and COMID
- NHDPlusHR_Hydrolakes.csvL: Data file of lookup table between Permanent Identifier and HylakeID
- NHDPlusV2_Hydrolakes.csv: Data file of lookup table between Permanent Identifier and HylakeID
- NHDPlusHR_NHDPlusV2_Hydrolakes.csv: Data file of lookup table between Permanent Identifier and COMID and HylakeID
- Data_Dictionary_Waterbody_Crosswalk.csv: Metadata file with definitions for columns in the data files.

NHDPlusHR: Moore, R., McKay, L., Rea, A., Bondelid, T., Price, C., Dewald, T., &amp; Hayes, L. (2025). User’s guide for the National Hydrography Dataset Plus High Resolution (NHDPlus HR) (Report Nos. 2025–5031; Scientific Investigations Report, p. 78). USGS Publications Warehouse. https://doi.org/10.3133/sir20255031
NHDPlusV2: McKay, L., Bondelid, T., Dewald, T., Johnston, J., Moore, R., and Rea, A., 2012 “NHDPlus Version 2: 
User Guide”
Hydrolakes:  Messager, M. L., Lehner, B., Grill, G., Nedeva, I., &amp; Schmitt, O. (2016). Estimating the volume and age of water stored in global lakes using a geo-statistical approach. Nature Communications, 7(1), 13603. https://doi.org/10.1038/ncomms13603</abstract>
      <purpose>Provide simple lookup tables between waterbody identifiers across common hydrographic indexing systems for waterbodies in the Conterminous United States.</purpose>
    </descript>
    <timeperd>
      <timeinfo>
        <rngdates>
          <begdate>2015</begdate>
          <enddate>2024</enddate>
        </rngdates>
      </timeinfo>
      <current>observed</current>
    </timeperd>
    <status>
      <progress>Complete</progress>
      <update>Annually</update>
    </status>
    <spdom>
      <bounding>
        <westbc>-124.8047</westbc>
        <eastbc>-66.9500</eastbc>
        <northbc>49.3840</northbc>
        <southbc>24.8466</southbc>
      </bounding>
    </spdom>
    <keywords>
      <theme>
        <themekt>ISO 19115 Topic Category</themekt>
        <themekey>inlandWaters</themekey>
        <themekey>environment</themekey>
        <themekey>climatologyMeteorologyAtmosphere</themekey>
      </theme>
      <theme>
        <themekt>USGS Thesaurus</themekt>
        <themekey>Rivers</themekey>
        <themekey>Hydrology</themekey>
        <themekey>Water Resources</themekey>
        <themekey>Waterbodies</themekey>
        <themekey>Hydrography</themekey>
      </theme>
      <theme>
        <themekt>USGS Metadata Identifier</themekt>
        <themekey>USGS:692f2402d4be026ff273aa53</themekey>
      </theme>
      <place>
        <placekt>None</placekt>
        <placekey>CONUS</placekey>
        <placekey>Conterminous United States</placekey>
      </place>
    </keywords>
    <accconst>None. Please see 'Distribution Info' for details.</accconst>
    <useconst>The USGS Water Mission Area - Hydrologic Remote Sensing Branch was responsible for production of these data. All products developed from these data should cite this data release (https://doi.org/10.5066/P13N85SW).</useconst>
    <ptcontac>
      <cntinfo>
        <cntorgp>
          <cntorg>U.S Geological Survey</cntorg>
          <cntper>Tyler V King</cntper>
        </cntorgp>
        <cntpos>Supervisory Research Hydrologist</cntpos>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>230 Collins Rd</address>
          <city>Boise</city>
          <state>Idaho</state>
          <postal>83702</postal>
          <country>US</country>
        </cntaddr>
        <cntvoice>1-208-387-1352</cntvoice>
        <cntemail>tvking@usgs.gov</cntemail>
      </cntinfo>
    </ptcontac>
    <datacred>Please acknowledge the USGS Hydrologic Remote Sensing Branch and cite the USGS data release as the source for all products developed from these data.</datacred>
    <native>operating system: Windows 11
software: R 4.5.2
dependencies:
- terra = 1.8-80
- nhdplusTools = 1.4.0
- stringr = 1.6.0
- sbtools = 1.4.1</native>
    <crossref>
      <citeinfo>
        <origin>Luke A Winslow</origin>
        <origin>Scott Chamberlain</origin>
        <origin>Alison P Appling</origin>
        <origin>Jordan S Read</origin>
        <pubdate>2016</pubdate>
        <title>sbtools</title>
        <geoform>publication</geoform>
        <pubinfo>
          <pubplace>https://cran.r-project.org/web/packages/sbtools</pubplace>
          <publish>CRAN</publish>
        </pubinfo>
        <onlink>https://journal.r-project.org/articles/RJ-2016-029/index.html</onlink>
      </citeinfo>
    </crossref>
  </idinfo>
  <dataqual>
    <attracc>
      <attraccr>No formal attribute accuracy tests were conducted. Spot checks were conducted to ensure that final identifiers mapped to the same features in the initial datasets.</attraccr>
    </attracc>
    <logic>Values were checked to confirm that the computed overlapping areas were smaller than the areas of the initial datasets.</logic>
    <complete>This dataset includes waterbodies within the NHDPlusHR, NDHPlusV2, and Hydrolakes datasets that intersect each other.  Identifiers from each data source that are not included in these files indicate that there is no spatial overlap for that waterbody.</complete>
    <posacc>
      <horizpa>
        <horizpar>A formal accuracy assessment of the horizontal positional information in the data set has not been conducted.</horizpar>
      </horizpa>
      <vertacc>
        <vertaccr>A formal accuracy assessment of the vertical positional information is not applicable as there is no vertical position information in this dataset.</vertaccr>
      </vertacc>
    </posacc>
    <lineage>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Richard B. Moore</origin>
            <origin>Lucinda D. McKay</origin>
            <origin>Alan H. Rea</origin>
            <origin>Timothy R. Bondelid</origin>
            <origin>Curtis V. Price</origin>
            <origin>Thomas G. Dewald</origin>
            <origin>Laura Hayes</origin>
            <pubdate>2025</pubdate>
            <title>National Hydrography Dataset Plus High Resolution</title>
            <geoform>vector digital data</geoform>
            <pubinfo>
              <pubplace>https://pubs.usgs.gov/publication/sir20255031</pubplace>
              <publish>U.S. Geological Survey</publish>
            </pubinfo>
            <onlink>https://doi.org/10.3133/sir20255031</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2025</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>NHDPlusHR</srccitea>
        <srccontr>Provided NHDPlusHR waterbody polygons.</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Lucinda D. McKay</origin>
            <origin>Timothy R. Bondelid</origin>
            <origin>Thomas G. Dewald</origin>
            <origin>Craig Johnston</origin>
            <origin>Richard Moore</origin>
            <origin>Alan H. Rea</origin>
            <pubdate>2012</pubdate>
            <title>NHDPlus Version 2: User Guide</title>
            <geoform>vector digital data</geoform>
            <pubinfo>
              <pubplace>https://www.epa.gov/waterdata/get-nhdplus-national-hydrography-dataset-plus-data</pubplace>
              <publish>U.S Environmental Protection Agency</publish>
            </pubinfo>
            <onlink>https://www.epa.gov/waterdata/get-nhdplus-national-hydrography-dataset-plus-data</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2012</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>NHDPlusV2</srccitea>
        <srccontr>Provided NHDPlusV2 polygons</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Mathis Loïc Messager</origin>
            <origin>Bernhard Lehner</origin>
            <origin>Günther Grill</origin>
            <origin>Irena Nedeva</origin>
            <origin>Oliver Schmitt</origin>
            <pubdate>2016</pubdate>
            <title>HydroLAKES</title>
            <geoform>vector digital data</geoform>
            <pubinfo>
              <pubplace>https://www.hydrosheds.org/products/hydrolakes</pubplace>
              <publish>HydroSHEDS</publish>
            </pubinfo>
            <onlink>https://doi.org/10.1038/ncomms13603</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2016</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>Hydrolakes</srccitea>
        <srccontr>Provides Hydrolake polygons.</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>U.S. Geological Survey</origin>
            <pubdate>2025</pubdate>
            <title>Watershed Boundary Dataset (WBD)</title>
            <geoform>vector digital data</geoform>
            <pubinfo>
              <pubplace>https://prd-tnm.s3.amazonaws.com/index.html?prefix=StagedProducts/Hydrography/WBD/National/</pubplace>
              <publish>U.S. Geological Survey</publish>
            </pubinfo>
            <onlink>https://www.usgs.gov/national-hydrography/access-national-hydrography-products</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2025</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication date</srccurr>
        </srctime>
        <srccitea>WBD</srccitea>
        <srccontr>Provides HUC4 watershed boundaries</srccontr>
      </srcinfo>
      <procstep>
        <procdesc>Data download: Watershed boundaries were downloaded for the continuous United States from https://www.usgs.gov/national-hydrography/access-national-hydrography-products. The NHDPlusV2 snapshots were downloaded for the contiguous United States (units 01 through 18) from https://www.epa.gov/waterdata/get-nhdplus-national-hydrography-dataset-plus-data#Download.  NHDPlusHR data were downloaded for each level 4 hydrologic unit code (HUC4) in the Continuous United States using the download_nhdplushr function in the sbtools package in R.  All geospatial data was converted to a common coordinate reference system (EPSG:4326).</procdesc>
        <srcused>NHDPlusHR</srcused>
        <srcused>NHDPlusV2</srcused>
        <srcused>Hydrolakes</srcused>
        <srcused>WBD</srcused>
        <procdate>20251201</procdate>
        <proccont>
          <cntinfo>
            <cntperp>
              <cntper>Tyler V King</cntper>
              <cntorg>USGS - WATER</cntorg>
            </cntperp>
            <cntpos>Supervisory Research Hydrologist</cntpos>
            <cntaddr>
              <addrtype>mailing and physical</addrtype>
              <address>230 Collins Rd</address>
              <city>Boise</city>
              <state>Idaho</state>
              <postal>83702</postal>
              <country>US</country>
            </cntaddr>
            <cntvoice>208-387-1352</cntvoice>
            <cntemail>tvking@usgs.gov</cntemail>
          </cntinfo>
        </proccont>
      </procstep>
      <procstep>
        <procdesc>Spatial Intersection NHDPlusHR and NHDPlusV2.

For each HUC4:
1. the NHDPlusV2 dataset was subset to the watershed boundary,
2. the intersections between NHDPlusHR and NHDPlusV2 were identified using the intersect function in the Terra package in R,
3. the overlapping area between the NHDPlusHR and NHDPlusV2 polygons were computed using the expanse function in the Terra package in R,
4. the results of the NHDPlusHR to NHDPlusV2 intersection are stored in intermediate files,

Intermediate files were appended together to produce NHDPlusHR_NHDPlusV2.csv</procdesc>
        <procdate>20251201</procdate>
      </procstep>
      <procstep>
        <procdesc>Spatial Intersection NHDPlusHR and Hydrolakes.

For each HUC4:
1. the Hydrolakes dataset was subset to the watershed boundary,
2. the intersections between NHDPlusHR and Hydrolakes were identified using the intersect function in the Terra package in R,
3. the overlapping area between the NHDPlusHR and Hydrolakes polygons were computed using the expanse function in the Terra package in R,
4. the results of the NHDPlusHR to Hydrolakes intersection are stored in intermediate files,

Intermediate files were appended together to produce NHDPlusHR_Hydrolakes.csv</procdesc>
        <procdate>20251201</procdate>
      </procstep>
      <procstep>
        <procdesc>Spatial Intersection NHDPlusV2 and Hydrolakes.

For each HUC4:
1. the Hydrolakes and NHDPlusV2 dataset was subset to the watershed boundary,
2. the intersections between NHDPlusV2 and Hydrolakes were identified using the intersect function in the Terra package in R,
3. the overlapping area between the NHDPlusV2 and Hydrolakes polygons were computed using the expanse function in the Terra package in R,
4. the results of the NHDPlusV2 to Hydrolakes intersection are stored in intermediate files,

Intermediate files were appended together to produce NHDPlusV2_Hydrolakes.csv</procdesc>
        <procdate>20251201</procdate>
      </procstep>
      <procstep>
        <procdesc>Spatial Intersection NHDPlusHR, NHDPlusV2, and Hydrolakes.

For each HUC4:
1. the Hydrolakes and NHDPlusV2 dataset was subset to the watershed boundary,
2. the intersections between NHDPlusHR and NHDPlusV2 were identified using the intersect function in the Terra package in R,
3. the intersection between the intersection in step 2 and Hydrolakes were identified using the intersect function in the Terra package in R,
4. the overlapping area between the NHDPlusHR, NHDPlusV2 and Hydrolakes polygons were computed using the expanse function in the Terra package in R,
4. the results of the NHDPlusHR, NHDPlusV2 and Hydrolakes intersection are stored in intermediate files.

Intermediate files were appended together to produce NHDPlusHR_NHDPlusV2_Hydrolakes.csv</procdesc>
        <procdate>20251201</procdate>
      </procstep>
    </lineage>
  </dataqual>
  <eainfo>
    <detailed>
      <enttyp>
        <enttypl>Data_Dictionary_NHD_Waterbody_Crosswalk.csv</enttypl>
        <enttypd>Comma Separated Value (CSV) file containing data.</enttypd>
        <enttypds>Producer Defined</enttypds>
      </enttyp>
      <attr>
        <attrlabl>File</attrlabl>
        <attrdef>File in the data release</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>NHDPlusHR_Hydrolakes.csv</edomv>
            <edomvd>crosswalk between NHDPlusHR and Hydrolakes</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>NHDPlusHR_NHDPlusV2.csv</edomv>
            <edomvd>crosswalk between NHDPlusHR and NHDPlusV2</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>NHDPlusV2_Hydrolakes.csv</edomv>
            <edomvd>crosswalk between NHDPlusV2 and Hydrolakes</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>NHDPlusHR_NHDPlusV2_Hydrolakes.csv</edomv>
            <edomvd>crosswalk between NHDPlusHR, NHDPlusV2, and Hydrolakes</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>Column</attrlabl>
        <attrdef>Column in file</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>Name of the column in the datafile</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>Definition</attrlabl>
        <attrdef>Definition of the column in the datafile</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>Definition of the column in the datafile</udom>
        </attrdomv>
      </attr>
    </detailed>
    <detailed>
      <enttyp>
        <enttypl>code.zip</enttypl>
        <enttypd>Zipped file with R scripts used to produce cross-walk data files.</enttypd>
        <enttypds>Producer Defined</enttypds>
      </enttyp>
    </detailed>
    <overview>
      <eaover>Crosswalk of Lake Identifiers between versions of the National Hydrography Dataset and Hydrolakes for the Contiguous United States</eaover>
      <eadetcit>King, T.V., 2025, Crosswalk of Lake Identifiers between versions of the National Hydrography Dataset and Hydrolakes for the Contiguous United States : U.S. Geological Survey data release, https://doi.org/10.5066/P13N85SWS.</eadetcit>
    </overview>
  </eainfo>
  <distinfo>
    <distrib>
      <cntinfo>
        <cntorgp>
          <cntorg>U.S. Geological Survey</cntorg>
          <cntper>GS ScienceBase</cntper>
        </cntorgp>
        <cntaddr>
          <addrtype>mailing address</addrtype>
          <address>Denver Federal Center, Building 810, Mail Stop 302</address>
          <city>Denver</city>
          <state>CO</state>
          <postal>80225</postal>
          <country>United States</country>
        </cntaddr>
        <cntvoice>1-888-275-8747</cntvoice>
        <cntemail>sciencebase@usgs.gov</cntemail>
      </cntinfo>
    </distrib>
    <distliab>Unless otherwise stated, all data, metadata and related materials are considered to satisfy the quality standards relative to the purpose for which the data were collected. Although these data and associated metadata have been reviewed for accuracy and completeness and approved for release by the U.S. Geological Survey (USGS), no warranty expressed or implied is made regarding the display or utility of the data for other purposes, nor on all computer systems, nor shall the act of distribution constitute any such warranty.</distliab>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>Digital Data</formname>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://doi.org/10.5066/P13N85SW</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
  </distinfo>
  <metainfo>
    <metd>20260226</metd>
    <metc>
      <cntinfo>
        <cntperp>
          <cntper>Tyler V. King</cntper>
          <cntorg>USGS - WATER</cntorg>
        </cntperp>
        <cntpos>Supervisory Research Hydrologist</cntpos>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>230 Collins Rd</address>
          <city>Boise</city>
          <state>Idaho</state>
          <postal>83702</postal>
          <country>US</country>
        </cntaddr>
        <cntvoice>208-387-1352</cntvoice>
        <cntemail>tvking@usgs.gov</cntemail>
      </cntinfo>
    </metc>
    <metstdn>FGDC Content Standard for Digital Geospatial Metadata</metstdn>
    <metstdv>FGDC-STD-001-1998</metstdv>
  </metainfo>
</metadata>
