<?xml version='1.0' encoding='UTF-8'?>
<metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <idinfo>
    <citation>
      <citeinfo>
        <origin>Se Jong Cho</origin>
        <pubdate>20251119</pubdate>
        <title>3 - Statistical analyses of hydrodynamics and landscape attributes in the Delaware and Illinois River Basins</title>
        <geoform>.R, .csv.</geoform>
        <pubinfo>
          <pubplace>Baltimore, MD</pubplace>
          <publish>U.S. Geological Survey</publish>
        </pubinfo>
        <onlink>https://doi.org/10.5066/P1492CKA</onlink>
      </citeinfo>
    </citation>
    <descript>
      <abstract>3_Hysteresis_PCA_LDA is the third child item of this Data Release. Principal Component Analysis (PCA) was performed to reduce the dimensionality of streamflow event variables and to reveal the underlying patterns in hydrodynamic and turbidity characteristics in the Delaware River Basin (DRB) and Illinois River Basin (IRB). Linear Discriminant Analysis (LDA) was performed to identify the streamflow event characteristics and watershed-scale landscape feature variables that influence Hysteresis Index - Concentration Index (HI-CI) classes in each region. An annotated R script is included for each analysis. Additional information can be found in the "Entity and Attribute" section.</abstract>
      <purpose>This research aims to improve understanding of the complex sediment transport process and can lead to more informed inferences about sediment dynamics, hydrology, source, and in channel processes as well as improved predictions of sediment transport. 

Suspended sediment transport is critical to understanding future states of water quality and represents an important Integrated Water Science (IWS) basin effort. Delaware River Basin (DRB) and Illinois River Basin (IRB) are two of the IWS basins with a wide range of environmental, hydrologic, and landscape settings and human stressors of water resources. USGS stream gaging at 35 watersheds within these two regional Basins were selected to evaluate the storm events and the corresponding sediment and hydrodynamic observations.</purpose>
    </descript>
    <timeperd>
      <timeinfo>
        <rngdates>
          <begdate>2007</begdate>
          <enddate>2023</enddate>
        </rngdates>
      </timeinfo>
      <current>ground condition</current>
    </timeperd>
    <status>
      <progress>Complete</progress>
      <update>None planned</update>
    </status>
    <spdom>
      <bounding>
        <westbc>-92.3730</westbc>
        <eastbc>-71.5430</eastbc>
        <northbc>44.6530</northbc>
        <southbc>36.3151</southbc>
      </bounding>
    </spdom>
    <keywords>
      <theme>
        <themekt>ISO 19115 Topic Category</themekt>
        <themekey>inlandWaters</themekey>
        <themekey>climatologyMeteorologyAtmosphere</themekey>
        <themekey>geoscientificInformation</themekey>
        <themekey>environment</themekey>
      </theme>
      <theme>
        <themekt>None</themekt>
        <themekey>Machine Learning</themekey>
        <themekey>Daymet</themekey>
        <themekey>Principal Component Analysis</themekey>
        <themekey>Linear Discriminant Analysis</themekey>
      </theme>
      <theme>
        <themekt>USGS Thesaurus</themekt>
        <themekey>sediment transport</themekey>
        <themekey>turbidity</themekey>
        <themekey>datasets</themekey>
        <themekey>streamflow</themekey>
      </theme>
      <theme>
        <themekt>USGS Metadata Identifier</themekt>
        <themekey>USGS:6827b468d4be02693eeabdf7</themekey>
      </theme>
      <place>
        <placekt>Common geographic areas</placekt>
        <placekey>Illinois</placekey>
        <placekey>Indiana</placekey>
        <placekey>New York</placekey>
        <placekey>New Jersey</placekey>
        <placekey>Pennsylvania</placekey>
        <placekey>Delaware</placekey>
      </place>
    </keywords>
    <accconst>No access constraints. Please see 'Distribution Information' for details.</accconst>
    <useconst>These data are marked with a Creative Common CC0 1.0 Universal License. These data are in the public domain and do not have any use constraints. Users are advised to read the dataset's metadata thoroughly to understand appropriate use and data limitations.</useconst>
    <ptcontac>
      <cntinfo>
        <cntperp>
          <cntper>Se Jong Cho</cntper>
          <cntorg>USGS - WATER</cntorg>
        </cntperp>
        <cntpos>Research Hydrologist</cntpos>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>5522 Research Park Drive</address>
          <city>Baltimore</city>
          <state>MD</state>
          <postal>21228</postal>
        </cntaddr>
        <cntvoice>703-648-5714</cntvoice>
        <cntemail>scho@usgs.gov</cntemail>
      </cntinfo>
    </ptcontac>
  </idinfo>
  <dataqual>
    <attracc>
      <attraccr>Verified that attribute values were within expected bounds.</attraccr>
    </attracc>
    <logic>No formal logical accuracy tests were conducted</logic>
    <complete>Data set is considered complete for the information presented, as described in the abstract. Users are advised to read the rest of the metadata record carefully for additional details.</complete>
    <posacc>
      <horizpa>
        <horizpar>A formal accuracy assessment of the horizontal positional information in the data set has not been conducted.</horizpar>
      </horizpa>
      <vertacc>
        <vertaccr>A formal accuracy assessment of the vertical positional information in the data set has either not been conducted, or is not applicable.</vertaccr>
      </vertacc>
    </posacc>
    <lineage>
      <procstep>
        <procdesc>Processing described for each dataset in the Entity and Attribute section</procdesc>
        <procdate>2025</procdate>
      </procstep>
    </lineage>
  </dataqual>
  <eainfo>
    <detailed>
      <enttyp>
        <enttypl>3_Hysteresis_PCA_LDA\PCA\PCA_HICI_temporal_vars.R</enttypl>
        <enttypd>R script to conduct Principal Component Analysis using hydrodynamic temporal variables. 

Detailed processing steps are noted in the R script PCA_HICI_temporal_vars.R.

The script imports storm event variables and executes Principal Component Analysis (PCA) iterations using relevant variables
1. Intake event data output from Step 2 of data release and define initial PCA dataframe (Tdf)
2. Execute initial PCA with dataframe Tdf
3. Execute PCA with filtered dataframe Tdf1
4. Execute PCA with filter dataframe TDf2 (this is the final PCA reported in the publication)
5. Execute PCA for individual 5 HICI classes: PosHCI, PosLCI, NegHCI, NegLCI, and Lin</enttypd>
        <enttypds>U.S. Geological Survey</enttypds>
      </enttyp>
    </detailed>
    <detailed>
      <enttyp>
        <enttypl>3_Hysteresis_PCA_LDA\LDA\LDA_HICI_temporalspatial_vars.R</enttypl>
        <enttypd>R script to conduct Linear Discriminant Analysis using hydrodynamic variables and landscape attributes. 

Detailed processing steps are noted in the R script LDA_HICI_temporalspatial_vars.R

The script imports storm event variables and executes Linear Discriminant Analysis (LDA) iterations using relevant variables
1. Intake event data output from Step 2 of data release and define initial LDA dataframe (TSdf)
2. Site selection: conduct LDA in DRB and IRB
3. Conduct hierarchical cluster analysis
4. Conduct perMANOVA test
5. Execute stepwise LDA feature selection
6. Execute LDA predictive model</enttypd>
        <enttypds>U.S. Geological Survey</enttypds>
      </enttyp>
    </detailed>
    <detailed>
      <enttyp>
        <enttypl>3_Hysteresis_PCA_LDA\PCA_LDA_shortnames.csv</enttypl>
        <enttypd>Comma Separated Value (CSV) file containing the cross walk table for column names defined in (2_EventSeparation_HysteresisIndex\1_HydRun_Millar_cQ_functions\HI_EventSummary_FeatureDictionary.csv) to shortened column names used in child item 3_Hysteresis_PCA_LDA.</enttypd>
        <enttypds>U.S. Geological Survey</enttypds>
      </enttyp>
      <attr>
        <attrlabl>col_names_EventSummary_final</attrlabl>
        <attrdef>Column names defined in (2_EventSeparation_HysteresisIndex\1_HydRun_Millar_cQ_functions\HI_EventSummary_FeatureDictionary.csv)</attrdef>
        <attrdefs>U.S. Geological Survey</attrdefs>
        <attrdomv>
          <udom>See Column Definition</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>col_names_PCALDA</attrlabl>
        <attrdef>Shortened column names used in child item 3_Hysteresis_PCA_LDA.</attrdef>
        <attrdefs>U.S. Geological Survey</attrdefs>
        <attrdomv>
          <udom>See Column Definition</udom>
        </attrdomv>
      </attr>
    </detailed>
  </eainfo>
  <distinfo>
    <distrib>
      <cntinfo>
        <cntorgp>
          <cntorg>U.S. Geological Survey - ScienceBase</cntorg>
        </cntorgp>
        <cntaddr>
          <addrtype>mailing address</addrtype>
          <address>Denver Federal Center</address>
          <address>Building 810</address>
          <address>Mail Stop 302</address>
          <city>Denver</city>
          <state>CO</state>
          <postal>80225</postal>
        </cntaddr>
        <cntvoice>1-888-275-8747</cntvoice>
        <cntemail>sciencebase@usgs.gov</cntemail>
      </cntinfo>
    </distrib>
    <distliab>Unless otherwise stated, all data, metadata and related materials are considered to satisfy the quality standards relative to the purpose for which the data were collected. Although these data and associated metadata have been reviewed for accuracy and completeness and approved for release by the U.S. Geological Survey (USGS), no warranty expressed or implied is made regarding the display or utility of the data for other purposes, nor on all computer systems, nor shall the act of distribution constitute any such warranty.</distliab>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>Digital Data</formname>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://doi.org/10.5066/P1492CKA</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
  </distinfo>
  <metainfo>
    <metd>20251119</metd>
    <metc>
      <cntinfo>
        <cntperp>
          <cntper>Se Jong Cho</cntper>
          <cntorg>USGS - WATER</cntorg>
        </cntperp>
        <cntpos>Research Hydrologist</cntpos>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>5522 Research Park Drive</address>
          <city>Baltimore</city>
          <state>MD</state>
          <postal>21228</postal>
        </cntaddr>
        <cntvoice>703-648-5714</cntvoice>
        <cntemail>scho@usgs.gov</cntemail>
      </cntinfo>
    </metc>
    <metstdn>FGDC Content Standard for Digital Geospatial Metadata</metstdn>
    <metstdv>FGDC-STD-001-1998</metstdv>
  </metainfo>
</metadata>
