<?xml version='1.0' encoding='UTF-8'?>
<metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <idinfo>
    <citation>
      <citeinfo>
        <origin>Vandergast, A.G.</origin>
        <origin>Wood, D.A.</origin>
        <origin>Smith, J.G.</origin>
        <origin>Mitelberg, A.</origin>
        <pubdate>20250623</pubdate>
        <title>Campylorhynchus brunneicapillus single nucleotide polymorphism genotype data from southern California, USA</title>
        <geoform>tabular digital data (.csv and .vcf)</geoform>
        <pubinfo>
          <pubplace>ScienceBase</pubplace>
          <publish>U.S. Geological Survey data release</publish>
        </pubinfo>
        <onlink>https://doi.org/10.5066/P18WGRYU</onlink>
      </citeinfo>
    </citation>
    <descript>
      <abstract>This dataset contains 184 single nucleotide polymorphism (SNP) genotypes of Campylorhynchus brunneicapillus (Cactus Wren) sampled in southern California. Genomic markers were generated from ddRAD loci and analyzed using the Stacks v2.60 pipeline. The genotypes for all samples are provided in a VCF file with 40,707 independent loci. A companion sample data file is provided with sample names, location, and other data. These files may be opened and edited in a text editor program, such as Notepad (PC) or BBEdit (Mac). The .vcf file can be loaded into the Stacks population program to calculate genetic diversity statistics, or loaded into R, using vcfR, for further analysis.</abstract>
      <purpose>These data were collected to assess the distribution of genetic diversity within and among occurrences of cactus wren in southern California</purpose>
    </descript>
    <timeperd>
      <timeinfo>
        <rngdates>
          <begdate>20110503</begdate>
          <enddate>20210602</enddate>
        </rngdates>
      </timeinfo>
      <current>observed</current>
    </timeperd>
    <status>
      <progress>Complete</progress>
      <update>None planned</update>
    </status>
    <spdom>
      <descgeog>Southern California</descgeog>
      <bounding>
        <westbc>-119.0372</westbc>
        <eastbc>-115.0237</eastbc>
        <northbc>35.6997</northbc>
        <southbc>32.5866</southbc>
      </bounding>
    </spdom>
    <keywords>
      <theme>
        <themekt>ISO 19115 Topic Category</themekt>
        <themekey>biota</themekey>
      </theme>
      <theme>
        <themekt>USGS Thesaurus</themekt>
        <themekey>birds</themekey>
        <themekey>genotype</themekey>
        <themekey>genetic diversity</themekey>
        <themekey>genetics</themekey>
        <themekey>native species</themekey>
      </theme>
      <theme>
        <themekt>USGS Metadata Identifier</themekt>
        <themekey>USGS:683dcb8fd4be0234870fc967</themekey>
      </theme>
      <place>
        <placekt>Common Geographic Areas</placekt>
        <placekey>San Diego</placekey>
        <placekey>Orange</placekey>
        <placekey>Imperial</placekey>
        <placekey>Riverside</placekey>
        <placekey>San Bernardino</placekey>
        <placekey>Ventura</placekey>
        <placekey>Los Angeles</placekey>
        <placekey>Kern</placekey>
      </place>
    </keywords>
    <taxonomy>
      <keywtax>
        <taxonkt>Integrated Taxonomic Information System (ITIS)</taxonkt>
        <taxonkey>Campylorhynchus brunneicapillus</taxonkey>
      </keywtax>
      <taxonsys>
        <classsys>
          <classcit>
            <citeinfo>
              <origin>U.S. Geological Survey</origin>
              <pubdate>2013</pubdate>
              <title>Integrated Taxonomic Information System (ITIS)</title>
              <geoform>Online Database</geoform>
              <onlink>https://doi.org/10.5066/F7KH0KBK</onlink>
              <onlink>www.itis.gov</onlink>
            </citeinfo>
          </classcit>
        </classsys>
        <taxonpro>expert identifier</taxonpro>
      </taxonsys>
      <taxoncl>
        <taxonrn>Kingdom</taxonrn>
        <taxonrv>Animalia</taxonrv>
        <taxoncl>
          <taxonrn>Subkingdom</taxonrn>
          <taxonrv>Bilateria</taxonrv>
          <taxoncl>
            <taxonrn>Infrakingdom</taxonrn>
            <taxonrv>Deuterostomia</taxonrv>
            <taxoncl>
              <taxonrn>Phylum</taxonrn>
              <taxonrv>Chordata</taxonrv>
              <taxoncl>
                <taxonrn>Subphylum</taxonrn>
                <taxonrv>Vertebrata</taxonrv>
                <taxoncl>
                  <taxonrn>Infraphylum</taxonrn>
                  <taxonrv>Gnathostomata</taxonrv>
                  <taxoncl>
                    <taxonrn>Superclass</taxonrn>
                    <taxonrv>Tetrapoda</taxonrv>
                    <taxoncl>
                      <taxonrn>Class</taxonrn>
                      <taxonrv>Aves</taxonrv>
                      <taxoncl>
                        <taxonrn>Order</taxonrn>
                        <taxonrv>Passeriformes</taxonrv>
                        <taxoncl>
                          <taxonrn>Family</taxonrn>
                          <taxonrv>Troglodytidae</taxonrv>
                          <taxoncl>
                            <taxonrn>Genus</taxonrn>
                            <taxonrv>Campylorhynchus</taxonrv>
                            <taxoncl>
                              <taxonrn>Species</taxonrn>
                              <taxonrv>Campylorhynchus brunneicapillus</taxonrv>
                              <common>TSN: 178587</common>
                              <common>Cactus Wren</common>
                            </taxoncl>
                          </taxoncl>
                        </taxoncl>
                      </taxoncl>
                    </taxoncl>
                  </taxoncl>
                </taxoncl>
              </taxoncl>
            </taxoncl>
          </taxoncl>
        </taxoncl>
      </taxoncl>
    </taxonomy>
    <accconst>No access constraints.  Please see 'Distribution Info' for details.</accconst>
    <useconst>No use constraints. Questions pertaining to appropriate use or assistance with understanding limitations or interpretation of the data are to be directed to the individuals/organization listed in the Point of Contact section.</useconst>
    <ptcontac>
      <cntinfo>
        <cntorgp>
          <cntorg>U.S. Geological Survey, Western Ecological Research Center</cntorg>
        </cntorgp>
        <cntpos>Data Manager</cntpos>
        <cntaddr>
          <addrtype>mailing address</addrtype>
          <address>3020 State University Drive, Modoc Hall, suite 4004</address>
          <city>Sacramento</city>
          <state>CA</state>
          <postal>95819</postal>
          <country>US</country>
        </cntaddr>
        <cntvoice>279-782-0904</cntvoice>
        <cntemail>gs-b-werc_data_management@usgs.gov</cntemail>
      </cntinfo>
    </ptcontac>
    <tool>
      <tooldesc>Stacks 2.60 was used to analyze genomic markers for these data. Stacks is a software pipeline for building loci from short-read sequences, such as those generated on the Illumina platform. Stacks was developed to work with restriction enzyme-based data, such as RAD-seq, for the purpose of building genetic maps and conducting population genomics and phylogeography.</tooldesc>
      <toolacc>
        <onlink>https://catchenlab.life.illinois.edu/stacks/</onlink>
        <toolinst>Stacks is implemented in C++ with wrapper programs written in Perl. Stacks should build on any standard UNIX-like environment (Apple OS X, Linux, etc.) Stacks is an independent pipeline and can be run without any additional external software. Stacks uses a standard autotools installation script. Visit https://catchenlab.life.illinois.edu/stacks/manual/#install to view the standard autotools installation script. Visit https://catchenlab.life.illinois.edu/stacks to download the Stacks package. View the INSTALL file within the package for specific installation instructions.</toolinst>
      </toolacc>
      <toolcite>
        <citeinfo>
          <origin>Catchen, J.</origin>
          <origin>Hohenlohe, P. A.</origin>
          <origin>Bassham, S.</origin>
          <origin>Amores, A.</origin>
          <origin>Cresko, W. A.</origin>
          <pubdate>2013</pubdate>
          <title>Stacks: An analysis tool set for population genomics</title>
          <geoform>Publication</geoform>
        </citeinfo>
      </toolcite>
    </tool>
    <tool>
      <tooldesc>Raw sequence demultiplexing, quality filtering, and genotyping was performed using Stacks v2.60 on the
USGS Yeti High Performance Computing platform. Decommissioned in 2023, Yeti was the first supercomputer available to all USGS research staff.</tooldesc>
      <toolacc>
        <onlink>https://doi.org/10.5066/F7D798MJ</onlink>
        <toolinst>Hovenweep is a USGS on-premises supercomputer that replaced Yeti as the workhorse USGS supercomputer for general-purpose HPC workloads. For more information or to get started using McKinley, Tallgrass, or Hovenweep supercomputers, contact the USGS High Performance Computing team at hpc@usgs.gov.</toolinst>
      </toolacc>
      <toolcite>
        <citeinfo>
          <origin>Falgout, J.T.</origin>
          <origin>Gordon, J.</origin>
          <pubdate>2023</pubdate>
          <title>USGS Yeti Supercomputer: U.S. Geological Survey</title>
          <geoform>supercomputer</geoform>
          <serinfo>
            <sername>Molecular Ecology</sername>
            <issue>vol. 22, issue 11, pages 3124-3140</issue>
          </serinfo>
          <pubinfo>
            <pubplace>n/a</pubplace>
            <publish>Wiley</publish>
          </pubinfo>
          <onlink>https://doi.org/10.5066/F7D798MJ</onlink>
          <onlink>https://doi.org/10.1111/mec.12354</onlink>
        </citeinfo>
      </toolcite>
    </tool>
    <tool>
      <tooldesc>The .vcf file can be loaded into R using vcfR to calculate genetic diversity statistics. VcfR is an R package intended to allow easy manipulation and visualization of variant call format (VCF) data.</tooldesc>
      <toolacc>
        <onlink>https://knausb.github.io/vcfR_documentation/</onlink>
        <onlink>https://github.com/knausb/vcfR/blob/master/README.md</onlink>
        <toolinst>vcfR is available on the Comprehensive R Archive Network. For download instructions, visit https://github.com/knausb/vcfR/blob/master/README.md.</toolinst>
      </toolacc>
      <toolcite>
        <citeinfo>
          <origin>Knaus, B.J.</origin>
          <origin>Grünwald, N.J.</origin>
          <pubdate>2017</pubdate>
          <title>vcfr: a package to manipulate and visualize variant call format data in R.</title>
          <geoform>publication</geoform>
          <serinfo>
            <sername>Molecular Ecology Resources</sername>
            <issue>vol 17, issue 1, Special Issue: Population Genomics with R</issue>
          </serinfo>
          <pubinfo>
            <pubplace>n/a</pubplace>
            <publish>Wiley</publish>
          </pubinfo>
          <onlink>https://doi.org/10.1111/1755-0998.12549</onlink>
        </citeinfo>
      </toolcite>
    </tool>
    <tool>
      <tooldesc>FastQC is a quality control tool for high throughput sequence data.</tooldesc>
      <toolacc>
        <onlink>https://www.bioinformatics.babraham.ac.uk/projects/fastqc/</onlink>
        <toolinst>A Java Runtime environment is required for download. FastQC can be downloaded by visiting https://www.bioinformatics.babraham.ac.uk/projects/fastqc/</toolinst>
      </toolacc>
      <toolcite>
        <citeinfo>
          <origin>Andrews, S.</origin>
          <pubdate>2010</pubdate>
          <title>FastQC:  A Quality Control Tool for High Throughput Sequence Data</title>
          <geoform>Tool</geoform>
          <onlink>https://www.bioinformatics.babraham.ac.uk/projects/fastqc/</onlink>
        </citeinfo>
      </toolcite>
    </tool>
  </idinfo>
  <dataqual>
    <attracc>
      <attraccr>The quality control tool for high throughput sequence data, FastQC, was used to assess the attribute quality of data and remove samples that had greater than 30 percent missing data. A minor allele frequency cut off of 5% was also applied.</attraccr>
    </attracc>
    <logic>It was verified that there were no duplicate samples and all samples included passed quality filters. A minimum of 9X coverage at a locus was required, and samples with  &gt; 30% missing data were removed.</logic>
    <complete>Data set is considered complete for the information presented, as described in the abstract. Users are advised to read the rest of the metadata record carefully for additional details.</complete>
    <posacc>
      <horizpa>
        <horizpar>No formal positional accuracy tests were conducted. Handheld GPS were used to collect location data.</horizpar>
      </horizpa>
    </posacc>
    <lineage>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Paris, J.R.</origin>
            <origin>Stevens, J.R.</origin>
            <origin>Catchen. J.M.</origin>
            <pubdate>2017</pubdate>
            <title>Lost in parameter space: a road map for stacks</title>
            <geoform>publication</geoform>
            <serinfo>
              <sername>Methods in Ecology and Evolution</sername>
              <issue>vol. 8, issue 10</issue>
            </serinfo>
            <pubinfo>
              <pubplace>n/a</pubplace>
              <publish>British Ecological Society</publish>
            </pubinfo>
            <onlink>https://doi.org/10.1111/2041-210X.12775</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2017</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication</srccurr>
        </srctime>
        <srccitea>Stacks parameters</srccitea>
        <srccontr>Clustering, assembly, and filtering parameters were optimized using a subset of individuals following the r80 method by Paris et al.</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Catchen. J.M.</origin>
            <origin>Hohenlohe, P.A.</origin>
            <origin>Bassham, S.</origin>
            <origin>Amores, A.</origin>
            <origin>Cresko, W.A.</origin>
            <pubdate>2013</pubdate>
            <title>Stacks: an analysis tool set for population genomics</title>
            <geoform>publication</geoform>
            <serinfo>
              <sername>Molecular Ecology</sername>
              <issue>vol. 22, issue 11, Special Issue: Genotyping by Sequencing in Ecological and Conservation Genomics</issue>
            </serinfo>
            <pubinfo>
              <pubplace>n/a</pubplace>
              <publish>Wiley</publish>
            </pubinfo>
            <onlink>https://doi.org/10.1111/mec.12354</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2013</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication</srccurr>
        </srctime>
        <srccitea>Population genomics functions</srccitea>
        <srccontr>The following final parameters were used to create a locus catalog (cstacks) for the full Stacks genotyping pipeline: minimum number of raw reads required to form a stack (putative allele), m = 3; maximum number of mismatches allowed when matching stacks (putative alleles within loci) within samples, M = 2; maximum number of mismatches between stacks (putative loci) of the samples and the catalog of loci among all samples, n = 1; minimum percentage of individuals across populations required to process a locus, R = 80</srccontr>
      </srcinfo>
      <srcinfo>
        <srccite>
          <citeinfo>
            <origin>Peterson, B.K.</origin>
            <origin>Weber, J.N.</origin>
            <origin>Kay, E.H.</origin>
            <origin>Fisher, H.S.</origin>
            <origin>Hoekstra, H.E.</origin>
            <pubdate>2012</pubdate>
            <title>Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species</title>
            <geoform>publication</geoform>
            <serinfo>
              <sername>PLoS One</sername>
              <issue>7(5):e37135</issue>
            </serinfo>
            <pubinfo>
              <pubplace>n/a</pubplace>
              <publish>PLoS One</publish>
            </pubinfo>
            <onlink>https://doi.org/10.1371/journal.pone.0037135</onlink>
          </citeinfo>
        </srccite>
        <typesrc>Digital and/or Hardcopy</typesrc>
        <srctime>
          <timeinfo>
            <sngdate>
              <caldate>2012</caldate>
            </sngdate>
          </timeinfo>
          <srccurr>publication</srccurr>
        </srctime>
        <srccitea>ddRAD</srccitea>
        <srccontr>The double-digest restriction-associated DNA (ddRAD) sequencing protocol developed in Peterson et al. was followed when extracting genomic DNA.</srccontr>
      </srcinfo>
      <procstep>
        <procdesc>Genetic blood samples were collected from Cactus Wrens at California desert sites.</procdesc>
        <procdate>Unknown</procdate>
      </procstep>
      <procstep>
        <procdesc>Genomic DNA was extracted from blood cards using the gentra purgene extraction kit and following the manufacturer's protocol. We followed the double-digest restriction-associated DNA (ddRAD) sequencing protocol developed in Peterson et al. (2012) and used restriction enzymes EcoRI and MseI and size-selected for 250-400 base pair fragments before being pooled for sequencing (150 bp paired-end reads) on Illumina HiSeqX and NovaSeq platforms at MedGenome, Inc. (Foster City, CA).</procdesc>
        <srcused>ddRAD</srcused>
        <procdate>2022</procdate>
        <proccont>
          <cntinfo>
            <cntperp>
              <cntper>Dustin Wood</cntper>
              <cntorg>U.S. Geological Survey, Western Ecological Research Center</cntorg>
            </cntperp>
            <cntpos>Geneticist</cntpos>
            <cntaddr>
              <addrtype>mailing and physical</addrtype>
              <address>4165 Spruance Road, suite 200</address>
              <city>San Diego</city>
              <state>CA</state>
              <postal>92101</postal>
              <country>USA</country>
            </cntaddr>
            <cntvoice>619-225-6432</cntvoice>
            <cntemail>dawood@usgs.gov</cntemail>
          </cntinfo>
        </proccont>
      </procstep>
      <procstep>
        <procdesc>Development of Campylorhynchus brunneicapillus genomic marker dataset from ddRADseq library prep: Raw sequence demultiplexing, quality filtering, and genotyping was performed using Stacks v2.60 on the USGS Yeti High Performance Computing platform. Clustering, assembly, and filtering parameters were optimized using a subset of individuals following the r80 method by Paris et al (2017). This subset included 70 individuals that were distributed across collection locations and with read coverages that fell within one standard deviation of overall mean retained reads per sample. This subset was also used to create a locus catalog (cstacks) for the full Stacks genotyping pipeline. The following final parameters were used: minimum number of raw reads required to form a stack (putative allele), m = 3; maximum number of mismatches allowed when matching stacks (putative alleles within loci) within samples, M = 2; maximum number of mismatches between stacks (putative loci) of the samples and the catalog of loci among all samples, n = 1; minimum percentage of individuals across populations required to process a locus, R = 80 (Catchen et al. 2013). We further filtered the number of samples and loci by requiring a minimum depth of coverage of 9x and less than 30% missing data per sample. One single nucleotide polymorphism (SNP) was randomly chosen from each locus and exported in a varianct call format (VCF) file.</procdesc>
        <srcused>Stacks parameters</srcused>
        <srcused>Population genomics functions</srcused>
        <procdate>2022</procdate>
      </procstep>
    </lineage>
  </dataqual>
  <spref>
    <horizsys>
      <geograph>
        <latres>0.0197727549</latres>
        <longres>0.0237726774</longres>
        <geogunit>Decimal seconds</geogunit>
      </geograph>
      <geodetic>
        <horizdn>WGS_1984</horizdn>
        <ellips>WGS 84</ellips>
        <semiaxis>6378137.0</semiaxis>
        <denflat>298.257223563</denflat>
      </geodetic>
    </horizsys>
  </spref>
  <eainfo>
    <detailed>
      <enttyp>
        <enttypl>CACW_SNP_sampledata.csv</enttypl>
        <enttypd>Comma Separated Value (CSV) file containing data.</enttypd>
        <enttypds>Producer Defined</enttypds>
      </enttyp>
      <attr>
        <attrlabl>sample</attrlabl>
        <attrdef>genetic sample unique identifier</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>alpha numeric codes</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>pop_code</attrlabl>
        <attrdef>sampling location code</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>three letter site code</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>vcf_order</attrlabl>
        <attrdef>order of samples in the .vcf file</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>187</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>site_name</attrlabl>
        <attrdef>full name of site</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <udom>researcher designated site names for collection locations</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>latitude</attrlabl>
        <attrdef>latitude in decimal degrees, WGS 84 datum</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>32.58655</rdommin>
            <rdommax>35.69969</rdommax>
            <attrunit>decimal degrees</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>longitude</attrlabl>
        <attrdef>longitude in decimal degrees, WGS 84 datum</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>-119.0372</rdommin>
            <rdommax>-115.02366</rdommax>
            <attrunit>decimal degrees</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>collection_date</attrlabl>
        <attrdef>dates birds were sampled in the field</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>na</edomv>
            <edomvd>no date collected</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <udom>collection dates formatted month/day/year</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>field sample number</attrlabl>
        <attrdef>sample code or bird band number given to bird sampled.</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>na</edomv>
            <edomvd>not available</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <udom>numeric bird bands (USGS bird band lab); or alpha numeric code given to individual birds at the time of field sampling</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>clus1</attrlabl>
        <attrdef>proportion of genetic assignment to cluster 1</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.00001</rdommin>
            <rdommax>0.99998</rdommax>
            <attrunit>none</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>clus2</attrlabl>
        <attrdef>proportion of genetic assignment to cluster 2</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.00001</rdommin>
            <rdommax>0.99998</rdommax>
            <attrunit>none</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>clus3</attrlabl>
        <attrdef>proportion of genetic assignment to cluster 2</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.00001</rdommin>
            <rdommax>0.968875</rdommax>
            <attrunit>none</attrunit>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>assignment</attrlabl>
        <attrdef>cluster assignment</attrdef>
        <attrdefs>Producer Defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>pop2</edomv>
            <edomvd>majority of assignment to pop2</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>pop1</edomv>
            <edomvd>majority of assignment to pop 1</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>pop3</edomv>
            <edomvd>majority of assignment to pop 3</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
    </detailed>
    <detailed>
      <enttyp>
        <enttypl>CACW_populations.snps.vcf</enttypl>
        <enttypd>This is a tab delimited text file in the variant call format (VCF). The first 15 rows are headers and contain descriptional metadata only. Subsequent rows contain the genetic data - each row is a unique single nucleotide polymorphism. These data can be used to answer evolutionary questions about the species.</enttypd>
        <enttypds>Producer defined</enttypds>
      </enttyp>
      <attr>
        <attrlabl>CHROM</attrlabl>
        <attrdef>An unique identifier from the reference genome</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>10</rdommin>
            <rdommax>441387</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>POS</attrlabl>
        <attrdef>Position of the SNP in the reference sequence CHROM. Postions are sorted numerically.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>6</rdommin>
            <rdommax>151</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ID</attrlabl>
        <attrdef>Semicolon-separated list of unique identifiers within the RAD locus</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <udom>10:14 to 441387:114</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>REF</attrlabl>
        <attrdef>Values represent nucleobases that compose the units of DNA. The value in this column is the reference single nucleotide polymorphism.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>G</edomv>
            <edomvd>guanine</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>C</edomv>
            <edomvd>cytosine</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>T</edomv>
            <edomvd>thymine</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>A</edomv>
            <edomvd>adenine</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ALT</attrlabl>
        <attrdef>Values represent nucleobases that compose the units of DNA. The value is this column is the alternative non-reference single nucleotide polymorphism.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>T</edomv>
            <edomvd>thymine</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>A</edomv>
            <edomvd>adenine</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>C</edomv>
            <edomvd>cytosine</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>G</edomv>
            <edomvd>guanine</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>QUAL</attrlabl>
        <attrdef>Value of '.' denotes that locus passed internal quality filters. All value are the same (i.e., all values passed quality control).</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>.</edomv>
            <edomvd>passed internal quality filters</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>FILTER</attrlabl>
        <attrdef>Value is the verbal determination of the quality filtering (i.e., PASS means the locus passed quality filtering).</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>PASS</edomv>
            <edomvd>passed quality filtering</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>INFO</attrlabl>
        <attrdef>NS equals the number of individuals for which data is reported; AF equals the allele frequency of the alternative non-reference allele (see ALT)</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <udom>NS equals 148; AF equals 0.051</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>FORMAT</attrlabl>
        <attrdef>This value denotes the format of the data in the following columns, specifically Genotype:Total Read Depth:Depth of each allele:Genotype Quality:Genotype Likelihood for the biallelic data [AA,AB,BB]</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>GT:DP:AD:GQ:GL</edomv>
            <edomvd>Genotype:Total Read Depth:Depth of each allele:Genotype Quality:Genotype Likelihood</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ABS08_r to WEL04</attrlabl>
        <attrdef>These are the actual data, in the format specified in the previous column (i.e., GT:DP:AD:GQ:GL). Values correspond to the individual names in row 15.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <udom>An example of the encoding is:
0/1:51:15,36:40:-120.01,-0.00,-41.81. This individual has 2 alleles (0 and 1) representing REF and ALT, a total read depth of 51 with 15 reads of the REF allele and 36 reads of the ALT allele, a phred score of 40 and likelihoods of -120.01, 0, and -41.81 for the different possible genotypes (00, 01, 11).</udom>
        </attrdomv>
      </attr>
    </detailed>
  </eainfo>
  <distinfo>
    <distrib>
      <cntinfo>
        <cntorgp>
          <cntorg>U.S. Geological Survey - ScienceBase</cntorg>
        </cntorgp>
        <cntaddr>
          <addrtype>mailing address</addrtype>
          <address>Denver Federal Center, Building 810, Mail Stop 302</address>
          <city>Denver</city>
          <state>CO</state>
          <postal>80225</postal>
          <country>United States</country>
        </cntaddr>
        <cntvoice>1-888-275-8747</cntvoice>
        <cntemail>sciencebase@usgs.gov</cntemail>
      </cntinfo>
    </distrib>
    <distliab>Unless otherwise stated, all data, metadata and related materials are considered to satisfy the quality standards relative to the purpose for which the data were collected. Although these data and associated metadata have been reviewed for accuracy and completeness and approved for release by the U.S. Geological Survey (USGS), no warranty expressed or implied is made regarding the display or utility of the data for other purposes, nor on all computer systems, nor shall the act of distribution constitute any such warranty.</distliab>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>Digital Data</formname>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://doi.org/10.5066/P18WGRYU</networkr>
              </networka>
            </computer>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None</fees>
    </stdorder>
  </distinfo>
  <metainfo>
    <metd>20250623</metd>
    <metc>
      <cntinfo>
        <cntperp>
          <cntper>Amy Vandergast</cntper>
          <cntorg>U.S. Geological Survey, Western Ecological Research Center</cntorg>
        </cntperp>
        <cntpos>Research Geneticist</cntpos>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>4165 Spruance Road, Suite 200</address>
          <city>San Diego</city>
          <state>CA</state>
          <postal>92101</postal>
          <country>USA</country>
        </cntaddr>
        <cntvoice>619-225-6445</cntvoice>
        <cntemail>avandergast@usgs.gov</cntemail>
      </cntinfo>
    </metc>
    <metstdn>FGDC Biological Data Profile of the Content Standard for Digital Geospatial Metadata</metstdn>
    <metstdv>FGDC-STD-001.1-1999</metstdv>
  </metainfo>
</metadata>
