U.S. flag

An official website of the United States government

icon-dot-gov

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

icon-https

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Spatial Extent of Data

ISO 19115 Topic Category

Place Keywords

Coast Train--Labeled imagery for training and evaluation of data-driven models for image segmentation

Coast Train is a library of images of coastal environments, annotations, and corresponding thematic label masks (or ‘label images’) collated for the purposes of training and evaluating machine learning (ML), deep learning, and other models for image segmentation. It includes image sets from both geospatial satellite, aerial, and UAV imagery and orthomosaics, as well as non-geospatial oblique and nadir imagery. Images include a diverse range of coastal environments from the U.S. Pacific, Gulf of Mexico, Atlantic, and Great Lakes coastlines, consisting of time-series of high-resolution (≤1m) orthomosaics and satellite image tiles (10–30m). Each image, image annotation, and labelled image is available as a single NPZ zipped file. NPZ files follow the following naming convention: {datasource}_{numberofclasses}_{threedigitdatasetversion}.zip, where {datasource} is the source of the original images (for example, NAIP, Landsat 8, Sentinel 2), {numberofclasses} is the number of classes used to annotate the images, and {threedigitdatasetversion} is the three-digit code corresponding to the dataset version (in other words, 001 is version 1). Each zipped folder contains a collection of NPZ format files, each of which corresponds to an individual image. An individual NPZ file is named after the image that it represents and contains (1) a CSV file with detail information for every image in the zip folder and (2) a collection of the following NPY files: orig_image.npy (original input image unedited), image.npy (original input image after color balancing and normalization), classes.npy (list of classes annotated and present in the labelled image), doodles.npy (integer image of all image annotations), color_doodles.npy (color image of doodles.npy), label.npy (labelled image created from the classes present in the annotations), and settings.npy (annotation and machine learning settings used to generate the labelled image from annotations). All NPZ files can be extracted using the utilities available in Doodler (Buscombe, 2022). A merged CSV file containing detail information on the complete imagery collection is available at the top level of this data release, details of which are available in the Entity and Attribute section of this metadata file.

Get Data and Metadata
Author(s) Phillipe A Wernette orcid, Daniel D Buscombe orcid, Jaycee Favela orcid, Sharon N Fitzpatrick orcid, Evan Goldstein orcid, Nicholas M Enwright orcid, Erin Dunand
Publication Date 2022-03-19
Beginning Date of Data 2008-01-01
Ending Date of Data 2020-12-31
Data Contact
DOI https://doi.org/10.5066/P91NP87I
Citation Wernette, P.A., Buscombe, D.D., Favela, J., Fitzpatrick, S.N., Goldstein, E., Enwright, N.M., and Dunand, E., 2022, Coast Train--Labeled imagery for training and evaluation of data-driven models for image segmentation: U.S. Geological Survey data release, https://doi.org/10.5066/P91NP87I.
Metadata Contact
Metadata Date 2022-05-04
Related Publication
Citations of these data No citations of these data are known at this time.
Access public
License http://www.usa.gov/publicdomain/label/1.0/
Loading...
Harvest Source: Coastal and Marine Geoscience Data System
Harvest Date: 2025-01-22T04:21:20.072Z