Data Package Metadata

LAGOS-US RESERVOIR: Data module classifying conterminous U.S. lakes 4 hectares and larger as natural lakes or reservoirs

General Information
Data Package:
Local Identifier:edi.804.2
Title:LAGOS-US RESERVOIR: Data module classifying conterminous U.S. lakes 4 hectares and larger as natural lakes or reservoirs
Alternate Identifier:DOI PLACE HOLDER
Abstract:

The LAGOS-US RESERVOIR data module (hereafter, RESERVOIR) classifies all 137,465 lakes ≥ 4 hectares in the conterminous U.S. into one of three categories, using a machine-learning predictive model based on visual interpretation of lake outlines together with a classification rule based on lake shape. Natural Lakes (NLs) are lakes that are likely to be entirely or mostly naturally formed and that do not have large, flow-altering structures on or near them. Reservoir Class A lakes (RSVR_As) are lakes that are likely to be either human-made or highly human-altered by the presence of a relatively large water control structure that appears to significantly change the flow of water. Reservoir Class B lakes (RSVR_Bs) are lakes that are likely to be entirely human-made, based on isolation from rivers and a highly angular shape that is rarely, if ever, seen in natural lakes. We trained the machine-learning models on 12,162 manually classified lakes to assign each lake a probability of belonging to one of two categories (NL or RSVR); we then split the RSVR class into A or B based on NHD FCodes, isolation, and angularity. The data module includes a detailed User Guide, metadata tables, and a data table that includes information such as location, lake geometry, surface water connectivity class, and official name. Under our definition, the classification indicates that over 46% of lakes ≥ 4 ha in the conterminous U.S. are reservoir lakes. These data can be combined with other LAGOS-US data modules and U.S. national databases using unique lake identifiers to study both reservoir lakes and natural lakes at broad scales.
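
The reservoir share quoted in the abstract can in principle be recomputed from the lake_rsvr_class column. The following is a minimal sketch of that arithmetic, not the authors' procedure; the function name `reservoir_share` is hypothetical, and it assumes the class labels are exactly NL, RSVR_A, and RSVR_B as listed in the column's enumerated domain.

```python
from collections import Counter


def reservoir_share(class_labels):
    """Return the fraction of labels that are RSVR_A or RSVR_B."""
    counts = Counter(class_labels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no labels supplied")
    # Counter returns 0 for labels that never occur.
    return (counts["RSVR_A"] + counts["RSVR_B"]) / total


# Tiny illustrative input, not real data from the module.
share = reservoir_share(["NL", "RSVR_A", "RSVR_B", "NL"])
```

Applied to the full lake_rsvr_class column, a result above 0.46 would be consistent with the abstract's statement.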

Publication Date:2022-11-23
For more information:
Visit: DOI PLACE HOLDER

Time Period
Begin:
2018
End:
2020

People and Organizations
Contact:Hanly, Patrick J (Michigan State University, Academic Specialist)
Creator:Polus, Sam M (Michigan State University)
Creator:Hanly, Patrick J (Michigan State University)
Creator:Rodriguez, Lauren K (Michigan State University)
Creator:Wang, Qi (Michigan State University)
Creator:Díaz Vázquez, Jessica (Michigan State University)
Creator:Webster, Katherine E (Michigan State University)
Creator:Tan, Pang-Ning (Michigan State University)
Creator:Zhou, Jiayu (Michigan State University)
Creator:Danila, Laura (Michigan State University)
Creator:Soranno, Patricia A (Michigan State University)
Creator:Cheruvelil, Kendra Spence (Michigan State University)
Associate:Infante, Dana (Michigan State University, Advisory)
Associate:Cooper, Arthur (Michigan State University, Advisory)
Associate:Boudreau, Claire (Michigan State University, REU summer 2018)
Associate:Shoffner, Allie (Michigan State University, Research Administration)
Associate:Namovich, Jake R (Michigan State University, Hourly assistant for manual classification)
Associate:Hawkins, Arika (Michigan State University, Hourly assistant for manual classification)

Data Entities
Data Table Name:
lake_reservoir
Description:
Contains the observations of the variables for LAGOS-US RESERVOIR. This module contains a single data table that can be linked with other LAGOS modules through the common LAGOS-US LOCUS identifier lagoslakeid.
Other Name:
data_dictionary_rsvr
Description:
Provides a definition for each variable name or ‘column’ of every table in the module, and includes other useful information such as units.
Other Name:
source_table_rsvr
Description:
Includes a detailed description of the data sources used to create the information and metrics in this module.
Other Name:
LAGOS-US RESERVOIR Guide
Description:
User Guide for the LAGOS-US RESERVOIR data module containing information on methods, variable descriptions, QA/QC, and validation.
Detailed Metadata

Data Entities


Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/804/2/dac3a0d7e34070639f4894ccc316cbd1
Name:lake_reservoir
Description:Contains the observations of the variables for LAGOS-US RESERVOIR. This module contains a single data table that can be linked with other LAGOS modules through the common LAGOS-US LOCUS identifier lagoslakeid.
Number of Records:137465
Number of Columns:23
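
Linking this table to another LAGOS-US module amounts to a join on lagoslakeid. The sketch below is one possible way to do that with plain Python dictionaries; the helper names and the `lake_elevation_m` column are hypothetical stand-ins for whatever the other module provides. Because the metadata lists the identifier's storage type as float, the sketch assumes an identifier may appear as either `2` or `2.0` and normalizes both to the same key.

```python
def normalize_id(value):
    """Normalize an identifier such as '2' or '2.0' to the string '2'."""
    return str(int(float(value)))


def left_join_on_lake_id(reservoir_rows, other_rows):
    """Attach columns from another table to each RESERVOIR row.

    Rows with no match in other_rows are kept unchanged.
    """
    other_by_id = {normalize_id(r["lagoslakeid"]): r for r in other_rows}
    joined = []
    for row in reservoir_rows:
        match = other_by_id.get(normalize_id(row["lagoslakeid"]), {})
        # Values from the RESERVOIR row win when both tables share a column.
        joined.append({**match, **row})
    return joined


# Illustrative rows only; the second table and its column are made up.
reservoir = [
    {"lagoslakeid": "1", "lake_rsvr_class": "NL"},
    {"lagoslakeid": "2", "lake_rsvr_class": "RSVR_A"},
]
other = [{"lagoslakeid": "2.0", "lake_elevation_m": "301.5"}]
joined = left_join_on_lake_id(reservoir, other)
```

A left join is used here so that every RESERVOIR record survives even when the other table lacks that lake.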

Table Structure
Object Name:lake_reservoir.csv
Size:21908983 byte
Authentication:f8aa57ced634b404ad83d3c63a6f8a84 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"
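
The physical format declared above (one header line, comma field delimiter, double-quote character, \r\n record delimiter, MD5 checksum) is enough to sketch a loader. The following is a minimal illustration using only the Python standard library; the function names are hypothetical, and the UTF-8 encoding is an assumption because the metadata does not state one.

```python
import csv
import hashlib

# Checksum published in this metadata record for lake_reservoir.csv.
EXPECTED_MD5 = "f8aa57ced634b404ad83d3c63a6f8a84"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def read_lake_table(path):
    """Read a comma-delimited, double-quoted CSV that has one header line."""
    # newline="" lets the csv module handle the \r\n record delimiter itself.
    # encoding="utf-8" is an assumption, not part of the metadata.
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle, delimiter=",", quotechar='"')
        return list(reader)
```

A cautious workflow would compare `md5_of_file(path)` with `EXPECTED_MD5` before calling `read_lake_table(path)`, and would expect 137465 rows of 23 columns from the full file.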

Table Column Descriptions
Column Name:lagoslakeid
Definition:unique lake identifier developed by LAGOS-US
Storage Type:float
Measurement Type:ratio
Unit:no units
Type:integer

Column Name:lake_rsvr_nidid
Definition:the NID identification code for the dam located 50 meters or less from the RSVR
Storage Type:string
Measurement Type:nominal
Missing Value Code:NULL (no NID code)

Column Name:lake_nhdid
Definition:the unique 'Permanent_identifier' from the NHD for each LAGOS lake; from LAGOSUS v1
Storage Type:string
Measurement Type:nominal

Column Name:neon_zoneid
Definition:the unique identifier assigned by LAGOS-US for zones in the spatial division National Ecological Observatory Network; from LAGOSUS v1
Storage Type:string
Measurement Type:nominal

Column Name:lake_rsvr_model_class
Definition:the classification of a lake into either RSVR or NL from the machine learning procedure
Storage Type:string
Measurement Type:nominal
Allowed Values (Enumerated Domain):
  NL: natural lake
  RSVR: reservoir

Column Name:lake_lat_decdeg
Definition:latitude of centroid of the NHD lake polygon in decimal degrees; NAD83 projection; from LAGOSUS v1
Storage Type:float
Measurement Type:ratio
Unit:decimal degrees
Type:real

Column Name:lake_lon_decdeg
Definition:longitude of centroid of the NHD lake polygon in decimal degrees; NAD83 projection; from LAGOSUS v1
Storage Type:float
Measurement Type:ratio
Unit:decimal degrees
Type:real

Column Name:lake_connectivity_class
Definition:maximum hydrologic connectivity class of the focal lake determined from the NHD network considering both permanent and intermittent-ephemeral flow; from LAGOSUS v1
Storage Type:string
Measurement Type:nominal
Allowed Values (Enumerated Domain):
  nan: no value
  Drainage: all lakes that do not meet one of the prior three criteria; traces either in both directions or only the upstream trace has network connectivity
  DrainageLk: traces either in both directions or only upstream have/has network connectivity, and the upstream trace contains the identifier of one or more lakes greater than 10 hectares (as defined by the NHDNetwork class)
  Headwater: only the downstream trace contains network connectivity
  Isolated: traces in both directions were empty (no network connectivity)
  Terminal: only the upstream trace contains network connectivity
  TerminalLk: only the upstream trace contains network connectivity, and the upstream trace contains the identifier of one or more lakes greater than 10 hectares (as defined by the NHDNetwork class)
Missing Value Code:NULL (no assigned connectivity class)

Column Name:lake_rsvr_probnl
Definition:the model-generated probability that the correct classification of a waterbody is a natural lake
Storage Type:float
Measurement Type:ratio
Unit:no units
Type:real

Column Name:lake_rsvr_probrsvr
Definition:the model-generated probability that the correct classification of a waterbody is as a reservoir
Storage Type:float
Measurement Type:ratio
Unit:no units
Type:real

Column Name:lake_rsvr_probdiff
Definition:the difference between the model assigned probabilities; calculated as lake_rsvr_probnl minus lake_rsvr_probrsvr
Storage Type:float
Measurement Type:ratio
Unit:no units
Type:real

Column Name:lake_rsvr_model
Definition:denotes the model used in lake prediction; values are either NE or US; value is NULL for manually classified lakes
Storage Type:string
Measurement Type:nominal
Allowed Values (Enumerated Domain):
  nan: manually classified lakes
  NE: Northeast prediction model was used
  US: U.S. prediction model was used
Missing Value Code:NULL (manually classified lakes)

Column Name:lake_rsvr_nlneardam_flag
Definition:flag indicating that the lake classified as NL is located less than 50 meters (in 3 dimensional space) from a dam in the NID
Storage Type:string
Measurement Type:nominal
Allowed Values (Enumerated Domain):
  N: No flag
  Y: Yes flag

Column Name:lake_rsvr_rsvrisolated_flag
Definition:flag indicating that the lake classified as RSVR has a connectivity classification of isolated
Storage Type:string
Measurement Type:nominal
Allowed Values (Enumerated Domain):
  N: No flag
  Y: Yes flag

Column Name:lake_rsvr_classmethod
Definition:denotes whether a lake_rsvr_class prediction was assigned manually or via a model prediction
Storage Type:string
Measurement Type:nominal
Allowed Values (Enumerated Domain):
  manual: Lake was manually classified
  predicted: Lake was classified using the prediction model

Column Name:lake_centroidstate
Definition:two-letter postal abbreviation of the state containing the lake centroid; from LAGOSUS v1
Storage Type:string
Measurement Type:nominal

Column Name:lake_namelagos
Definition:lake name from a combination of data sources; examples are GNIS, WQP, etc.; from LAGOSUS v1
Storage Type:string
Measurement Type:nominal
Missing Value Code:NULL (No lake name available)

Column Name:lake_shorelinedevfactor
Definition:shoreline development factor; calculated as lake_perimeter_m divided by the product of 2 times the square root of pi times lake_waterarea_ha; from LAGOSUS v1
Storage Type:float
Measurement Type:ratio
Unit:no units
Type:real

Column Name:lake_rsvr_nidlat_decdeg
Definition:latitude of the dam from the National Inventory of Dams (NID)
Storage Type:float
Measurement Type:ratio
Unit:decimal degrees
Type:real
Missing Value Code:NULL (No dam)

Column Name:lake_rsvr_nidlon_decdeg
Definition:longitude of the dam from the NID
Storage Type:float
Measurement Type:ratio
Unit:decimal degrees
Type:real
Missing Value Code:NULL (no dam)

Column Name:lake_shape
Definition:flag indicating lake polygon shape is excessively angular (e.g., triangle, rectangle) or elongate (very thin relative to length); may indicate the lake is not natural in origin (angular) or is more riverine (elongate)
Storage Type:string
Measurement Type:nominal
Allowed Values (Enumerated Domain):
  angular: angular shape
  elongate: elongate shape
  noflag: no shape flag

Column Name:FType
Definition:NHD FType classification of the water body
Storage Type:string
Measurement Type:nominal
Allowed Values (Enumerated Domain):
  390: lake/pond NHD code
  436: reservoir NHD code

Column Name:lake_rsvr_class
Definition:the overall classification of water bodies into natural lakes (NL), reservoir type A (RSVR_A), and reservoir type B (RSVR_B)
Storage Type:string
Measurement Type:nominal
Allowed Values (Enumerated Domain):
  NL: A lake that is likely to be entirely or mostly naturally formed and that does not have a relatively large, flow-altering structure on it or near it based on visual interpretation of imagery. Such lakes may have small human-made water-control structures that appear to be physically small relative to the size of the lake shoreline, or that are downstream of the lake and so are assumed to have minimal impacts on the lake, such as those that can influence water levels only.
  RSVR_A: Reservoir Type A: A lake that is likely to be either human-made or highly human-altered by the presence of a relatively large water control structure that appears to significantly change the flow of water based on a machine-learning model prediction with lake outlines as model input.
  RSVR_B: Reservoir Type B: A lake that is likely to be entirely human-made based on a highly angular shape that is rarely, if ever, seen in natural lakes.

Accuracy Report:                                              
Accuracy Assessment:                                              
Coverage:                                              
Methods:                                              
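
Two of the column definitions above are formulas, and several columns use the missing value code NULL or the enumerated code nan. The sketch below shows one way to parse such fields and recompute the derived values. It is an illustration under stated assumptions, not the LAGOS-US implementation: the function names are hypothetical, the shoreline development factor is read as perimeter / (2 * sqrt(pi * area)), and the area is converted from hectares to square metres (1 ha = 10,000 m^2) so that the ratio is dimensionless.

```python
import math

# Codes that the metadata uses for absent values.
MISSING_CODES = {"NULL", "nan"}


def parse_float(text):
    """Convert a CSV field to float, mapping missing-value codes to None."""
    return None if text in MISSING_CODES else float(text)


def prob_diff(prob_nl, prob_rsvr):
    """lake_rsvr_probdiff: lake_rsvr_probnl minus lake_rsvr_probrsvr."""
    return prob_nl - prob_rsvr


def shoreline_dev_factor(perimeter_m, area_ha):
    """Perimeter divided by the circumference of a circle of equal area.

    Assumption: area is converted from hectares to square metres so the
    result is unitless; a perfect circle then scores exactly 1.
    """
    area_m2 = area_ha * 10_000.0
    return perimeter_m / (2.0 * math.sqrt(math.pi * area_m2))
```

Under this reading, a circular lake has a factor of 1 and more convoluted shorelines score higher, which is the usual interpretation of the metric.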

Non-Categorized Data Resource

Name:data_dictionary_rsvr
Entity Type:csv
Description:Provides a definition for each variable name or ‘column’ of every table in the module, and includes other useful information such as units.
Physical Structure Description:
Object Name:data_dictionary_rsvr.csv
Size:6157 byte
Authentication:2d044fbcfc13b3c17e6a3ac0f2bb74c9 Calculated By MD5
Externally Defined Format:
Format Name:csv
Data:https://pasta-s.lternet.edu/package/data/eml/edi/804/2/ee7a7c1d8abeaf3b1f80ea5be53f5f55

Non-Categorized Data Resource

Name:source_table_rsvr
Entity Type:csv
Description:Includes a detailed description of the data sources used to create the information and metrics in this module.
Physical Structure Description:
Object Name:source_table_rsvr.csv
Size:4009 byte
Authentication:03651e588841eafec4e24af77c3efaa5 Calculated By MD5
Externally Defined Format:
Format Name:csv
Data:https://pasta-s.lternet.edu/package/data/eml/edi/804/2/96ffa5319174a8c6da21b8bfd8422970

Non-Categorized Data Resource

Name:LAGOS-US RESERVOIR Guide
Entity Type:docx
Description:User Guide for the LAGOS-US RESERVOIR data module containing information on methods, variable descriptions, QA/QC, and validation.
Physical Structure Description:
Object Name:LAGOS-US RESERVOIR Guide.docx
Size:1265086 byte
Authentication:3d26a24e2f588a879f77de99c31c4d37 Calculated By MD5
Externally Defined Format:
Format Name:docx
Data:https://pasta-s.lternet.edu/package/data/eml/edi/804/2/9f3c02fa6ba6b22dcebde5894718a39a

Data Package Usage Rights

This information is released under the Creative Commons license - Attribution - CC BY (https://creativecommons.org/licenses/by/4.0/). The consumer of these data ("Data User" herein) is required to cite it appropriately in any publication that results from its use. The Data User should realize that these data may be actively used by others for ongoing research and that coordination may be necessary to prevent duplicate publication. The Data User is urged to contact the authors of these data if any questions about methodology or results occur. Where appropriate, the Data User is encouraged to consider collaboration or co-authorship with the authors. The Data User should realize that misinterpretation of data may occur if used out of context of the original study. While substantial efforts are made to ensure the accuracy of data and associated documentation, complete accuracy of data sets cannot be guaranteed. All data are made available "as is." The Data User should be aware, however, that data are updated periodically and it is the responsibility of the Data User to check for new versions of the data. The data authors and the repository where these data were obtained shall not be liable for damages resulting from any use or misinterpretation of the data. Thank you.

Keywords

By Thesaurus:
LTER Controlled Vocabulary:lakes, freshwater, limnology
(No thesaurus):reservoirs, dams, conterminous US, classification, ResNet18, LAGOS

Methods and Protocols

These methods, instrumentation and/or protocols apply to all data in this dataset:

Methods and protocols used in the collection of this data package
Description:

Detailed methods are found in the LAGOS-US RESERVOIR user guide:

User Guide to LAGOS-US RESERVOIR: Data module classifying conterminous U.S. lakes 4 hectares and larger as natural lakes or reservoirs. Environmental Data Initiative. https://doi.org/XXXXXX. Dataset accessed XX/XX/XXXX.

People and Organizations

Publishers:
Organization:Environmental Data Initiative
Email Address:
info@edirepository.org
Web Address:
https://edirepository.org
Id:https://ror.org/0330j0z60
Creators:
Individual: Sam M Polus
Organization:Michigan State University
Email Address:
polussam@msu.edu
Id:https://orcid.org/0000-0002-2742-1775
Individual: Patrick J Hanly
Organization:Michigan State University
Email Address:
hanlypat@msu.edu
Id:https://orcid.org/0000-0001-9435-9572
Individual: Lauren K Rodriguez
Organization:Michigan State University
Email Address:
rodri683@msu.edu
Id:https://orcid.org/0000-0002-9337-6087
Individual: Qi Wang
Organization:Michigan State University
Email Address:
wangqi19@msu.edu
Id:https://orcid.org/0000-0002-0713-2677
Individual: Jessica Díaz Vázquez
Organization:Michigan State University
Email Address:
jessica.dv405@gmail.com
Id:https://orcid.org/0000-0001-8493-4035
Individual: Katherine E Webster
Organization:Michigan State University
Email Address:
katherine.e.webster@gmail.com
Id:https://orcid.org/0000-0002-6009-0146
Individual: Pang-Ning Tan
Organization:Michigan State University
Email Address:
ptan@msu.edu
Id:https://orcid.org/0000-0003-3205-0339
Individual: Jiayu Zhou
Organization:Michigan State University
Email Address:
jiayuz@msu.edu
Id:https://orcid.org/0000-0003-4336-6777
Individual: Laura Danila
Organization:Michigan State University
Individual: Patricia A Soranno
Organization:Michigan State University
Email Address:
soranno@msu.edu
Id:https://orcid.org/0000-0003-1668-9271
Individual: Kendra Spence Cheruvelil
Organization:Michigan State University
Email Address:
ksc@msu.edu
Id:https://orcid.org/0000-0003-1880-2880
Contacts:
Individual: Patrick J Hanly
Organization:Michigan State University
Position:Academic Specialist
Email Address:
hanlypat@msu.edu
Id:https://orcid.org/0000-0001-9435-9572
Associated Parties:
Individual: Dana Infante
Organization:Michigan State University
Email Address:
infanted@msu.edu
Role:Advisory
Individual: Arthur Cooper
Organization:Michigan State University
Email Address:
coopera6@msu.edu
Role:Advisory
Individual: Claire Boudreau
Organization:Michigan State University
Email Address:
boudre32@msu.edu
Role:REU summer 2018
Individual: Allie Shoffner
Organization:Michigan State University
Email Address:
shoffne1@msu.edu
Role:Research Administration
Individual: Jake R Namovich
Organization:Michigan State University
Email Address:
namovich@msu.edu
Role:Hourly assistant for manual classification
Individual: Arika Hawkins
Organization:Michigan State University
Email Address:
hawki267@msu.edu
Role:Hourly assistant for manual classification

Temporal, Geographic and Taxonomic Coverage

Temporal, Geographic and/or Taxonomic information that applies to all data in this dataset:

Time Period
Begin:
2018
End:
2020
Geographic Region:
Description:conterminous U.S. (lower 48 states and the District of Columbia)
Bounding Coordinates:
Northern:49.0
Southern:25.0
Western:-125.0
Eastern:-67.0
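
The bounding coordinates can serve as a coarse sanity check on lake_lat_decdeg and lake_lon_decdeg. The following is a minimal sketch with a hypothetical function name; it assumes the coordinates are rounded values, so a centroid slightly outside the box is not necessarily an error.

```python
# Bounding coordinates as listed in this metadata record (decimal degrees).
BOUNDS = {"north": 49.0, "south": 25.0, "west": -125.0, "east": -67.0}


def in_bounding_box(lat, lon, bounds=BOUNDS):
    """Return True if the point lies inside the box, edges included."""
    return (bounds["south"] <= lat <= bounds["north"]
            and bounds["west"] <= lon <= bounds["east"])


# Illustrative points: one in Michigan, one in Alaska.
inside = in_bounding_box(44.5, -85.0)
outside = in_bounding_box(61.2, -149.9)
```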

Project

Parent Project Information:

Title:LAGOS-US RESERVOIR
Personnel:
Individual: Kendra Spence Cheruvelil
Organization:Michigan State University
Email Address:
ksc@msu.edu
Id:https://orcid.org/0000-0003-1880-2880
Role:PI
Additional Award Information:
Funder:US National Science Foundation
Number:US NSF Macrosystems Biology Program grants: DEB‐1638679, DEB‐1638550, DEB‐1638539, DEB‐1638554
Title:A macrosystems ecology framework for continental-scale prediction and understanding of lakes

Maintenance

Maintenance:
Description:

Data collection completed and no maintenance is planned.

Frequency:
Other Metadata

Additional Metadata

additionalMetadata
        |___text '\n    '
        |___element 'metadata'
        |     |___text '\n      '
        |     |___element 'unitList'
        |     |     |___text '\n        '
        |     |     |___element 'unit'
        |     |     |     |  \___attribute 'id' = 'no units'
        |     |     |     |  \___attribute 'name' = 'no units'
        |     |     |     |___text '\n          '
        |     |     |     |___element 'description'
        |     |     |     |___text '\n        '
        |     |     |___text '\n        '
        |     |     |___element 'unit'
        |     |     |     |  \___attribute 'id' = 'decimal degrees'
        |     |     |     |  \___attribute 'name' = 'decimal degrees'
        |     |     |     |___text '\n          '
        |     |     |     |___element 'description'
        |     |     |     |___text '\n        '
        |     |     |___text '\n      '
        |     |___text '\n    '
        |___text '\n  '

Additional Metadata

additionalMetadata
        |___text '\n    '
        |___element 'metadata'
        |     |___text '\n      '
        |     |___element 'emlEditor'
        |     |        \___attribute 'app' = 'ezEML'
        |     |        \___attribute 'release' = '2022.11.16'
        |     |___text '\n    '
        |___text '\n  '

EDI is a collaboration between the University of New Mexico and the University of Wisconsin – Madison, Center for Limnology.