Data Package Metadata View Summary

Lake chloride concentrations and model predictions for 49,432 lakes in the Midwest and Northeast United States.

General Information

Data Package:
Local Identifier:	edi.452.2
Title:	Lake chloride concentrations and model predictions for 49,432 lakes in the Midwest and Northeast United States.
Alternate Identifier:	DOI PLACE HOLDER
Abstract:	Lakes in the Midwest and Northeast United States are at risk of anthropogenic chloride contamination, but we have little knowledge of the prevalence and spatial distribution of the problem. The majority of salt pollution in north temperate regions stems from road salt application but other chloride sources include water softeners, synthetic fertilizers, and livestock excretion. Although chloride contamination of lakes is well documented, it is unknown how many lakes are at risk of long-term salinization. We used a quantile regression forest to leverage information from 2,773 lakes to predict the chloride concentration of all 49,432 lakes greater than 4 ha in a 17-state area. The QRF used 22 predictor variables, which included lake morphometry characteristics, watershed land use, and distance to the nearest interstate and road. Model predictions had an r2 of 0.94 for all chloride observations, and 0.87 for predictions of the mean chloride concentration observed at each lake.
Publication Date:	2020-04-24

Time Period

Begin:

1990-01-01

End:

2018-12-13

People and Organizations
Contact:	Dugan, Hilary A (University of Wisconsin-Madison) [ email ]
Creator:	Dugan, Hilary A (University of Wisconsin-Madison)
Creator:	Skaff, Nicholas K (University of California, Berkeley University)
Creator:	Doubek, Jonathan P (Lake Superior State University)
Creator:	Burke, Samantha M (University of Guelph)
Creator:	Krivak-Tetley, Flora E (Dartmouth College)
Creator:	Summers, Jamie C

Data Entities
Data Table Name:	lakeCL_predictions.csv
Description:	chloride prediction model output
Data Table Name:	lakeCL_trainingData.csv
Description:	chloride prediction model training data
Data Table Name:	WisconsinLakes_Chloride.csv
Description:	Chloride concentrations from a suite of Wisconsin Lakes in summer 2018
Other Name:	QRF_script.R
Description:	R code which builds a quantile regression forest model using observational chloride data and predictor variables found in lakeCL_trainingData.csv

Detailed Metadata

Data Entities

Data Table


Data:	https://pasta-s.lternet.edu/package/data/eml/edi/452/2/282e3d41aef63c11c386cf65eed1b26d
Name:	lakeCL_predictions.csv
Description:	chloride prediction model output
Number of Records:	49432
Number of Columns:	32

Table Structure

Object Name:

lakeCL_predictions.csv

Size:

10153548 bytes

Authentication:

9b23d7e8d75efbdc4bd28007b06e445d Calculated By MD5

Text Format:

Number of Header Lines:

Record Delimiter:

Orientation:

column

Simple Delimited:

Field Delimiter:	,
Quote Character:	"

Table Column Descriptions

Column Name:

lagoslakeid

nhdid

gnis_name

nhd_lat

nhd_long

LakeArea

WS_Area

MaxDepth

lakeconnection

WS_OpenWater

WS_Dev_Open

WS_Dev_Low

WS_Dev_Med

WS_Dev_High

WS_Barren

WS_DeciduousForest

WS_EvergreenForest

WS_MixedForest

WS_Schrub

WS_Grassland

WS_PastureHay

WS_Crops

WS_WoodyWetlands

WS_EmergentWetlands

WS_RoadDensity

InterstateDistance

RoadDistance

WinterSeverity

state_name

pred_05

pred_50

pred_95

Definition:

Unique lake identifier developed for LAGOS-NE

Unique lake identifier from National Hydrography dataset

Lake Name

Latitude

Longitude

Surface area of the lake

Surface area of the watershed

Maximum depth of lake

Connectivity of focal lake to upstream features (DR_LakeStream = drainage lake with an upstream lake, DR_Stream = drainage lake with upstream stream, Headwater = lake with outlet but no inlet, Isolated = lake with no inlets or outlets)

% landuse classified as open water in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as open space, developed in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as developed, low intensity in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as developed, medium intensity in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as developed, high intensity in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as barren/transitional in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as deciduous forest in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as evergreen forest in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as mixed forest in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as schrubland in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as grassland in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as pasture/hay in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as row crops in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as woody wetlands in the watershed. Derived from the National Land Cover Dataset (NLCD).

% landuse classified as herbaceous wetlands in the watershed. Derived from the National Land Cover Dataset (NLCD).

Road density in the watershed. Derived from the National Land Cover Dataset (NLCD).

Distance to the nearest interstate

Distance to the nearest road

Winter severity index obtained from ClearRoads (national research consortium, clearroads.org). Calculated from 2000 to 2010 as 0.50 × (average annual snowfall in inches) + 0.05 × (annual duration of snowfall in hours) + 0.05 × (annual duration of blowing snow in hours) + 0.10 × (annual duration of freezing rain in hours).

Name of US state that lake is located in (or partially in)

Prediction interval: 0.05 quantile

Median prediction

Prediction interval: 0.95 quantile

Storage Type:

string

float

string

float

string

float

Measurement Type:

nominal

ratio

nominal

ratio

nominal

ratio

Measurement Values Domain:

Definition

Unique lake identifier developed for LAGOS-NE

Definition

Unique lake identifier from National Hydrography dataset

Definition

Lake Name

Unit	degree
Type	real
Min	35.998945
Max	48.98999

Unit	degree
Type	real
Min	-97.216823
Max	-67.091051

Unit	hectare
Type	real
Min	4
Max	66650.33

Unit	hectare
Type	real
Min	0.1
Max	3204167.25

Unit	meter
Type	real
Min	0.1
Max	198.4

Allowed Values and Definitions

Enumerated Domain

Code Definition

Code	DR_LakeStream
Definition	drainage lake with an upstream lake
Source

Code Definition

Code	Headwater
Definition	lake with outlet but no inlet
Source

Code Definition

Code	DR_Stream
Definition	drainage lake with upstream stream
Source

Code Definition

Code	Isolated
Definition	lake with no inlets or outlets
Source

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	dimensionless
Type	real
Min	0
Max	83.51

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	dimensionless
Type	real
Min	0
Max	98.84

Unit	dimensionless
Type	real
Min	0
Max	86.15

Unit	dimensionless
Type	real
Min	0
Max	97.63

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	dimensionless
Type	real
Min	0
Max	100

Unit	metersPerHectare
Type	real
Min	0
Max	5631.9

Unit	meter
Type	real
Min	0.01
Max	311.21

Unit	meter
Type	real
Min	0
Max	114.41

Unit	dimensionless
Type	real
Min	4.97
Max	185.17

Definition

Name of US state that lake is located in (or partially in)

Unit	milligramsPerLiter
Type	real
Min	0.071
Max	955.001

Unit	milligramsPerLiter
Type	real
Min	0.081
Max	2778.001

Unit	milligramsPerLiter
Type	real
Min	0.262321230738575
Max	2979.001

Missing Value Code:

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Code	NA
Expl	not available

Accuracy Report:

Accuracy Assessment:

Coverage:

Methods:

Data Table


Data:	https://pasta-s.lternet.edu/package/data/eml/edi/452/2/c485df5d24d63fe422c2f1949872ff51
Name:	lakeCL_trainingData.csv
Description:	chloride prediction model training data
Number of Records:	29010
Number of Columns:	31

Table Structure

Object Name:

lakeCL_trainingData.csv

Size:

6477664 bytes

Authentication:

d9bc49d4207530cfbbcbc26115e3b509 Calculated By MD5

Text Format:

Number of Header Lines:

Record Delimiter:

Orientation:

column

Simple Delimited:

Field Delimiter:	,
Quote Character:	"

Table Column Descriptions

Column Name:

lagoslakeid

nhdid

gnis_name

ActivityStartDate

Chloride

nhd_lat

nhd_long

MaxDepth

state_name

Month

LakeArea

WS_Area

WinterSeverity

WS_OpenWater

WS_Dev_Open

WS_Dev_Low

WS_Dev_Med

WS_Dev_High

WS_Barren

WS_DeciduousForest

WS_EvergreenForest

WS_MixedForest

WS_Schrub

WS_Grassland

WS_PastureHay

WS_Crops

WS_WoodyWetlands

WS_EmergentWetlands

WS_RoadDensity

InterstateDistance

RoadDistance

Definition:

Unique lake identifier developed for LAGOS-NE

Unique lake identifier from National Hydrography dataset

Lake Name

Date of sampling

Chloride concentration

Latitude

Longitude

Maximum depth of lake

Name of US state that lake is located in (or partially in)

Month of sampling