Data Package Metadata

Satellite derived chlorophyll-a of the Ohio and Illinois Rivers (CHOIR)

General Information
Data Package:
Local Identifier:edi.1815.1
Title:Satellite derived chlorophyll-a of the Ohio and Illinois Rivers (CHOIR)
Alternate Identifier:DOI PLACE HOLDER
Abstract:

Chlorophyll-a is a vital water quality parameter used to quantify concentrations of algal biomass in freshwater systems. However, insufficient field data in the Ohio River Basin have limited our understanding of how algal blooms develop. We built a 38-year (1984 to 2022) dataset of satellite-derived chlorophyll-a predictions to support research that aims to quantify the frequency and intensity of river algal blooms. We developed our model using coinciding in situ chlorophyll-a data and surface reflectance extracted from Landsat Collection 2 Tier 1, referred to as matchups. Matchups were used to train and test our machine learning model. We also extracted Landsat surface reflectance over 6,116 NHD river reaches using similar methods, then applied our model to these reach-level data to create a comprehensive dataset of chlorophyll-a predictions. This dataset includes the following files: 1) data used to train and test the model (matchups), 2) the model infrastructure, 3) satellite-derived chlorophyll-a predictions aggregated over NHD river reaches, and 4) a shapefile of NHD river reaches.

Publication Date:2024-12-03
For more information:
Visit: DOI PLACE HOLDER

Time Period
Begin:
1984-03-16
End:
2022-11-01

People and Organizations
Contact:Sillen, Samuel J (University of Pittsburgh)
Contact:Gardner, John R
Creator:Sillen, Samuel James (University of Pittsburgh)
Creator:Zuccolotto, Gabriella (University of Pittsburgh)
Creator:Steele, Bethel (Colorado State University)
Creator:Ross, Matthew R.V. (Colorado State University)
Creator:Elliott, Emily Maureen (University of Pittsburgh)
Creator:Gardner, John R (University of Pittsburgh)

Data Entities
Data Table Name:
CHOIR_LC02_pred
Description:
Satellite-derived chlorophyll-a predictions. Also included are the reach-level data used to apply our chlorophyll-a model over NHD reaches.
Data Table Name:
CHOIR_LC02_matchups
Description:
Matchups used for model training and testing.
Other Name:
CHOIR_Model
Description:
XGBoost model environment used to estimate chlorophyll-a.
Other Name:
NHD_Centerlines
Description:
Shapefile including NHD reach geometry and common identifiers (COMID).
Detailed Metadata

Data Entities


Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/1815/1/8540f76eb8be0203ddc0b08a8a9d4dd8
Name:CHOIR_LC02_pred
Description:Satellite-derived chlorophyll-a predictions. Also included are the reach-level data used to apply our chlorophyll-a model over NHD reaches.
Number of Records:3211825
Number of Columns:54

Table Structure
Object Name:CHOIR_RiverSR_pred.csv
Size:2144050466 byte
Authentication:c7c0c09c79d7f495fd9abc610e78efc4 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"
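
The text format above (one header line, comma field delimiter, double-quote quote character) together with the NA missing value code can be read with a standard CSV parser. The following is a minimal sketch using only the Python standard library; the function name `read_choir_csv` and the sample values are hypothetical and are not part of the data package.

```python
import csv
import io

def read_choir_csv(handle, missing="NA"):
    """Yield one dict per record, mapping the missing-value code to None.

    Assumes the layout listed in the metadata: one header line,
    comma-delimited fields, and double quotes as the quote character.
    """
    reader = csv.DictReader(handle, delimiter=",", quotechar='"')
    for row in reader:
        yield {key: (None if value == missing else value)
               for key, value in row.items()}

# Tiny in-memory stand-in for CHOIR_RiverSR_pred.csv (values are made up).
sample = io.StringIO(
    "COMID,date,sat,Surface_temp_kelvin,pred\n"
    "1234567,2001-07-04,LT05,NA,8.5\n"
    '1234567,2015-08-12,"LC08",296.4,21.0\n'
)
rows = list(read_choir_csv(sample))
```

For the real file, which is about 2.1 GB, iterating over the generator rather than calling `list` keeps memory use low; open the file with `newline=""` as the `csv` module recommends.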

Table Column Descriptions
Column Name | Definition | Storage Type | Measurement Type | Measurement Values Domain | Missing Value Code
Aerosol | Median of corrected surface reflectance from the aerosol band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
sd_Aerosol | Standard deviation of corrected surface reflectance from the aerosol band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Aerosol | Number of pixels that contain negative values for the aerosol band over WQP site | integer | ratio | Unit: integer; Type: integer |
sd_Blue | Standard deviation of corrected surface reflectance from the blue band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Blue | Number of pixels that contain negative values for the blue band over WQP site | integer | ratio | Unit: integer; Type: integer |
sd_Green | Standard deviation of corrected surface reflectance from the green band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Green | Number of pixels that contain negative values for the green band over WQP site | integer | ratio | Unit: unitless; Type: integer |
sd_Red | Standard deviation of corrected surface reflectance from the red band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Red | Number of pixels that contain negative values for the red band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
sd_Nir | Standard deviation of corrected surface reflectance from the NIR band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Nir | Number of pixels that contain negative values for the NIR band over WQP site | integer | ratio | Unit: integer; Type: integer |
sd_Swir1 | Standard deviation of corrected surface reflectance from the swir1 band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Swir1 | Number of pixels that contain negative values for the swir1 band over WQP site | integer | ratio | Unit: integer; Type: integer |
sd_Swir2 | Standard deviation of corrected surface reflectance from the swir2 band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Swir2 | Number of pixels that contain negative values for the swir2 band over WQP site | integer | ratio | Unit: integer; Type: integer |
Surface_temp_kelvin | Surface water temperature in Kelvin recorded from the Landsat thermal band | float | ratio | Unit: kelvin; Type: real | NA (No temperature recorded)
pixel_qa | Pixel qa of water mask | integer | ratio | Unit: integer; Type: real |
clouds | Cloud mask within WQP site buffer | float | ratio | Unit: integer; Type: real |
algal_mask | Dynamic surface water extent mask with additional algal mask thresholds applied. Only 1 was collected. | float | ratio | Unit: integer; Type: real |
hillShadow | Median hillshadow of site buffer. Binary (0 = shadowed, 1 = not shadowed). | float | ratio | Unit: integer; Type: real |
pCount_algal_mask | Number of pixels in WQP site water mask | integer | ratio | Unit: integer; Type: integer |
pCount_shadow | Number of shadowed pixels in WQP site water mask | integer | ratio | Unit: integer; Type: integer |
landsatID | Landsat product identifier (collection 2) | string | nominal | Definition: text |
IMAGE_QUALITY | Landsat image quality | float | ratio | Unit: integer; Type: real |
date | Date of satellite image | dateTime | dateTime | Format: YYYY-MM-DD |
CLOUD_COVER | % cloud cover of Landsat scene | float | ratio | Unit: percent; Type: real |
SUN_ELEVATION | Solar elevation from Landsat scene | float | ratio | Unit: integer; Type: real |
SUN_AZIMUTH | Solar azimuth angle from Landsat scene | float | ratio | Unit: integer; Type: real |
COMID | Unique identifier used to link to NHD geometry | integer | ratio | Unit: integer; Type: integer |
sat | satellite id | string | nominal | Enumerated: LC08 = landsat 8; LC09 = landsat 9; LE07 = landsat 7; LT05 = landsat 5 |
Red_raw | Median of raw surface reflectance from the red band over NHD reach | float | ratio | Unit: unitless (0-1); Type: real |
Green_raw | Median of corrected surface reflectance from the green band over NHD reach | float | ratio | Unit: unitless (0-1); Type: real |
Blue_raw | Median of corrected surface reflectance from the blue band over NHD reach | float | ratio | Unit: unitless (0-1); Type: real |
Nir_raw | Median of corrected surface reflectance from the nir band over NHD reach | float | ratio | Unit: unitless (0-1); Type: real |
Swir1_raw | Median of corrected surface reflectance from the swir1 band over NHD reach | float | ratio | Unit: unitless (0-1); Type: real |
Swir2_raw | Median of corrected surface reflectance from the swir2 band over NHD reach | float | ratio | Unit: unitless (0-1); Type: real |
Red | Median of corrected surface reflectance from the red band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
Green | Median of corrected surface reflectance from the green band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
Blue | Median of corrected surface reflectance from the blue band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
Nir | Median of corrected surface reflectance from the NIR band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
Swir1 | Median of corrected surface reflectance from the swir1 band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
Swir2 | Median of corrected surface reflectance from the swir2 band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
R.BS | band ratio: Red / (Blue + Swir1) | float | ratio | Unit: unitless (0-1); Type: real |
fai | band ratio: Nir - (Red + (Swir1 - Red) * ((830 - 660) / (1650 - 660))) | float | ratio | Unit: unitless (0-1); Type: real |
GCI | band ratio: Nir / (Green - 1) | float | ratio | Unit: unitless (0-1); Type: real |
dw | dominant wavelength | integer | ratio | Unit: integer; Type: integer | NA (color metric not available)
IRG | Red - Green | float | ratio | Unit: unitless (0-1); Type: real |
SN | band ratio: Swir1 / Nir | float | ratio | Unit: unitless (0-1); Type: real |
R.BN | band ratio: Red / (Blue + Nir) | float | ratio | Unit: unitless (0-1); Type: real |
B.GS | band ratio: Blue / (Green + Swir1) | float | ratio | Unit: unitless (0-1); Type: real |
N.RS | band ratio: Nir / (Red + Swir1) | float | ratio | Unit: unitless (0-1); Type: real |
N.BS | band ratio: Nir / (Blue + Swir1) | float | ratio | Unit: unitless (0-1); Type: real |
G.BS | band ratio: Green / (Blue + Swir1) | float | ratio | Unit: unitless (0-1); Type: real |
pred | satellite predicted chlorophyll-a | float | ratio | Unit: microgramPerLiter; Type: real |
Accuracy Report:                                                                                                            
Accuracy Assessment:                                                                                                            
Coverage:                                                                                                            
Methods:                                                                                                            
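
The band-ratio columns in the column descriptions above can be recomputed from the median band reflectances. The sketch below follows the formulas as listed, with two stated assumptions: R.BS is read as Red / (Blue + Swir1), by analogy with N.BS and G.BS, and IRG is read as the difference Red - Green. The function name `band_indices` and the example reflectances are hypothetical, and the authors' own code may differ.

```python
def band_indices(red, green, blue, nir, swir1):
    """Recompute the derived band metrics from median reflectances.

    Formulas are transcribed from the column definitions; the grouping
    used for R.BS and the reading of IRG as a difference are assumptions.
    """
    return {
        "R.BS": red / (blue + swir1),
        "fai": nir - (red + (swir1 - red) * ((830 - 660) / (1650 - 660))),
        "GCI": nir / (green - 1),
        "IRG": red - green,
        "SN": swir1 / nir,
        "R.BN": red / (blue + nir),
        "B.GS": blue / (green + swir1),
        "N.RS": nir / (red + swir1),
        "N.BS": nir / (blue + swir1),
        "G.BS": green / (blue + swir1),
    }

# Made-up reflectances in the 0-1 range, for illustration only.
example = band_indices(red=0.04, green=0.06, blue=0.03, nir=0.02, swir1=0.01)
```

The sketch does not guard against zero denominators, which would need handling before applying it to real rows.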

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/1815/1/8b226634e9dfaeebbd933ae5025d9780
Name:CHOIR_LC02_matchups
Description:Matchups used for model training and testing.
Number of Records:78898
Number of Columns:57

Table Structure
Object Name:CHOIR_LC02_matchups_v2.csv
Size:55627297 byte
Authentication:e341aa287ca3339a26ad69ca3d4495c4 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"
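
The Authentication field lists an MD5 checksum for each file, which can be used to confirm that a download is complete. The following is a minimal sketch using Python's `hashlib`; the helper names `md5_hex` and `matches_listing` are hypothetical.

```python
import hashlib
import io

def md5_hex(stream, chunk_size=1 << 20):
    """Return the hex MD5 digest of a binary stream, read in chunks."""
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()

def matches_listing(stream, expected_hex):
    """Compare a stream's MD5 digest with the value listed in the metadata."""
    return md5_hex(stream) == expected_hex.strip().lower()

# Demonstration on in-memory bytes; for a real download, pass a file
# opened in binary mode together with the listed checksum, e.g.
# open("CHOIR_LC02_matchups_v2.csv", "rb") and
# "e341aa287ca3339a26ad69ca3d4495c4".
demo_ok = matches_listing(io.BytesIO(b"hello"), "5d41402abc4b2a76b9719d911017c592")
```

Reading in chunks keeps memory use constant, which matters for the larger prediction table.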

Table Column Descriptions
Column Name | Definition | Storage Type | Measurement Type | Measurement Values Domain | Missing Value Code
date | Date of in situ sample collection | dateTime | dateTime | Format: YYYY-MM-DD |
characteristicName | Measurement type (Chlorophyll-a, Chlorophyll-a corrected for pheophytin, etc.) | string | nominal | Enumerated (allowed values listed after this table) |
SiteID | Site identifier for WQP in situ sample location | string | nominal | Definition: text |
time | Time of in situ sample collection | dateTime | dateTime | Format: hh:mm:ss | NA (Time not recorded)
value | Recorded value of in situ sample | float | ratio | Unit: microgramPerLiter; Type: real |
analytical_method | Method used for chlorophyll-a detection | string | nominal | Definition: text | NA (Time not recorded)
harmonized_parameter | Harmonized characteristic name (chl-a) | string | nominal | Enumerated: chl.a = chlorophyll-a |
harmonized_depth | Depth of in situ sample | float | ratio | Unit: meter; Type: real | NA (depth not recorded)
date_unity | Joined date of in situ and Landsat SR observation | dateTime | dateTime | Format: YYYY-MM-DD hh:mm:ss | NA (Time not recorded, but date field was extracted)
source | Source of in situ data (LAGOS, WQP) | string | nominal | Enumerated: WQP = Water Quality Portal |
Aerosol | Median of corrected surface reflectance from the aerosol band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
sd_Aerosol | Standard deviation of corrected surface reflectance from the aerosol band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Aerosol | Number of pixels that contain negative values for the aerosol band over WQP site | integer | ratio | Unit: integer; Type: integer |
Blue | Median of corrected surface reflectance from the blue band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
sd_Blue | Standard deviation of corrected surface reflectance from the blue band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Blue | Number of pixels that contain negative values for the blue band over WQP site | integer | ratio | Unit: integer; Type: integer |
Green | Median of corrected surface reflectance from the green band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
sd_Green | Standard deviation of corrected surface reflectance from the green band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Green | Number of pixels that contain negative values for the green band over WQP site | integer | ratio | Unit: unitless; Type: integer |
Red | Median of corrected surface reflectance from the red band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
sd_Red | Standard deviation of corrected surface reflectance from the red band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Red | Number of pixels that contain negative values for the red band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
Nir | Median of corrected surface reflectance from the NIR band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
sd_Nir | Standard deviation of corrected surface reflectance from the NIR band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Nir | Number of pixels that contain negative values for the NIR band over WQP site | integer | ratio | Unit: integer; Type: integer |
Swir1 | Median of corrected surface reflectance from the swir1 band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
sd_Swir1 | Standard deviation of corrected surface reflectance from the swir1 band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Swir1 | Number of pixels that contain negative values for the swir1 band over WQP site | integer | ratio | Unit: integer; Type: integer |
Swir2 | Median of corrected surface reflectance from the swir2 band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
sd_Swir2 | Standard deviation of corrected surface reflectance from the swir2 band over WQP site | float | ratio | Unit: unitless (0-1); Type: real |
pCount_negative_Swir2 | Number of pixels that contain negative values for the swir2 band over WQP site | integer | ratio | Unit: integer; Type: integer |
Surface_temp_kelvin | Surface water temperature in Kelvin recorded from the Landsat thermal band | float | ratio | Unit: kelvin; Type: real |
pixel_qa | Pixel qa of water mask | integer | ratio | Unit: integer; Type: integer |
clouds | Cloud mask within WQP site buffer | float | ratio | Unit: integer; Type: real |
algal_mask | Dynamic surface water extent mask with additional algal mask thresholds applied. Only 1 was collected. | float | ratio | Unit: integer; Type: real |
hillShadow | Median hillshadow of site buffer. Binary (0 = shadowed, 1 = not shadowed). | float | ratio | Unit: integer; Type: real |
pCount_algal_mask | Number of pixels in WQP site water mask | integer | ratio | Unit: integer; Type: integer |
pCount_shadow | Number of shadowed pixels in WQP site water mask | integer | ratio | Unit: integer; Type: integer |
landsatID | Landsat product identifier (collection 2) | string | nominal | Definition: text |
IMAGE_QUALITY | Landsat image quality | float | ratio | Unit: integer; Type: real |
CLOUD_COVER | % cloud cover of Landsat scene | float | ratio | Unit: percent; Type: real |
SUN_ELEVATION | Solar elevation from Landsat scene | float | ratio | Unit: integer; Type: real |
SUN_AZIMUTH | Solar azimuth angle from Landsat scene | float | ratio | Unit: integer; Type: real |
lat | latitude measurement of in situ sample | float | ratio | Unit: decimal degrees; Type: real |
long | longitude | float | ratio | Unit: decimal degrees; Type: real |
uniqueID | unique identifier (row number) | integer | ratio | Unit: integer; Type: integer |
R.BS | band ratio: Red / (Blue + Swir1) | float | ratio | Unit: unitless (0-1); Type: real |
fai | band ratio: Nir - (Red + (Swir1 - Red) * ((830 - 660) / (1650 - 660))) | float | ratio | Unit: unitless (0-1); Type: real |
GCI | band ratio: Nir / (Green - 1) | float | ratio | Unit: unitless (0-1); Type: real |
dw | dominant wavelength | integer | ratio | Unit: integer; Type: integer | NA (color metric not available)
IRG | Red - Green | float | ratio | Unit: unitless (0-1); Type: real |
SN | band ratio: Swir1 / Nir | float | ratio | Unit: unitless (0-1); Type: real |
R.BN | band ratio: Red / (Blue + Nir) | float | ratio | Unit: unitless (0-1); Type: real |
B.GS | band ratio: Blue / (Green + Swir1) | float | ratio | Unit: unitless (0-1); Type: real |
N.RS | band ratio: Nir / (Red + Swir1) | float | ratio | Unit: unitless (0-1); Type: real |
N.BS | band ratio: Nir / (Blue + Swir1) | float | ratio | Unit: unitless (0-1); Type: real |
G.BS | band ratio: Green / (Blue + Swir1) | float | ratio | Unit: unitless (0-1); Type: real |

Allowed values for characteristicName (each is a characteristic name identifying a type of environmental measurement, derived from the USEPA Substance Registry System (SRS)):
Chlorophyll a
Chlorophyll a (probe relative fluorescence)
Chlorophyll a (probe)
Chlorophyll a - Phytoplankton (suspended)
Chlorophyll a, corrected for pheophytin
Chlorophyll a, free of pheophytin
Accuracy Report:                                                                                                                  
Accuracy Assessment:                                                                                                                  
Coverage:                                                                                                                  
Methods:                                                                                                                  

Non-Categorized Data Resource

Name:CHOIR_Model
Entity Type:.rDS
Description:XGBoost model environment used to estimate chlorophyll-a.
Physical Structure Description:
Object Name:CHOIR_Model.rDS
Size:1769554 byte
Authentication:6d3db4a81c6c8dba0117fab0186d10f6 Calculated By MD5
Externally Defined Format:
Format Name:.rDS
Data:https://pasta-s.lternet.edu/package/data/eml/edi/1815/1/442961f82c9c03392289061965b66b42

Non-Categorized Data Resource

Name:NHD_Centerlines
Entity Type:application/zip
Description:Shapefile including NHD reach geometry and common identifiers (COMID).
Physical Structure Description:
Object Name:NHD_Centerlines.zip
Size:924126 byte
Authentication:eb8bdf380d710cf4a0ff62e6af9eb80d Calculated By MD5
Externally Defined Format:
Format Name:application/zip
Data:https://pasta-s.lternet.edu/package/data/eml/edi/1815/1/edaeae15f56f582b91ed68ae2a9d1bca
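
Because COMID is the identifier that links prediction rows to NHD reach geometry, reach-level summaries can be attached to the shapefile's attribute records by a simple key join. The sketch below uses plain dictionaries in place of a GIS library; the function names, the `median_pred` field, and the sample records are hypothetical.

```python
from collections import defaultdict
from statistics import median

def median_pred_by_reach(pred_rows):
    """Group predicted chlorophyll-a by COMID and take the median per reach."""
    by_reach = defaultdict(list)
    for row in pred_rows:
        if row["pred"] is not None:
            by_reach[row["COMID"]].append(float(row["pred"]))
    return {comid: median(values) for comid, values in by_reach.items()}

def attach_to_reaches(reaches, reach_medians):
    """Add a median_pred field to each reach record (None when no predictions)."""
    return [{**reach, "median_pred": reach_medians.get(reach["COMID"])}
            for reach in reaches]

# Made-up records standing in for the prediction table and reach attributes.
pred_rows = [
    {"COMID": 1, "pred": 4.0},
    {"COMID": 1, "pred": 10.0},
    {"COMID": 1, "pred": 6.0},
    {"COMID": 2, "pred": 3.0},
]
reaches = [{"COMID": 1}, {"COMID": 2}, {"COMID": 3}]
joined = attach_to_reaches(reaches, median_pred_by_reach(pred_rows))
```

Reaches without any prediction keep a `median_pred` of None, so they remain visible in the joined output rather than being dropped.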

Data Package Usage Rights

This information is released under the Creative Commons license - Attribution - CC BY (https://creativecommons.org/licenses/by/4.0/). The consumer of these data ("Data User" herein) is required to cite it appropriately in any publication that results from its use. The Data User should realize that these data may be actively used by others for ongoing research and that coordination may be necessary to prevent duplicate publication. The Data User is urged to contact the authors of these data if any questions about methodology or results occur. Where appropriate, the Data User is encouraged to consider collaboration or co-authorship with the authors. The Data User should realize that misinterpretation of data may occur if used out of context of the original study. While substantial efforts are made to ensure the accuracy of data and associated documentation, complete accuracy of data sets cannot be guaranteed. All data are made available "as is." The Data User should be aware, however, that data are updated periodically and it is the responsibility of the Data User to check for new versions of the data. The data authors and the repository where these data were obtained shall not be liable for damages resulting from any use or misinterpretation of the data. Thank you.

Keywords

By Thesaurus:
(No thesaurus) Chlorophyll-a, Long-term data, Water quality, Remote sensing

Methods and Protocols

These methods, instrumentation and/or protocols apply to all data in this dataset:

Methods and protocols used in the collection of this data package
Description:

To develop a model to predict chlorophyll-a based solely on Landsat imagery, we first built a dataset of in situ chlorophyll-a data paired with coinciding surface reflectance, referred to as “matchups”. We used those matchups to train a model to estimate chlorophyll-a. Then, this model was applied to reach-level remote sensing summaries to create a seamless spatial data product over the Ohio and Illinois river basins.

Our approach to creating a matchup dataset follows the data and methods outlined in Ross et al. (2019), in which water quality observations from the field are joined with satellite acquisitions captured within +/- 2 days of the in situ measurement. To build our matchup dataset, we first identified the unique chlorophyll-a sampling locations within the Water Quality Portal (Read et al., 2017) and LAGOS-US (Cheruvelil et al., 2021), resulting in a total of 44,154 sites across the US. Using the Google Earth Engine API (Gorelick et al., 2017), we extracted the full Landsat Collection 2 Tier 1 Surface Reflectance product stack across Landsat 5, 7, 8, and 9 at these site locations. These remote sensing workflows are described in Ross et al. (2019), Topp et al. (2021), and Gardner et al. (2021).
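
The +/- 2 day pairing rule can be sketched as a date-window join between in situ samples and satellite acquisitions at the same site. This is an illustration under stated assumptions, not the authors' workflow: the function name `build_matchups`, the record layout, and the sample identifiers are hypothetical.

```python
from datetime import date, timedelta

def build_matchups(in_situ, scenes, window_days=2):
    """Pair each in situ sample with same-site scenes within +/- window_days."""
    window = timedelta(days=window_days)
    scenes_by_site = {}
    for scene in scenes:
        scenes_by_site.setdefault(scene["SiteID"], []).append(scene)
    matchups = []
    for sample in in_situ:
        for scene in scenes_by_site.get(sample["SiteID"], []):
            if abs(scene["date"] - sample["date"]) <= window:
                matchups.append({**sample,
                                 "scene_date": scene["date"],
                                 "landsatID": scene["landsatID"]})
    return matchups

# Made-up sample and scene records for illustration.
in_situ = [{"SiteID": "A", "date": date(2010, 6, 10), "value": 12.0}]
scenes = [
    {"SiteID": "A", "date": date(2010, 6, 8), "landsatID": "scene-1"},   # 2 days before: kept
    {"SiteID": "A", "date": date(2010, 6, 13), "landsatID": "scene-2"},  # 3 days after: dropped
    {"SiteID": "B", "date": date(2010, 6, 10), "landsatID": "scene-3"},  # other site: dropped
]
matchups = build_matchups(in_situ, scenes)
```

A sample can match more than one scene inside the window; this sketch keeps every such pair rather than choosing the closest one.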

We developed an XGBoost model (Chen and Guestrin, 2016) to estimate chlorophyll-a from our matchup dataset and evaluated it using performance metrics including Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE). Our testing data represented the full range of chlorophyll-a values observed in our full training set, ranging from 0.11 µg/L to 190 µg/L, with an average value of 12.82 µg/L. The model predicted chlorophyll-a with an RMSE of 14.0 µg/L and an MAE of 6.21 µg/L.
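
For reference, the two error metrics reported here can be computed as follows. This is a generic sketch of the standard RMSE and MAE definitions, not the authors' evaluation code, and the example values are made up.

```python
import math

def rmse(observed, predicted):
    """Root Mean Squared Error: square root of the mean squared difference."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))

def mae(observed, predicted):
    """Mean Absolute Error: mean of the absolute differences."""
    absolute = [abs(o - p) for o, p in zip(observed, predicted)]
    return sum(absolute) / len(absolute)

# Made-up chlorophyll-a values in µg/L.
observed = [1.0, 2.0, 3.0]
predicted = [1.0, 2.0, 6.0]
example_rmse = rmse(observed, predicted)
example_mae = mae(observed, predicted)
```

RMSE weights large errors more heavily than MAE, which is consistent with the reported RMSE (14.0 µg/L) being larger than the MAE (6.21 µg/L) when a few high-chlorophyll samples are predicted poorly.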

Citations:

Chen, T. & Guestrin, C. XGBoost: A Scalable Tree Boosting System. in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (Association for Computing Machinery, New York, NY, USA, 2016). doi:10.1145/2939672.2939785.

Cheruvelil, K. S. et al. LAGOS-US LOCUS v1.0: Data module of location, identifiers, and physical characteristics of lakes and their watersheds in the conterminous U.S. Limnology and Oceanography Letters 6, 270–292 (2021).

Gardner, J. R. et al. The Color of Rivers. Geophysical Research Letters 48, e2020GL088946 (2021).

Read, E. K. et al. Water quality data for national-scale aquatic research: The Water Quality Portal. Water Resources Research 53, 1735–1745 (2017).

Ross, M. R. V. et al. AquaSat: A Data Set to Enable Remote Sensing of Water Quality for Inland Waters. Water Resources Research 55, 10012–10025 (2019).

Topp, S. N. et al. Multi-decadal improvement in US Lake water clarity. Environ. Res. Lett. 16, 055025 (2021).

People and Organizations

Publishers:
Organization:Environmental Data Initiative
Email Address:
info@edirepository.org
Web Address:
https://edirepository.org
Id:https://ror.org/0330j0z60
Creators:
Individual: Samuel James Sillen
Organization:University of Pittsburgh
Email Address:
SJS281@pitt.edu
Id:https://orcid.org/0009-0005-5099-3340
Individual: Gabriella Zuccolotto
Organization:University of Pittsburgh
Email Address:
GLZ9@pitt.edu
Individual: Bethel Steele
Organization:Colorado State University
Email Address:
b.steele@colostate.edu
Individual: Matthew R.V. Ross
Organization:Colorado State University
Email Address:
Matt.Ross@colostate.edu
Individual: Emily Maureen Elliott
Organization:University of Pittsburgh
Email Address:
eelliott@pitt.edu
Individual: John R Gardner
Organization:University of Pittsburgh
Email Address:
gardner.john@pitt.edu
Contacts:
Individual: Samuel J Sillen
Organization:University of Pittsburgh
Email Address:
SJS281@pitt.edu
Id:https://orcid.org/0009-0005-5099-3340
Individual: John R Gardner
Email Address:
gardner.john@pitt.edu

Temporal, Geographic and Taxonomic Coverage

Temporal, Geographic and/or Taxonomic information that applies to all data in this dataset:

Time Period
Begin:
1984-03-16
End:
2022-11-01
Geographic Region:
Description:This dataset is located within the Ohio and Illinois River Basins.
Bounding Coordinates:
Northern: 43.20504
Southern: 35.31332
Western: -91.23696
Eastern: -77.83937
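
Since the matchup sites span the US while the predictions cover only these basins, the bounding coordinates give a quick way to screen points. The sketch below is a coarse rectangular test only (the actual basin boundary is not a rectangle); the names `BOUNDS` and `in_bounds` are hypothetical and the two test points are approximate city coordinates chosen for illustration.

```python
# Bounding coordinates as listed in the metadata (decimal degrees).
BOUNDS = {
    "north": 43.20504,
    "south": 35.31332,
    "west": -91.23696,
    "east": -77.83937,
}

def in_bounds(lat, long, bounds=BOUNDS):
    """Return True when a point falls inside the rectangular bounding box."""
    return (bounds["south"] <= lat <= bounds["north"]
            and bounds["west"] <= long <= bounds["east"])

# Approximate coordinates, for illustration only.
pittsburgh_inside = in_bounds(40.44, -79.99)
denver_inside = in_bounds(39.74, -104.99)
```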

Project

Parent Project Information:

Title:Satellite derived chlorophyll-a of the Ohio and Illinois Rivers (CHOIR)
Personnel:
Individual: Samuel James Sillen
Organization:University of Pittsburgh
Position:Data Scientist
Email Address:
SJS281@pitt.edu
Id:https://orcid.org/0009-0005-5099-3340
Role:Dataset Manager

Maintenance

Maintenance:
Description:

This dataset currently extends through 2022. We may update it with additional data in the future.

Frequency:unknown
Other Metadata

Additional Metadata

Unit List (stmml namespace http://www.xml-cml.org/schema/stmml-1.2; no unit descriptions provided):
unitless (0-1)
unitless
integer
decimal degrees
kelvin

Additional Metadata

EML Editor:ezEML
Release:2024.10.30
