Data Package Metadata   View Summary

State-of-the-art review of near-term freshwater forecasting literature published between 2017 and 2022

General Information
Data Package:
Local Identifier:edi.960.2
Title:State-of-the-art review of near-term freshwater forecasting literature published between 2017 and 2022
Alternate Identifier:DOI PLACE HOLDER
Abstract:

This data publication includes code and results from a systematic literature review on the current state of near-term forecasting of freshwater quality. The review aimed to address the following questions:

(1) Freshwater variables, scales, models, and skill: Which freshwater variables and temporal scales are most commonly targeted for near-term forecasts, and what modeling methods are most commonly employed to develop these forecasts? How is the accuracy of freshwater quality forecasts assessed, and how accurate are they? How is uncertainty typically incorporated into water quality forecast output?

(2) Forecast infrastructure and workflows: Are iterative, automated workflows commonly employed in near-term freshwater quality forecasting? How are forecasts validated and archived?

(3) Human dimensions: What is the stated motivation for development of most near-term freshwater quality forecasts, and who are the most common end users (if any)? How are end users engaged in forecast development?

An initial search was conducted for published papers presenting freshwater quality forecasts from 1 January 2017 to 17 February 2022 in the Web of Science Core Collection. Results were subsequently analyzed in three stages. First, paper titles were screened for relevance. Second, an initial screen was conducted to assess whether each paper presented a near-term freshwater quality forecast. Third, papers that passed the initial screen were analyzed using a standardized matrix to assess the state of near-term freshwater quality forecasting and identify areas of recent progress and ongoing challenges. Additional details regarding the systematic literature search and review are presented in the Methods section of the metadata.

Publication Date:2022-12-16
For more information:
Visit: DOI PLACE HOLDER

Time Period
Begin:
2017-01-01
End:
2022-02-17

People and Organizations
Contact:Carey, Cayelan C. (Virginia Tech) [  email ]
Creator:Lofton, Mary E. (Virginia Tech)
Creator:Howard, Dexter W. (Virginia Tech)
Creator:Thomas, R. Quinn (Virginia Tech)
Creator:Carey, Cayelan C. (Virginia Tech)

Data Entities
Data Table Name:
freshwater-forecasting-review-results.csv
Description:
Freshwater forecasting review results
Data Table Name:
Fig4_data.csv
Description:
Data table to creat Fig. 4
Other Name:
EDI_data_QAQC_and_formatting.R
Description:
data aggregation and quality control script
Detailed Metadata

Data Entities


Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/960/2/3bdc7201f46157a0d6db5f0a9a8330cc
Name:freshwater-forecasting-review-results.csv
Description:Freshwater forecasting review results
Number of Records:964
Number of Columns:89

Table Structure
Object Name:freshwater-forecasting-review-results.csv
Size:3758658 bytes
Authentication:0a857e12c2742019aa4e8b3ddfa48bf9 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:Publication Type  
Authors  
Book Authors  
Book Editors  
Book Group Authors  
Author Full Names  
Book Author Full Names  
Group Authors  
Article Title  
Source Title  
Book Series Title  
Book Series Subtitle  
Language  
Document Type  
Conference Title  
Conference Date  
Conference Location  
Conference Sponsor  
Conference Host  
Author Keywords  
Keywords Plus  
Abstract  
Addresses  
Affiliations  
Reprint Addresses  
Email Addresses  
Researcher Ids  
ORCIDs  
Funding Orgs  
Funding Text  
Cited Reference Count  
Times Cited, WoS Core  
Times Cited, All Databases  
180 Day Usage Count  
Since 2013 Usage Count  
Publisher  
Publisher City  
Publisher Address  
ISSN  
eISSN  
ISBN  
Journal Abbreviation  
Journal ISO Abbreviation  
Publication Date  
Publication Year  
Volume  
Issue  
Part Number  
Supplement  
Special Issue  
Start Page  
End Page  
Article Number  
DOI  
Book DOI  
Early Access Date  
Number of Pages  
WoS Categories  
Web of Science Index  
Research Areas  
IDS Number  
UT (Unique WOS ID)  
Pubmed Id  
Open Access Designations  
Date of Export  
result_num  
title_screen  
freshwater  
forecast  
uncertainty_present  
nearterm  
model_approach  
hydrological  
ecosystem  
other_ecosystem  
phys_chem_bio  
forecast_vars  
min_horizon_days  
max_horizon_days  
skill_metrics  
model_comparison  
simple_null_model  
uncertainty_method  
iterative  
automated  
archived  
motivation  
end_user  
engagement  
Definition:reports whether a publication is a book, book series, journal article, or conference proceedingsauthors of paperauthors of book, if applicableeditors of book, if applicableorganization or institution credited with book authorshipfull names of authors of paperfull names of authors of book, if applicableorganization or institution credited with paper authorshiptitle of papertitle of paper source, most often the journal in which it was publishedtitle of book series, if applicablesubtitle of book series, if applicablelanguage of paper as originally publishedthe document type of a papertitle of conference at which paper was presented, if applicabledate of conference at which paper was presented, if applicablelocation of conference at which paper was presented, if applicablesponsor of conference at which paper was presented, if applicablehost of conference at which paper was presented, if applicablekeywords listed by the authorsindex terms generated from the title of the articlearticle abstractauthor addressesauthor affiliationsthe address of the corresponding authorauthor email addressesindividual author identifiers that are not ORCIDsauthor ORCIDsorganizations funding the work presented in the paperacknowledgements text from paper listing funding sourcesnumber of documents cited by the current papernumber of times paper was cited by products within the Web of Science databasenumber of times paper was cited across all databases accessible by Web of Sciencenumber of times paper was accessed in last 180 daysnumber of times paper was accessed since 2013publisher of papercity of paper publisheraddress of paper publisher8-digit International Standard Serial Number8-digit International Standard Serial Number for the electronic version of a record13-digit International Standard Book Number, if applicabletext abbreviation of journal name where record is publishedInternational Organization for Standardization abbreviation of journal name where record is publishedmonth and day (if available) of record publicationyear of record publicationjournal volume where paper is publishedjournal issue where paper is publishedjournal part or number where paper is published, if applicablesupplement number, if applicableindicates whether the record was part of a journal special issuepage number of first page of recordpage number of last page of recordarticle number within journal, if applicabledigitial object identifier for recorddigital object identifier for book, if applicabledate record was avaiable via early accesspage length of recordsubject categories for record assigned by Web of Scienceindex from which abstract was extractedsubject areas of researchunique identifier for journal issue where article was published, if applicablea record identifier unique to Web of ScienceID number for records of documents that have corresponding records on MEDLINEopen access designations assigned by Web of Science following https://clarivate.com/blog/a-researchers-complete-guide-to-open-access-papers/date record was retrieved from Web of Scienceunique identifier for freshwater forecasting analysisindicates whether paper passed title screen of freshwater forecasting analysisindicates whether research is regarding a freshwater ecosystemindicates whether a paper presents prediction of the futureindicates whether a paper specifies uncertainty of a future predictionindicates whether the minimum horizon of a future prediction is less than or equal to 10 yearsindicates the type of model used by the authors to make future predictionsindicates whether the only target variable for future predictions is hydrological, e.g., river flow, water levelwhat type of ecosystem is the research conducted inidentifies the focal ecosystem of the paper if it cannot be categorized according to the 'ecosystem' columnindicates whether the target forecast variable is physical, chemical, biological, or whether the paper includes multiple papers across these categorieslists the target forecast variablesthe minimum forecast horizon in daysthe maximum forecast horizon in dayslists the forecast skill assessment metrics used in the paper; see Table 2 of associated freshwater forecasting review publication for definition of abbreviationsindicates whether the paper compared the focal forecast model to another model (but not necessarily a null model)indicates whether the paper compared the focal forecast model to a null model, defined as a persistance, first-order autoregressive, or historical mean modelindicates the method of uncertainty specification used in the paperindicates whether or not the forecast is iterativeindicates whether or not the forecast workflow is identified as automated by the authors, where automated defined as being triggered without a human taking actionindicates whether forecast output is stated as being archived by the authorssummaries or direct quotations extracted from the paper characterizing the motivation for taking a forecasting approach to solve the research problemlist the end users of the forecast as stated by the authors, if applicablecharacterizes how end users were engaged according to the authors
Storage Type:string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
date  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
float  
float  
float  
float  
float  
string  
string  
string  
string  
string  
string  
string  
string  
date  
date  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
date  
float  
string  
string  
string  
string  
string  
string  
string  
date  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
float  
float  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
Measurement Type:nominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominaldateTimenominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalratioratioratioratiorationominalnominalnominalnominalnominalnominalnominalnominaldateTimedateTimenominalnominalnominalnominalnominalnominalnominalnominalnominalnominaldateTimerationominalnominalnominalnominalnominalnominalnominaldateTimenominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalratiorationominalnominalnominalnominalnominalnominalnominalnominalnominalnominal
Measurement Values Domain:
Allowed Values and Definitions
Enumerated Domain 
Code Definition
CodeB
Definitionbook
Source
Code Definition
CodeC
Definitionconference proceedings
Source
Code Definition
CodeJ
Definitionjournal
Source
Code Definition
CodeS
Definitionseries
Source
Definitionauthors of paper
Definitionauthors of book, if applicable
Definitioneditors of book, if applicable
Definitionorganization or institution credited with book authorship
Definitionfull names of authors of paper
Definitionfull names of authors of book, if applicable
Definitionorganization or institution credited with paper authorship
Definitiontitle of paper
Definitiontitle of paper source, most often the journal in which it was published
Definitiontitle of book series, if applicable
Definitionsubtitle of book series, if applicable
Definitionlanguage of paper as originally published
Definitionthe document type of a paper
Definitiontitle of conference at which paper was presented, if applicable
FormatMMM DD-DD YYYY
Precision
Definitionlocation of conference at which paper was presented, if applicable
Definitionsponsor of conference at which paper was presented, if applicable
Definitionhost of conference at which paper was presented, if applicable
Definitionkeywords listed by the authors
Definitionindex terms generated from the title of the article
Definitionarticle abstract
Definitionauthor addresses
Definitionauthor affiliations
Definitionthe address of the corresponding author
Definitionauthor email addresses
Definitionindividual author identifiers that are not ORCIDs
Definitionauthor ORCIDs
Definitionorganizations funding the work presented in the paper
Definitionacknowledgements text from paper listing funding sources
Unitnumber
Typewhole
Min
Max729 
Unitnumber
Typewhole
Min
Max286 
Unitnumber
Typewhole
Min
Max291 
Unitnumber
Typewhole
Min
Max57 
Unitnumber
Typewhole
Min
Max452 
Definitionpublisher of paper
Definitioncity of paper publisher
Definitionaddress of paper publisher
Definition8-digit International Standard Serial Number
Definition8-digit International Standard Serial Number for the electronic version of a record
Definition13-digit International Standard Book Number, if applicable
Definitiontext abbreviation of journal name where record is published
DefinitionInternational Organization for Standardization abbreviation of journal name where record is published
FormatMMM DD
Precision
FormatYYYY
Precision
Definitionjournal volume where paper is published
Definitionjournal issue where paper is published
Definitionjournal part or number where paper is published, if applicable
Definitionsupplement number, if applicable
Allowed Values and Definitions
Enumerated Domain 
Code Definition
CodeSI
Definitionpaper was part of a journal special issue
Source
Definitionpage number of first page of record
Definitionpage number of last page of record
Definitionarticle number within journal, if applicable
Definitiondigitial object identifier for record
Definitiondigital object identifier for book, if applicable
FormatMMM YYYY
Precision
Unitnumber
Typenatural
Min
Max48 
Definitionsubject categories for record assigned by Web of Science
Definitionindex from which abstract was extracted
Definitionsubject areas of research
Definitionunique identifier for journal issue where article was published, if applicable
Definitiona record identifier unique to Web of Science
DefinitionID number for records of documents that have corresponding records on MEDLINE
Definitionopen access designations assigned by Web of Science following https://clarivate.com/blog/a-researchers-complete-guide-to-open-access-papers/
Formatyyyy-mm-dd
Precision
Definitionunique identifier for freshwater forecasting analysis
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Coden
Definitionpaper did not pass title screen
Source
Code Definition
Codey
Definitionpaper passed title screen
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codeno
Definitionpaper does not present freshwater research
Source
Code Definition
Codeyes; groundwater
Definitionpaper presents freshwater research about groundwater
Source
Code Definition
Codeyes; surface water
Definitionpaper present freshwater research about surface water
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codecan't tell
Definitionwhether the paper presents a prediction into the future was not determined because the paper was not available in English
Source
Code Definition
Codeforecast
Definitionpaper presents a prediction into the future
Source
Code Definition
Codehindcast
Definitionpaper presents a prediction into the future from the perspective of the model for a time period that is now past
Source
Code Definition
Codenone of the above
Definitionpaper does not present a prediction into the future
Source
Code Definition
Codenowcast
Definitionpaper presents a prediction of current conditions at a spatial point or region for which data is not available
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codecan't tell
Definitionwhether the forecast specifies uncertainty was not determined because the paper was not available in English
Source
Code Definition
Codeno
Definitionthe forecast does not specify uncertainty
Source
Code Definition
Codeyes
Definitionthe forecast specifies uncertainty
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codecan't tell
Definitionwhether the minimum forecast horizon is less than or equal to 10 years was not determined because the paper was not available in English
Source
Code Definition
Codeno
Definitionthe minimum forecast is greater than 10 years
Source
Code Definition
Codeyes
Definitionthe minimum forecast horizon is less than or equal to 10 years
Source
Definitionindicates the type of model used by the authors to make future predictions
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codeno
Definitionpaper includes target variables that are not hydrological
Source
Code Definition
Codeyes
Definitionpaper includes only hydrological target variables (e.g., streamflow, water level)
Source
Definitionwhat type of ecosystem is the research conducted in
Definitionidentifies the focal ecosystem of the paper if it cannot be categorized according to the 'ecosystem' column
Definitionindicates whether the target forecast variable is physical, chemical, biological, or whether the paper includes multiple papers across these categories
Definitionlists the target forecast variables
UnitnominalDay
Typereal
Min
Max3650 
UnitnominalDay
Typereal
Min0.17 
Max20075 
Definitionlists the forecast skill assessment metrics used in the paper; see Table 2 of associated freshwater forecasting review publication for definition of abbreviations
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codeno
Definitionthe focal forecast model is not compared to other models
Source
Code Definition
Codeyes
Definitionthe focal forecast model is compared to other models
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codeno
Definitionthe focal forecast model was not compared to a simple null model, defined as a persistance, historical mean, or first order autoregressive model
Source
Code Definition
Codeyes
Definitionthe focal forecast model was compared to a simple null model, defined as a persistance, historical mean, or first order autoregressive model
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codeassimilates
DefinitionThe model iteratively updates this term through data assimilation. An example would be using a formal variational (e.g. 4DVar) or ensemble (EnKF, PF) data assimilation approach. For simpler models, this would also include iteratively refitting the whole model in light of new data.
Source
Code Definition
Codedata_driven
DefinitionThe model contains this concept and the inputs are data driven but uncertainty in this input is not explicitly propagated into predictions (e.g. calibrated model parameters, a single time series of observed meteorological driver data). For sake of internal consistency, quantitative forecasts of other variables that are used as inputs into ecological forecasts (e.g. weather forecasts) should be treated as data.
Source
Code Definition
Codepresent
DefinitionThe model contains this concept (e.g. the model has parameters), but the values used are not derived from data and no uncertainty is represented (e.g. spin-up initial conditions, drivers are scenarios, hand-tuned single-value parameters)
Source
Code Definition
Codepropagates
DefinitionThe model propagates uncertainty about this term into forecasts. The most common example of this is a model run multiple times (i.e. ensemble) that samples the distributions of parameters, initial conditions, or drivers. Alternatively, one might be using an analytical approach to estimate how input uncertainties for a specific term translates into output uncertainties.
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codeno
Definitionforecast is not iteratively produced
Source
Code Definition
Codeyes
Definitionforecast is iteratively produced
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codeno
Definitionforecast workflow not automated
Source
Code Definition
Codeyes
Definitionforecast workflow automated
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codeno/don't know
Definitionforecast output archiving not reported by authors
Source
Code Definition
Codeyes
Definitionforecast output archiving reported by authors
Source
Definitionsummaries or direct quotations extracted from the paper characterizing the motivation for taking a forecasting approach to solve the research problem
Definitionlist the end users of the forecast as stated by the authors, if applicable
Definitioncharacterizes how end users were engaged according to the authors
Missing Value Code:                                                                                                                                                                                  
Accuracy Report:                                                                                                                                                                                  
Accuracy Assessment:                                                                                                                                                                                  
Coverage:                                                                                                                                                                                  
Methods:                                                                                                                                                                                  

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/960/2/9cf7d6f16d5714f9252829717648bfd3
Name:Fig4_data.csv
Description:Data table to creat Fig. 4
Number of Records:83
Number of Columns:4

Table Structure
Object Name:Fig4_data.csv
Size:10815 bytes
Authentication:e31e2fa0b317bc939c33def2213c1754 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:result_number  
article_title  
target_variables  
title_screen  
Definition:unique identifier for papers retrieved in literature searchtitle of articletarget forecast variablesy or n depending on whether paper passed title screen for relevance
Storage Type:float  
string  
string  
string  
Measurement Type:rationominalnominalnominal
Measurement Values Domain:
Unitnumber
Typenatural
Min
Max826 
Definitiontitle of article
Definitiontarget forecast variables
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codey
Definitionpaper passed title screen for relevancy
Source
Missing Value Code:      
CodeNA
Explpaper title is not available
Accuracy Report:        
Accuracy Assessment:        
Coverage:        
Methods:        

Non-Categorized Data Resource

Name:EDI_data_QAQC_and_formatting.R
Entity Type:unknown
Description:data aggregation and quality control script
Physical Structure Description:
Object Name:EDI_data_QAQC_and_formatting.R
Size:8197 bytes
Authentication:75c303dc42a68e084ee7f4b556a2ac6f Calculated By MD5
Externally Defined Format:
Format Name:text/plain
Data:https://pasta-s.lternet.edu/package/data/eml/edi/960/2/36973b68abdfd394beb7384767601b45

Data Package Usage Rights

This information is released under the Creative Commons license - Attribution - CC BY (https://creativecommons.org/licenses/by/4.0/). The consumer of these data ("Data User" herein) is required to cite it appropriately in any publication that results from its use. The Data User should realize that these data may be actively used by others for ongoing research and that coordination may be necessary to prevent duplicate publication. The Data User is urged to contact the authors of these data if any questions about methodology or results occur. Where appropriate, the Data User is encouraged to consider collaboration or co-authorship with the authors. The Data User should realize that misinterpretation of data may occur if used out of context of the original study. While substantial efforts are made to ensure the accuracy of data and associated documentation, complete accuracy of data sets cannot be guaranteed. All data are made available "as is." The Data User should be aware, however, that data are updated periodically and it is the responsibility of the Data User to check for new versions of the data. The data authors and the repository where these data were obtained shall not be liable for damages resulting from any use or misinterpretation of the data. Thank you.

Keywords

By Thesaurus:
(No thesaurus)data assimilation, decision support, ecological forecasting, forecast, forecast automation, forecast end user, forecast model, forecast skill, hindcast, hydrological forecasting, near-term iterative forecasting cycle, null model, uncertainty propagation, uncertainty partitioning, water quantity
LTER Controlled Vocabularyfreshwater, water quality

Methods and Protocols

These methods, instrumentation and/or protocols apply to all data in this dataset:

Methods and protocols used in the collection of this data package
Description:

We conducted a state-of-the-art literature review (sensu Grant and Booth 2009) of freshwater forecasting over the past five years to assess the state of the field, recent progress, and ongoing challenges. First, we conducted a search using the Web of Science Core Collection database. Next we conducted a title screen, followed by an initial full-text screen, during which we assessed whether the paper presented a near-term freshwater quality forecast. We then completed in-depth analysis of each paper that passed the initial screen using a standardized matrix. Finally, we analyzed the tabular data from our matrix-based paper analysis to assess the state of near-term freshwater quality forecasting and identify areas of recent progress and ongoing challenges. Each step of the literature review process is documented in detail below.

We built our search around four concepts: forecasting, freshwater, possible freshwater forecast target variables (e.g., streamflow, harmful algal blooms), and a combined global change/resource management concept. The final search string required the title to contain a word relating to the forecasting concept and for either the title or the abstract to contain a word or phrase relating to each of the four concepts. After several trial searches, we subsequently removed "predict*" and "project*" from the forecasting concept for the abstract search only, as we found this resulted in retrieval of a large proportion of modeling studies that did not address forecasting. Our search period extended from 1 January 2017 to 17 February 2022, representing the past five years of peer-reviewed research, which is a typical approach for state-of-art reviews (Grant and Booth 2009). Together, these requirements resulted in the following final search string, with the final search conducted on 17 February 2022 yielding 963 results:

Title must include: (forecast* OR hindcast* OR predict* OR project*)

Title or abstract must include: (freshwater OR hydrology OR hydrodynamics OR aquatic OR stream* OR river OR lake OR reservoir OR groundwater) AND (forecast* OR hindcast*) AND (fish OR algae OR phytoplankton OR zooplankton OR plankton OR nitrate OR ammoni* OR nitrogen OR phosphate OR phosphorus OR "dissolved gas" OR "dissolved gasses" OR "dissolved gases" OR "carbon dioxide" OR methane OR nutrient* OR temperature OR communit* OR biodiversity OR flow OR streamflow OR "water quality" OR flood OR hydrology OR hydrodynamics OR "algal bloom" OR "dead zone" OR "dissolved oxygen" OR salmon OR "benthic macroinvertebrate" OR "benthic macroinvertebrates" OR toxin OR cyanobacteria* OR chem* OR biogeochem* OR flux*) AND (("global change" OR "climate change" OR climate OR "global warming" OR "global cooling" OR "carbon cycle" OR "carbon cycling" OR "greenhouse gas" OR "greenhouse gasses" OR "greenhouse gases" OR hypoxia OR brownification OR "invasive species" OR "land use" OR "nutrient pollution" OR microplastics OR biodiversity OR "emerging diseases" OR antibiotics OR salinization OR eutrophication OR anthrop*) OR ("resource manager" OR "resource management" OR "freshwater resource" OR "freshwater resources" OR "ecosystem service" OR "ecosystem services" OR "water treatment" OR "drinking water" OR "water supply" OR "lake manager" OR "lake management" OR "river management" OR "river manager" OR "water manager" OR "water management" OR "end user" OR "end-user" OR "decision-making" OR "decision support" OR conservation OR "water policy" OR policymaker* OR "water professional" OR "water professionals" OR "water resource" OR "water resources" OR stakeholder* OR research*))

Second, we screened paper titles and text for relevance and basic information regarding forecasts. The title screen was conducted by M.E.L. and resulted in elimination of 250 papers, leaving 713 papers for the initial full-text screen. Examples of papers eliminated during the title screen include papers forecasting vehicular traffic flow and papers forecasting atmospheric rivers, which are a meteorological phenomenon. The initial screen was also conducted by M.E.L., with 231 (~ one third) abstracts double-screened by D.W.H., C.C.C., and R.Q.T. to ensure agreement amongst co-authors regarding interpretation of the initial screen criteria. The initial screen was conducted using a standardized questionnaire comprising the following questions:

1. Is the study ecosystem an inland waterbody (salty lakes, lagoons, swamps, wetlands are permissible, coastal oceans and estuaries are not permissible)? For studies forecasting runoff or drought/flood risk, there must be some representation of an inland waterbody in the modeling approach.

2. Are the only focal variables some combination of streamflow, inflow, or stream or river discharge, water level or flood risk (i.e., water quantity)?

3. Is the study presenting a forecast, nowcast, or hindcast (defined as a prediction of future conditions from the perspective of the model)?

4. If the study is a forecast, nowcast, or hindcast, is uncertainty specified?

5. If the study is a forecast, nowcast, or hindcast, what modeling approach is used (see Table 2 for list of modeling approaches)?

6. If the study is a forecast, nowcast, or hindcast, is the forecast/hindcast/nowcast near-term, defined as having a minimum forecast horizon _ 10 yr?

Following the initial screen, we conducted in-depth analysis of all identified near-term freshwater quality forecasting papers (n = 16) using a standardized matrix. Each paper was double-screened by M.E.L. and D.W.H., and any discrepancies regarding entries to the matrix were resolved through discussion.

Questions included in standardized matrix analysis of near-term freshwater quality forecasting papers:

Forecast variables, scales, models, and skill:

What is the forecast ecosystem? Use the term the authors use in the paper. Is the forecast targeting a physical, chemical, or biological variable, or some combination of the three?

List the target forecast variable(s), separated by commas (e.g., DOC concentration, streamflow).

What is the minimum forecast horizon in days?

What is the maximum forecast horizon in days?

List the forecast skill metric(s) used, separated by commas (e.g., R2, RMSE); leave blank if forecast not assessed.

Does the paper include a multi-model (2 or more models) comparison?

Does the paper include a simple null model, defined as either a persistence model, the historical mean (climatology), or a first-order autoregressive model?

How is uncertainty incorporated? Options are following (Dietze et al. 2021): present = The model contains this concept (e.g. the model has parameters), but the values used are not derived from data and no uncertainty is represented (e.g. spin-up initial conditions, drivers are scenarios, hand-tuned single-value parameters); data_driven = The model contains this concept and the inputs are data driven but uncertainty in this input is not explicitly propagated into predictions (e.g. calibrated model parameters, a single time series of observed meteorological driver data). For sake of internal consistency, quantitative forecasts of other variables that are used as inputs into ecological forecasts (e.g. weather forecasts) should be treated as data.; propagates = The model propagates uncertainty about this term into forecasts. The most common example of this is a model run multiple times (i.e. ensemble) that samples the distributions of parameters, initial conditions, or drivers. Alternatively, one might be using an analytical approach to estimate how input uncertainties for a specific term translates into output uncertainties.; assimilates = The model iteratively updates this term through data assimilation. An example would be using a formal variational (e.g. 4DVar) or ensemble (EnKF, PF) data assimilation approach. For simpler models, this would also include iteratively refitting the whole model in light of new data.

Forecast infrastructure and workflows:

Is the forecast iterative, defined as regularly updated and re-issued when new data become available?

Is the forecast described by the authors as automated, meaning it can be reissued without manual intervention by a human?

Is the forecast archived? Select yes if the archiving is noted in the text, otherwise select no/don't know.

Human dimensions of forecasts:

What is the stated motivation for forecast development? Be brief; copy-pasting in quotations is fine but indicate this using quotation marks (" "); leave blank if not stated.

Who is the stated end user? Spell out acronyms; leave blank if there isn't one.

How were end users/stakeholders engaged in development? Be brief; leave blank if not applicable.

Finally, we analyzed our tabular data from both the initial screen of freshwater forecasts and matrix analysis of near-term freshwater quality forecasts to assess the state of the field of freshwater forecasting as well recent progress and ongoing opportunities following our focal questions. All analysis code is available on Github (https://github.com/melofton/freshwater-forecasting-review) and published with a DOI on Zenodo [placeholder for Zenodo DOI].

References:

Dietze, Michael, R. Q. Thomas, J. Peters, and C. Boettiger. 2021. "A Community Convention for Ecological Forecasting: Output Files and Metadata." EcoEvoRxiv Preprints. 2021. https://ecoevorxiv.org/9dgtq/.

Grant, Maria J., and Andrew Booth. 2009. "A Typology of Reviews: An Analysis of 14 Review Types and Associated Methodologies." Health Information and Libraries Journal 26 (2): 91-108.

People and Organizations

Publishers:
Organization:Environmental Data Initiative
Email Address:
info@edirepository.org
Web Address:
https://edirepository.org
Id:https://ror.org/0330j0z60
Creators:
Individual: Mary E. Lofton
Organization:Virginia Tech
Email Address:
melofton@vt.edu
Id:https://orcid.org/0000-0003-3270-1330
Individual: Dexter W. Howard
Organization:Virginia Tech
Email Address:
dwh1998@vt.edu
Id:https://orcid.org/0000-0002-6118-2149
Individual: R. Quinn Thomas
Organization:Virginia Tech
Email Address:
rqthomas@vt.edu
Individual: Cayelan C. Carey
Organization:Virginia Tech
Email Address:
cayelan@vt.edu
Id:https://orcid.org/0000-0001-8835-4476
Contacts:
Individual: Cayelan C. Carey
Organization:Virginia Tech
Email Address:
cayelan@vt.edu
Id:https://orcid.org/0000-0001-8835-4476

Temporal, Geographic and Taxonomic Coverage

Temporal, Geographic and/or Taxonomic information that applies to all data in this dataset:

Time Period
Begin:
2017-01-01
End:
2022-02-17
Geographic Region:
Description:area represented in literature review
Bounding Coordinates:
Northern:  70.0Southern:  -70.0
Western:  -160.0Eastern:  159.0

Project

Parent Project Information:

Title:MSA: Macrosystems EDDIE: An undergraduate training program in macrosystems science and ecological forecasting
Personnel:
Individual: Cayelan C. Carey
Organization:Virginia Tech
Email Address:
cayelan@vt.edu
Id:https://orcid.org/0000-0001-8835-4476
Role:Principal Investigator
Funding: National Science Foundation 1926050
Related Project:
Title:Collaborative Research: CIBR: Cyberinfrastructure Enabling End-to-End Workflows for Aquatic Ecosystem Forecasting
Personnel:
Individual: Cayelan C. Carey
Organization:Virginia Tech
Email Address:
cayelan@vt.edu
Id:https://orcid.org/0000-0001-8835-4476
Role:Principal Investigator
Funding: National Science Foundation 1933016
Related Project:
Title:NEON RCN: The Ecological Forecasting Initiative RCN: Using NEON-enabled near-term forecasting to synthesize our understanding of predictability across ecological systems and scales
Personnel:
Individual: R. Quinn Thomas
Organization:Virginia Tech
Email Address:
rqthomas@vt.edu
Role:Principal Investigator
Funding: National Science Foundation 1926388

Maintenance

Maintenance:
Description:complete
Frequency:
Other Metadata

Additional Metadata

additionalMetadata
        |___text '\n    '
        |___element 'metadata'
        |     |___text '\n      '
        |     |___element 'emlEditor'
        |     |     |___text '\n        '
        |     |     |___element 'app'
        |     |     |     |___text 'EMLassemblyline'
        |     |     |___text '\n        '
        |     |     |___element 'release'
        |     |     |     |___text '3.5.4'
        |     |     |___text '\n      '
        |     |___text '\n    '
        |___text '\n  '

EDI is a collaboration between the University of New Mexico and the University of Wisconsin – Madison, Center for Limnology:

UNM logo UW-M logo