Definition: | Paper title | Digital object identifier (doi) from Web of Science | Author list | Year of publication | Journal or conference in which the paper was published | Forecast spatial scale, classified into five categories | Geographic coordinates of the forecast site using decimal degrees. For papers with multiple locations, locations are separated using a semicolon. Locations for regional and national forecasts are approximately the center of the forecast area | Forecast ecosystem: forest, grassland, freshwater, marine, desert, tundra, atmosphere, agricultural, urban, global, other | Forecast class: biogeochemical or organismal (which includes population or community) | Identity of forecast variables | Model dimension: 0D, 1D, 2D, 3D | Model type: empirical (dependent on correlative or statistical relationships) or process-based (explicitly simulating ecological processes). For forecasting workflows that involve a pipeline of multiple models, this refers to the final model that forecasts the forecast variable of interest | If specified: more detailed description of model: for example, Bayesian hierarchical, machine learning, named model (e.g., PROTECH), etc. | Are meteorological covariates used in this forecast? 1 = yes, 0 = no | Are physical covariates (e.g., streamflow) used in this forecast? 1 = yes, 0 = no | Are biological covariates used in this forecast? 1 = yes, 0 = no | Are chemical covariates used in this forecast? 1 = yes, 0 = no | Does the paper include an ensemble forecast (ensemble within model)? 1 = yes, 0 = no | Number of ensemble members | Does the paper use an ensemble of models to produce one output? 1 = yes, 0 = no | How many models in the ensemble model | Are multiple models with different model structures compared (NOT including null models)? 1 = yes, 0 = no | How many models with different structures are compared? | Was a forecast null model (persistence or climatology) included? 1 = yes, 0 = no | How many null models? | What type of null model (climatology or persistence)? | Maximum time into the future that the forecast predicts in this paper, described in days | Time step of forecast output. For example, a forecast that gives predictions for the next 16 days but was only run once a week would have a time step of one day (not one week) | Are the forecasts described in the papers iterative (i.e., data updating forecasts iteratively)? Any form of iteration counts here: updating initial conditions with new data, refitting the model to incorporate new dta, updating parameter values, etc. State updating via the autoregressive term counts as data assimilation for autoregressive models | What technique of data assimilation was used? For example, KF, enKF, refit, update IC, etc. | Extent to which uncertainty is included in the forecast, classified within 5 categories: no (this model does not contain uncertainty), contains (the model contains uncertainty, but uncertainty is not derived from data; e.g. uncertainty comes from spin-up initial conditions or hand-tuned parameters), data_driven (the model contains data-driven uncertainty; e.g. uncertainty in meteorological drivers), propagates (the model propagates some source of uncertainty), assimilates (the model iteratively updates uncertainty through data assimilation). NOTE: this is assumed to be a hierarchy (e.g. if the forecast contains data driven uncertainty and propagates that uncertainty, it would be marked propagates) | What sources of uncertainty were incorporated? | Was observation uncertainty included? 1 = yes, 0 = no | Are at least two different sources of uncertainty quantified and compared? 1 = yes, 0 = no. NOTE: the two sources may be in the same category of uncertainty (e.g. two forms of driver data) | Initial condition uncertainty partitioned? 1 = yes, 0 = no | Driver uncertainty partitioned? 1 = yes, 0 = no | Parameter uncertainty partitioned? 1 = yes, 0 = no | Process uncertainty partitioned? 1 = yes, 0 = no | Other partitioned sources of uncertainty? 1 = yes, 0 = no | If at least two categories of uncertainty were partitioned, what was the dominant source of uncertainty? | If the dominant source varies by forecast horizon, season, etc. please describe here | Paper states that forecast was evaluated? 1 = yes, 0 = no | Forecast evaluation results reported in paper? 1 = yes, 0 = no | List all skill metrics used (e.g. R2, RMSE, bias, MAE). SD and Bayesian credible intervals are not skill metrics | Is forecast performance assessed at multiple forecast horizons (results must be reported in paper/supplemental info)? 1 = yes, 0 = no | Maximum forecast horizon such that the forecast was better than the null model (out of any models used) | Temporal coverage of data used to create this forecasting paper | Was new data (driver and/or observations) available to the model in real time (<24 hours from collection) without any manual effort when the system was working as intended? 1 = yes, 0 = no | Forecast archiving described in text? 1 = yes, 0 = no | Repository in which forecasts are archived | Archiving website is still accessible via the link in the paper as of 14 Jun 2021? 1 = yes, 0 = no | Text specifies that driver data are publicly available to reproduce the forecasts? 1 = yes, 0 = no | Specific end user identified (proper noun)? 1 = yes, 0 = no | Partnership with the end user in forecast development mentioned in paper? 1 = yes, 0 = no | Forecast being used by the end user according to paper? 1 = yes, 0 = no | Forecast delivery method identified? 1 = yes, 0 = no | Forecast delivery method? | Any ethical considerations mentioned? 1 = yes, 0 = no |
Measurement Values Domain: | | Definition | Digital object identifier (doi) from Web of Science |
| | Unit | nominalYear | Type | natural | Min | 1932 | Max | 2020 |
| Definition | Journal or conference in which the paper was published |
| Allowed Values and DefinitionsEnumerated Domain | | Code Definition | Code | global | Definition | e.g. coral bleaching stress in world oceans | Source | |
| Code Definition | Code | multipoint | Definition | several distinct forecast locations, such as three different lakes | Source | |
| Code Definition | Code | national | Definition | spanning all of one nation, such as nationwide production of an agricultural crop | Source | |
| Code Definition | Code | point | Definition | localized to one discrete site, such as pollen forecasts for a city or algal forecasts for a lake | Source | |
| Code Definition | Code | regional | Definition | localized to a broad geographic region, such as coral bleaching forecasts that span a sea | Source | |
|
|
| Definition | Geographic coordinates of the forecast site using decimal degrees. For papers with multiple locations, locations are separated using a semicolon. Locations for regional and national forecasts are approximately the center of the forecast area |
| Allowed Values and DefinitionsEnumerated Domain | | Code Definition | Code | agricultural | Definition | Agricultural forecast ecosystem | Source | |
| Code Definition | Code | atmosphere | Definition | Atmospheric forecast ecosystem | Source | |
| Code Definition | Code | desert | Definition | Desert forecast ecosystem | Source | |
| Code Definition | Code | forest | Definition | Forest forecast ecosystem | Source | |
| Code Definition | Code | freshwater | Definition | Freshwater forecast ecosystem | Source | |
| Code Definition | Code | grassland | Definition | Grassland forecast ecosystem | Source | |
| Code Definition | Code | marine | Definition | Marine forecast ecosystem | Source | |
| Code Definition | Code | other | Definition | Forecast ecosystem does not fit within the other predefined ecosystem types (e.g. a forecast for bird migration across North America) | Source | |
| Code Definition | Code | tundra | Definition | Tundra forecast ecosystem | Source | |
| Code Definition | Code | urban | Definition | Urban forecast ecosystem | Source | |
|
|
| Allowed Values and DefinitionsEnumerated Domain | | Code Definition | Code | biogeochemical | Definition | Forecast variable is biogeochemical, but not organismal | Source | |
| Code Definition | Code | both | Definition | both organismal and biogeochemical forecasts presented in text | Source | |
| Code Definition | Code | organismal | Definition | forecast varaible relates to a population or community of organisms | Source | |
|
|
| Definition | Identity of forecast variables |
| Allowed Values and DefinitionsEnumerated Domain | | Code Definition | Code | 0D | Definition | Zero dimensional model | Source | |
| Code Definition | Code | 1D | Definition | One dimentsional model | Source | |
| Code Definition | Code | 2D | Definition | Two dimensional model | Source | |
| Code Definition | Code | 3D | Definition | Three dimensional model | Source | |
|
|
| Allowed Values and DefinitionsEnumerated Domain | | Code Definition | Code | both | Definition | Both empirical and process-based models are used | Source | |
| Code Definition | Code | empirical | Definition | Final model used to generate forecasts is empirical | Source | |
| Code Definition | Code | process-based | Definition | Final model used to generate forecasts is process-based | Source | |
|
|
| Definition | If specified: more detailed description of model: for example, Bayesian hierarchical, machine learning, named model (e.g., PROTECH), etc. |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Definition | Number of ensemble members |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | natural | Min | 1 | Max | 10 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 49 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 2 |
| Definition | What type of null model (climatology or persistence)? |
| Definition | Maximum time into the future that the forecast predicts in this paper, described in days |
| Definition | Time step of forecast output. For example, a forecast that gives predictions for the next 16 days but was only run once a week would have a time step of one day (not one week) |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Definition | What technique of data assimilation was used? For example, KF, enKF, refit, update IC, etc. |
| Allowed Values and DefinitionsEnumerated Domain | | Code Definition | Code | assimilates | Definition | the model iteratively updates uncertainty through data assimilation | Source | |
| Code Definition | Code | contains | Definition | the model contains uncertainty, but uncertainty is not derived from data; e.g. uncertainty comes from spin-up initial conditions or hand-tuned parameters | Source | |
| Code Definition | Code | data_driven | Definition | the model contains data-driven uncertainty; e.g. uncertainty in meteorological drivers | Source | |
| Code Definition | Code | no | Definition | this model does not contain uncertainty | Source | |
| Code Definition | Code | propagates | Definition | the model propagates some source of uncertainty | Source | |
|
|
| Definition | What sources of uncertainty were incorporated? |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 0 |
| Definition | If at least two categories of uncertainty were partitioned, what was the dominant source of uncertainty? |
| Definition | If the dominant source varies by forecast horizon, season, etc. please describe here |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Definition | List all skill metrics used (e.g. R2, RMSE, bias, MAE). SD and Bayesian credible intervals are not skill metrics |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | nominalDay | Type | whole | Min | 0 | Max | 1865 |
| Unit | nominalDay | Type | natural | Min | 17 | Max | 52925 |
| Allowed Values and DefinitionsEnumerated Domain | | Code Definition | Code | At least one data stream | Definition | At least one stream of data used to make forecasts is available to the model within 24 hours when the system is working as intended | Source | |
| Code Definition | Code | No data streams | Definition | No data streams are available to the model within 24 hours even when the system is working as intended | Source | |
| Code Definition | Code | UNK | Definition | unknown (not specified in text) | Source | |
| Code Definition | Code | Yes all data streams | Definition | All data streams used to make forecasts are available to the model within 24 hours when the system is working as intended | Source | |
|
|
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Definition | Repository in which forecasts are archived |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
| Definition | Forecast delivery method? |
| Unit | dimensionless | Type | whole | Min | 0 | Max | 1 |
|