Data Package Metadata   View Summary

The influence of environmental factors on the distribution and density of invasive Centaurea stoebe across Northeastern USA, 2013 - 2018

General Information
Data Package:
Local Identifier:edi.584.1
Title:The influence of environmental factors on the distribution and density of invasive Centaurea stoebe across Northeastern USA, 2013 - 2018
Alternate Identifier:DOI PLACE HOLDER
Abstract:

Centaurea stoebe (Asteraceae; spotted knapweed) is an emerging invader in northeast US, and is a major invasive plant in the northern Midwest and western USA. Although it has been present in New York State (NYS) for over 100 years, its apparent recent population increases and spread provide a rare opportunity to study a plant in the early stages of invasion. Therefore, a study was carried out understand how distinct environmental factors influence the distribution, density and change in density C. stoebe at different spatial scales within its novel range in the northeastern USA. First, we collected field data on the occurrence, density and change in density of this species in North Eastern United States, from 2013 to 2014. Then, using species distribution models, we assessed the potential influence of environmental factors on the invasion of spotted knapweed in northeast US. Within different parts of C. stoebe‘s range, different factors explained its occurrence, density and change in density over 2 years. Across northeast US, climate and soil factors were the most influential predictors explaining C. stoebe‘s distribution, while within Long Island in southeastern NYS and the Adirondack Mountains in northern NYS, precipitation and disturbance respectively were the most important. These results are published in the paper titled The influence of environmental factors on the distribution and density of invasive Centaurea stoebe across Northeastern USA (Akin-Fajiye and Gurevitch, 2018).

Publication Date:2020-07-31

Time Period
Begin:
2013
End:
2018

People and Organizations
Contact:Gurevitch, Jessica (Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA) [  email ]
Creator:Akin-Fajiye, Morodoluwa (Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA)
Creator:Gurevitch, Jessica (Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA)
Associate:Rollinson, Emily (1) Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA, Field Work, Locating Field Sites)
Associate:Lowry, David Edward (1) Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA, Field Work, Locating Field Sites)
Associate:Rodriguez-Castañeda, Genoveva (Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA, Field Work, Locating Field Sites)
Associate:Waring, David (1) Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA, Field Work, Locating Field Sites, Data Analysis)

Data Entities
Data Table Name:
Centaurea_Pres.csv
Description:
Sites in the Northeastern United States where C. stoebe presence was reported or found by the authors
Data Table Name:
data.adk.csv
Description:
Environmental data for study sites in the Adirondacks
Data Table Name:
data.li.csv
Description:
Environmental data for study sites on Long Island
Data Table Name:
df.csv
Description:
Environmental data for Northeastern United States
Detailed Metadata

Data Entities


Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/584/1/fed8ce58e4efab8e9c5814e616a22b11
Name:Centaurea_Pres.csv
Description:Sites in the Northeastern United States where C. stoebe presence was reported or found by the authors
Number of Records:607
Number of Columns:8

Table Structure
Object Name:Centaurea_Pres.csv
Size:51268 bytes
Authentication:c1e691c62df29388740b31048b3360bb Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:id  
source  
Lat  
Lon  
STATE_NAME  
SUB_REGION  
STATE_ABBR  
COUNTY_NAME  
Definition:A unique ID for every site where C. stoebe was found or reportedSource of the data (that is, who reported the presence of C. stoebe at the given site) Acronyms used: GBIF - Global Biodiversity Information Facility, IPANE - the Invasive Plant Atlas of New England, SBU – data was collected by the authors, Morris Arboretum - Morris Arboretum (The acronyms ‘GBIF’ and ‘IPANE’ are usually followed by name of person/organization that collected the data)Latitude of the siteLongitude of the siteName of the state in which the given site liesSub region of the United States of America in which the given site liesStandard two letter abbreviation used for the state in which the site liesName of the county in which the site lies
Storage Type:string  
string  
float  
float  
string  
string  
string  
string  
Measurement Type:nominalnominalratiorationominalnominalnominalnominal
Measurement Values Domain:
DefinitionA unique ID for every site where C. stoebe was found or reported
DefinitionSource of the data (that is, who reported the presence of C. stoebe at the given site) Acronyms used: GBIF - Global Biodiversity Information Facility, IPANE - the Invasive Plant Atlas of New England, SBU – data was collected by the authors, Morris Arboretum - Morris Arboretum (The acronyms ‘GBIF’ and ‘IPANE’ are usually followed by name of person/organization that collected the data)
Unitdegree
Typereal
Min39.67777 
Max44.93482 
Unitdegree
Typereal
Min-80.3725 
Max-67.9247 
DefinitionName of the state in which the given site lies
DefinitionSub region of the United States of America in which the given site lies
DefinitionStandard two letter abbreviation used for the state in which the site lies
DefinitionName of the county in which the site lies
Missing Value Code:    
CodeNA
Explnot available
CodeNA
Explnot available
       
Accuracy Report:                
Accuracy Assessment:                
Coverage:                
Methods:                

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/584/1/3f40a738d8eab26df92d4b6c37da1402
Name:data.adk.csv
Description:Environmental data for study sites in the Adirondacks
Number of Records:924
Number of Columns:13

Table Structure
Object Name:data.adk.csv
Size:100904 bytes
Authentication:4bc96dd78118d05d604b1640494c3edf Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:site  
pres  
weight  
sand_statsgo  
sand_ssurgo  
soilph_statsgo  
soilph_ssurgo  
precip_gs  
mintemp_gs  
nlcd  
treecover_gfc  
lon  
lat  
Definition:A unique number for every Adirondacks site in the studyWhether C. stoebe was present or absent at a particular siteWhether the absences are true absence or pseudo-absenceSoil percent sand data from STATSGOSoil percent sand data from SSURGOSoil pH data from STATSGOSoil pH data from SSURGO30 year mean precipitation during the growing season (data from PRISM, resolution – 1 km)30 year mean minimum temperature during the growing season (data from PRISM, resolution – 1 km)Present day land cover (data from NLCD, resolution – 1 km)Percentage tree cover (data from global forest change)UTM Easting of the site (UTM zone 18)UTM Northing of the site (UTM zone 18)
Storage Type:string  
string  
string  
float  
float  
float  
float  
float  
float  
string  
float  
float  
float  
Measurement Type:nominalnominalnominalratioratioratioratioratiorationominalratioratioratio
Measurement Values Domain:
DefinitionA unique number for every Adirondacks site in the study
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Code1
Definitionpresent
Source
Code Definition
Code0
Definitionabsent
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Code1
Definitionpresence or true absence
Source
Code Definition
Code0.1
Definitionpseudo-absence
Source
Unitpercent
Typereal
Min14.836 
Max81.417 
Unitpercent
Typereal
Min
Max96.4 
Unitdimensionless
Typereal
Min4.665909091 
Max6.376 
Unitdimensionless
Typereal
Min4.327777778 
Max7.6 
Unitmillimeter
Typereal
Min79.80999756 
Max154.1883392 
Unitcelsius
Typereal
Min3.961666584 
Max10.70833397 
DefinitionPresent day land cover (data from NLCD, resolution – 1 km)
Unitpercent
Typewhole
Min
Max100 
Unitdimensionless
Typereal
Min456664.2247 
Max629224.2247 
Unitdimensionless
Typereal
Min4764457.599 
Max4965937.599 
Missing Value Code:      
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
 
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
Accuracy Report:                          
Accuracy Assessment:                          
Coverage:                          
Methods:                          

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/584/1/b9d319ffc4179f0c89e1b58d655a8ecb
Name:data.li.csv
Description:Environmental data for study sites on Long Island
Number of Records:770
Number of Columns:13

Table Structure
Object Name:data.li.csv
Size:87006 bytes
Authentication:ff8532e1756cd508ca6e818cc4658b29 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:site  
pres  
weight  
sand_statsgo  
sand_ssurgo  
soilph_statsgo  
soilph_ssurgo  
precip_gs  
mintemp_gs  
nlcd  
treecover_gfc  
lon  
lat  
Definition:A unique number for every Long Island site in the studyWhether C. stoebe was present or absent at the given siteWhether the absences are true absence or pseudo-absenceSoil percent sand data from STATSGOSoil percent sand data from SSURGOSoil pH data from STATSGOSoil pH data from SSURGO30 year mean precipitation during the growing season (data from PRISM, resolution – 1 km)30 year mean minimum temperature during the growing season (data from PRISM, resolution – 1 km)Present day land cover (data from NLCD, resolution – 1 km)Percentage tree cover (data from global forest change)UTM Easting of the site (UTM zone 18)UTM Northing of the site (UTM zone 18)
Storage Type:string  
string  
string  
float  
float  
float  
float  
float  
float  
string  
float  
float  
float  
Measurement Type:nominalnominalnominalratioratioratioratioratiorationominalratioratioratio
Measurement Values Domain:
DefinitionA unique number for every Long Island site in the study
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Code1
Definitionpresent
Source
Code Definition
Code0
Definitionabsent
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Code1
Definitionpresence or true absence
Source
Code Definition
Code0.1
Definitionpseudo-absence
Source
Unitpercent
Typereal
Min28.18787879 
Max97.86 
Unitpercent
Typereal
Min21.2 
Max94.1 
Unitdimensionless
Typereal
Min4.62826087 
Max6.273809524 
Unitdimensionless
Typereal
Min4.6 
Max6.5 
Unitmillimeter
Typereal
Min94.33333588 
Max113.7149963 
Unitcelsius
Typereal
Min12.47666645 
Max14.75666618 
DefinitionPresent day land cover (data from NLCD, resolution – 1 km)
Unitpercent
Typewhole
Min
Max100 
Unitdimensionless
Typereal
Min620524.2247 
Max761944.2247 
Unitdimensionless
Typereal
Min4495567.599 
Max4560937.599 
Missing Value Code:      
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
 
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
Accuracy Report:                          
Accuracy Assessment:                          
Coverage:                          
Methods:                          

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/584/1/fc18a619f8dc5643b0b06af2c19e19d1
Name:df.csv
Description:Environmental data for Northeastern United States
Number of Records:4496
Number of Columns:10

Table Structure
Object Name:df.csv
Size:466293 bytes
Authentication:ae8157f85899e36e19befdacbc7c74cb Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:site  
sand  
soilph  
prec4000  
mint4000  
land4000  
tree4000  
lon  
lat  
prediction  
Definition:A unique number for every Northeastern USA site in the studySoil percent sand (data from STATGO)Soil pH (data from STATGO)30 year mean precipitation during the growing season (data from PRISM, resolution – 4 km)30 year mean minimum temperature during the growing season (data from PRISM, resolution – 4 km)Present day land cover (data of NLCD, resolution – 4 km)Percentage tree cover (data from global forest change, resolution – 4 km)UTM Easting of the site (UTM zone 18)UTM Northing of the site (UTM zone 18)Predicted likelihood from occurrence of C. stoebe (based on the habitat suitability model)
Storage Type:string  
float  
float  
float  
float  
string  
float  
float  
float  
float  
Measurement Type:nominalratioratioratiorationominalratioratioratioratio
Measurement Values Domain:
DefinitionA unique number for every Northeastern USA site in the study
Unitpercent
Typereal
Min10.086 
Max97.86 
Unitdimensionless
Typereal
Min4.360465116 
Max6.819047619 
Unitmillimeter
Typereal
Min81.79071216 
Max175.347254 
Unitcelsius
Typereal
Min4.930807442 
Max15.88266176 
DefinitionPresent day land cover (data of NLCD, resolution – 4 km)
Unitpercent
Typewhole
Min
Max100 
Unitdimensionless
Typereal
Min34834.22468 
Max1062455.93 
Unitdimensionless
Typereal
Min4383247.599 
Max4986187.599 
Unitdimensionless
Typereal
Min0.022626678 
Max0.810500413 
Missing Value Code:  
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
 
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
CodeNA
Explnot available
Accuracy Report:                    
Accuracy Assessment:                    
Coverage:                    
Methods:                    

Data Package Usage Rights

This information is released under the Creative Commons license - Attribution - CC BY (https://creativecommons.org/licenses/by/4.0/). The consumer of these data (\"Data User\" herein) is required to cite it appropriately in any publication that results from its use. The Data User should realize that these data may be actively used by others for ongoing research and that coordination may be necessary to prevent duplicate publication. The Data User is urged to contact the authors of these data if any questions about methodology or results occur. Where appropriate, the Data User is encouraged to consider collaboration or co-authorship with the authors. The Data User should realize that misinterpretation of data may occur if used out of context of the original study. While substantial efforts are made to ensure the accuracy of data and associated documentation, complete accuracy of data sets cannot be guaranteed. All data are made available \"as is.\" The Data User should be aware, however, that data are updated periodically and it is the responsibility of the Data User to check for new versions of the data. The data authors and the repository where these data were obtained shall not be liable for damages resulting from any use or misinterpretation of the data. Thank you.

Keywords

By Thesaurus:
(No thesaurus)Invasive species, Spotted knapweed, Centaurea stoebe, Northeast United States, New York
LTER Controlled Vocabularydistribution, field methods

Methods and Protocols

These methods, instrumentation and/or protocols apply to all data in this dataset:

Methods and protocols used in the collection of this data package
Description:

C. stoebe data collection

We obtained C. stoebe presence data for northeastern USA (Pennsylvania, New York, New Jersey, Vermont, Maine, Connecticut, Massachusetts, New Hampshire and Rhode Island), from the Invasive Plant Atlas of New England (IPANE), Global Biodiversity Information Facility (GBIF), Morris Arboretum, and our own surveys (SBU) conducted on Long Island (Suffolk and Nassau counties) and in the Adirondacks (parts of Clinton, Essex, Fulton, Hamilton, St. Lawrence, Saratoga, Warren, Washington, Franklin, Lewis, Herkimer, and Franklin counties of New York state). We examined and corrected the downloaded data for errors such as missing negative signs in the longitude coordinate, or switched latitude and longitude coordinates. We removed all the data points with no coordinates for either longitude or latitude, and removed all the duplicates.

In summer 2013 and 2014, we located sites with C. stoebe on Long Island and in the Adirondacks by selecting 10 areas that were centered on populations of spotted knapweed identified from pilot studies. Within each area, we created a list of feasible roads to be examined within a 10 mile radius centered on known location of populations, and selected sites for sampling using the following criteria: we used sites if it appeared from Google Earth imagery that the road had open canopies (i.e., non-forested) edges to provide habitat for C. stoebe, and if it was safe to stop along the road. Along each road, we established transects at exactly every mile, until we either located five C. stoebe populations or until we had sampled 25 locations (noting absences after careful examination) without locating at least five populations. If we encountered absences, we noted the latitudinal and longitudinal coordinates, and recorded that spot. When C. stoebe was determined to be present, we recorded the coordinates, and sampled the site using either an intensive or extensive protocol (described below) depending on the site population. In all, we identified 135 sites on Long Island and 106 in the Adirondacks.

Environmental predictors

We selected environmental variables (Methods table 1) based on findings of previous distribution models of C. stoebe (Broennimann et al. 2007, 2014). We used a total of 6 environmental variables at a resolution of 4000 m for the large scale/coarser grain model and 1000 m for the small scale/finer grain models (see methods table 1 for resolutions). We used the 30-year average of both average precipitation and minimum temperature during the growing season (April–September) from PRISM (PRISM Climate Group) to account for the effect of seasonality. For the change in density models, we used the mean growing season minimum temperature and mean growing season precipitation between the years 2013 and 2014. We obtained data on soil pH and percent sandiness from both Soil Survey Geographic Database (SSURGO), and States Soil Geographic database (STATSGO). We used STATSGO for the Northeast models and SSURGO for the Long Island and Adirondack models. We obtained data on the tree cover from the Global Forest Change database of the University of Maryland (Hansen et al. 2013). Finally, we used the National Land Cover Database (NLCD) developed by the Multi Resolution Land Characteristics (MRLC) to capture the land cover of these areas modelled. The NLCD classifies land cover into 8 broad categories: water, developed, barren, forest, shrub land, herbaceous, planted/cultivated and wetlands (Jin et al. 2013), each of which is further divided into various sub-categories. For this study, we classified high intensity developed areas as defined by NLCD as highly disturbed, while medium intensity developed and low intensity developed areas were less disturbed.

Methods table 1: Environmental variables used as predictors for species distribution modelling (LI – Long Island, ADK – Adirondacks)

Environmental variables, Data source, Resolution, Scale

Soil percent sand, STATSGO, 4 km, Northeast USA

Soil pH, STATSGO, 4 km, Northeast USA

Soil percent sand, SSURGO, 1 km, LI, ADK

Soil pH, SSURGO, 1 km, LI, ADK

30 year mean precipitation during the growing season, PRISM climate group, 4 km, 1 km, Northeast USA, LI, ADK

30 year mean minimum temperature during the growing season, PRISM climate group, 4 km, 1 km, Northeast USA, LI, ADK

Present day land cover (categorical), National LAND COVER DATABASE, 4 km, 1 km, Northeast USA, LI, ADK

Tree cover percentage, Global forest change, 4 km, 1 km, Northeast USA, LI, ADK

Model construction

We used Boosted Regression Trees (BRTs) to model factors associated with species presence, density and change in density. BRTs are a data mining approach that combines algorithms for regression trees and boosting (Hastie et al. 2001). The boosting process involves an iterative stage-wise process of minimizing the deviance of the model in which at each stage the tree that maximally reduces the deviance is selected (Elith et al. 2008). The final model is determined by the application of a boosting technique to a large number of regression trees produced, in order to come to an optimal prediction. To fit the models, we used the dismo package (Elith et al. 2008) in the statistical software R (R Core Team 2015) to find the optimal parameters for each of the models. In building our BRT models, we manually adjusted three model parameters, the bag size, tree complexity and learning rate in order to maximize model performance. We explored the presence of interactions between variables and obtained partial dependence plots to visualize the effect of each variable. In order to reduce sampling bias due to oversampling, we thinned the presences to within 2000 m of one another for the small grain models and 4000 m for the large grain model.

For each of our distribution models, we selected pseudo-absences (simulated absences) by defining a 25 km buffer around presences from which pseudo-absences were sampled, in order to decrease the probability of sampling from unsuitable areas. Pseudo-absences are artificially generated absence data selected from the area within which the study is being conducted without on ground visits. Known absences however, are points that have been visited and have been verified to not have the species being studied. In total, after thinning, we had 486 C. stoebe presences across the northeastern United States: 59 of these were located on Long Island and 98 from the Adirondacks region. We also had 80 known absences, which did not cover the full range of the area being studied, they were therefore supplemented by pseudo-absences. For each model, we generated ten times the number of pseudo-absences as known presences. We assigned known presences and absences a weight of 1, and down-weighted the pseudo-absences, assigning each pseudo-absence a weight of 1/(total number of pseudo-absences). We built occurrence models for Northeastern USA, Long Island and the Adirondacks using the presences, absences and pseudo-absences. We divided the data equally into training and test datasets based on geographical location (50:50), we used half of the data to train the models and the other half for model testing.

People and Organizations

Creators:
Individual: Morodoluwa Akin-Fajiye
Organization:Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
Email Address:
morodoluwa.akin-fajiye@stonybrook.edu
Id:https://orcid.org/0000-0001-8078-3970
Individual: Jessica Gurevitch
Organization:Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
Email Address:
jessica.gurevitch@stonybrook.edu
Id:https://orcid.org/0000-0003-0157-4332
Contacts:
Individual: Jessica Gurevitch
Organization:Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
Email Address:
jessica.gurevitch@stonybrook.edu
Id:https://orcid.org/0000-0003-0157-4332
Associated Parties:
Individual: Emily Rollinson
Organization:1) Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
Email Address:
erollinson@esu.edu
Role:Field Work, Locating Field Sites
Individual: David Edward Lowry
Organization:1) Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
Email Address:
delowry@hsc.edu
Role:Field Work, Locating Field Sites
Individual: Genoveva Rodriguez-Castañeda
Organization:Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA
Email Address:
genioveva@gmail.com
Role:Field Work, Locating Field Sites
Individual: David Waring
Organization:1) Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
Role:Field Work, Locating Field Sites, Data Analysis

Temporal, Geographic and Taxonomic Coverage

Temporal, Geographic and/or Taxonomic information that applies to all data in this dataset:

Time Period
Begin:
2013
End:
2018
Geographic Region:
Description:Northeastern parts of United States of America, including the states of Pennsylvania, New York, New Jersey, Vermont, Maine, Connecticut, Massachusetts, New Hampshire, and Rhode Island
Bounding Coordinates:
Northern:  45.0124Southern:  39.5818
Western:  -80.6077Eastern:  -67.9247
Taxonomic Range:
Classification:
Rank Name:kingdom
Rank Value:Plantae
Classification:
Rank Name:subkingdom
Rank Value:Viridiplantae
Classification:
Rank Name:infrakingdom
Rank Value:Streptophyta
Classification:
Rank Name:superdivision
Rank Value:Embryophyta
Classification:
Rank Name:division
Rank Value:Tracheophyta
Classification:
Rank Name:subdivision
Rank Value:Spermatophytina
Classification:
Rank Name:class
Rank Value:Magnoliopsida
Classification:
Rank Name:superorder
Rank Value:Asteranae
Classification:
Rank Name:order
Rank Value:Asterales
Classification:
Rank Name:family
Rank Value:Asteraceae
Classification:
Rank Name:genus
Rank Value:Centaurea
Classification:
Rank Name:species
Rank Value:Centaurea stoebe

Project

Parent Project Information:

Title:Collaborative research: Demographic heterogeneity at landscape scales in an emergent invasive species, Centaurea stoebe, in New York State
Personnel:
Individual: Jessica Gurevitch
Role:Principal Investigator
Funding: Division of Environmental Biology, National Science Foundation (NSF) 1119891

Maintenance

Maintenance:
Description:completed
Frequency:
Other Metadata

Additional Metadata

additionalMetadata
        |___text '\n    '
        |___element 'metadata'
        |     |___text '\n      '
        |     |___element 'unitList'
        |     |     |___text '\n        '
        |     |     |___element 'unit'
        |     |     |     |  \___attribute 'id' = 'percent'
        |     |     |     |  \___attribute 'multiplierToSI' = ''
        |     |     |     |  \___attribute 'name' = 'percent'
        |     |     |     |  \___attribute 'parentSI' = ''
        |     |     |     |  \___attribute 'unitType' = ''
        |     |     |     |___text '\n          '
        |     |     |     |___element 'description'
        |     |     |     |___text '\n        '
        |     |     |___text '\n      '
        |     |___text '\n    '
        |___text '\n  '

EDI is a collaboration between the University of New Mexico and the University of Wisconsin – Madison, Center for Limnology:

UNM logo UW-M logo