Data Package Metadata   View Summary

Ant Assemblages in Hemlock Removal Experiment at Harvard Forest since 2003 (Reformatted to the ecocomDP Design Pattern)

General Information
Data Package:
Local Identifier:edi.193.4
Title:Ant Assemblages in Hemlock Removal Experiment at Harvard Forest since 2003 (Reformatted to the ecocomDP Design Pattern)
Alternate Identifier:DOI PLACE HOLDER
Abstract:

This data package is formatted as an ecocomDP (Ecological Community Data Pattern). For more information on ecocomDP see https://github.com/EDIorg/ecocomDP. This Level 1 data package was derived from the Level 0 data package found here: https://pasta.lternet.edu/package/metadata/eml/knb-lter-hfr/118/32. The abstract below was extracted from the Level 0 data package and is included for context:

Ants comprise a considerable amount of animal biomass in terrestrial ecosystems and play major roles in ecological processes ranging from seed dispersal to soil turnover. Invasion by the hemlock woolly adelgid will transform late-successional hemlock forests into earlier successional mixed hardwood - white pine forests or red-maple wetlands. Understanding how ant assemblages vary in different habitat types allows for predictions of how hemlock decline could alter the composition of ant assemblages, with implications for a wide range of ecosystem processes. As part of the Hemlock Removal Experiment at the Simes Tract, we annually monitor ant species composition and abundance.

Publication Date:2021-05-13

Time Period
Begin:
2003
End:
2018

People and Organizations
Contact:Smith, Colin (Environmental Data Initiative) [  email ]
Contact:Ellison, Aaron (Harvard Forest) [  email ]
Creator:Ellison, Aaron 

Data Entities
Data Table Name:
observation
Description:
This is the core table, which holds the observations being analyzed, eg, organism abundance or density. Observations must be linked to a taxon and to a location. Linking to ancillary observations (via observation_id) is optional. The event_id column is a placeholder, as we are considering a structure to model sampling events in the future.
Data Table Name:
observation_ancillary
Description:
Ancillary information about an observational event for context, but that are not about the organism or sampling location (e.g. water depth, height of a tower, temperature of medium). These are very often environmental driver data in analyses.
Data Table Name:
location
Description:
Identifying information about a place (lonitude, latitude, elevation). The table is self-referencing so that sites can be nested.
Data Table Name:
location_ancillary
Description:
Additional information about a place that does not change frequently (e.g. lake area or depth, experimental treatment). Features that change frequently are more closely related to the observational event, and are thus kept in the observation_ancillary table. Ancillary observations are linked through the location_id, and one location_id may have many ancillary observations about it.
Data Table Name:
taxon
Description:
Identifying information about a taxon (e.g. name, id and system)
Data Table Name:
taxon_ancillary
Description:
Additional info about an organism that does not change frequently (e.g. trophic level). Features that change frequently are probably observations. Ancillary observations are linked through the taxon_id, and one taxon_id may have many ancillary observations about it.
Data Table Name:
dataset_summary
Description:
Summary info about the dataset.
Data Table Name:
variable_mapping
Description:
Information linking a variable_name used in a data table to an external definition.
Other Name:
create_ecocomDP
Description:
A function for converting knb-lter-hrf.118 to ecocomDP
Detailed Metadata

Data Entities


Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/193/4/e09491aee3bd9ec02e805ffdac0beb12
Name:observation
Description:This is the core table, which holds the observations being analyzed, eg, organism abundance or density. Observations must be linked to a taxon and to a location. Linking to ancillary observations (via observation_id) is optional. The event_id column is a placeholder, as we are considering a structure to model sampling events in the future.
Number of Records:2931
Number of Columns:7

Table Structure
Object Name:observation.csv
Size:125975 bytes
Authentication:c60115954464b0db7393259ef2472dfe Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:observation_id  
package_id  
location_id  
datetime  
taxon_id  
variable_name  
value  
Definition:Identifier assigned to each unique observation.Identifier of this data package. References the package_id field of the dataset_summary table.A reference to a location. References the location_id field of the location table.Date and time of the observation.A reference to a taxon. References the taxon_id field of the taxon table.Name of the measured variable.Value of the measured variable.
External Measurement Definition, Link: contains measurements of type study location identifier contains measurements of type date and time of measurement contains measurements of type Measurement Type
Storage Type:string  
string  
string  
date  
string  
string  
float  
Measurement Type:nominalnominalnominaldateTimenominalnominalratio
Measurement Values Domain:
DefinitionIdentifier assigned to each unique observation.
DefinitionIdentifier of this data package. References the package_id field of the dataset_summary table.
DefinitionA reference to a location. References the location_id field of the location table.
FormatYYYY-MM-DD
Precision
DefinitionA reference to a taxon. References the taxon_id field of the taxon table.
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codeabundance
Definitionhow many
Source
Unitdimensionless
Typenatural
Min
Max96 
Missing Value Code:
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
Accuracy Report:              
Accuracy Assessment:              
Coverage:              
Methods:              

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/193/4/10324ae4aa6a0a67239e782c739fda7e
Name:observation_ancillary
Description:Ancillary information about an observational event for context, but that are not about the organism or sampling location (e.g. water depth, height of a tower, temperature of medium). These are very often environmental driver data in analyses.
Number of Records:5862
Number of Columns:4

Table Structure
Object Name:observation_ancillary.csv
Size:147289 bytes
Authentication:53ebc79bb2997a98f2f68d4fb462bbc7 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:observation_ancillary_id  
observation_id  
variable_name  
value  
Definition:Identifier of the observation ancillary information.A reference to an observation. References the observation_id field of the observation table.Name of the measured variable.Value of the measured variable.
External Measurement Definition, Link: contains measurements of type Measurement Type
Storage Type:string  
string  
string  
string  
Measurement Type:nominalnominalnominalnominal
Measurement Values Domain:
DefinitionIdentifier of the observation ancillary information.
DefinitionA reference to an observation. References the observation_id field of the observation table.
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codetrap.type
Definitiontrap type
Source
Code Definition
Codetrap.num
Definitionapplies only to pitfall cups
Source
DefinitionValue of the measured variable.
Missing Value Code:
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
Accuracy Report:        
Accuracy Assessment:        
Coverage:        
Methods:        

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/193/4/d5189de027922f81005951e6efe0efd5
Name:location
Description:Identifying information about a place (lonitude, latitude, elevation). The table is self-referencing so that sites can be nested.
Number of Records:10
Number of Columns:1

Table Structure
Object Name:location.csv
Size:45 bytes
Authentication:3b94b87b8e417e26c0f62d30b3189536 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:
Quote Character:"

Table Column Descriptions
 
Column Name:location_id  
Definition:Identifier assigned to each unique location.
External Measurement Definition, Link: contains measurements of type study location identifier
Storage Type:string  
Measurement Type:nominal
Measurement Values Domain:
DefinitionIdentifier assigned to each unique location.
Missing Value Code:
CodeNA
ExplNot available
Accuracy Report:  
Accuracy Assessment:  
Coverage:  
Methods:  

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/193/4/b49b65c5350c1fd5875236e399470db4
Name:location_ancillary
Description:Additional information about a place that does not change frequently (e.g. lake area or depth, experimental treatment). Features that change frequently are more closely related to the observational event, and are thus kept in the observation_ancillary table. Ancillary observations are linked through the location_id, and one location_id may have many ancillary observations about it.
Number of Records:32
Number of Columns:4

Table Structure
Object Name:location_ancillary.csv
Size:842 bytes
Authentication:c8cd8d19fec41651b8d3c4271f88cce2 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:location_ancillary_id  
location_id  
variable_name  
value  
Definition:Identifier of the location ancillary information.A reference to a location. References the location_id field of the location table.Name of the measured variable.Value of the measured variable.
External Measurement Definition, Link: contains measurements of type study location identifier contains measurements of type Measurement Type
Storage Type:string  
string  
string  
string  
Measurement Type:nominalnominalnominalnominal
Measurement Values Domain:
DefinitionIdentifier of the location ancillary information.
DefinitionA reference to a location. References the location_id field of the location table.
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codemoose.cage
Definitiontreatment type
Source
Code Definition
Codetreatment
Definitionlocation of grid with respect to moose exclosure
Source
DefinitionValue of the measured variable.
Missing Value Code:
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
Accuracy Report:        
Accuracy Assessment:        
Coverage:        
Methods:        

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/193/4/cc5d33161352c025d0385ba6a7262983
Name:taxon
Description:Identifying information about a taxon (e.g. name, id and system)
Number of Records:53
Number of Columns:2

Table Structure
Object Name:taxon.csv
Size:1250 bytes
Authentication:b7ff89572967226a275d673575d7499b Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:taxon_id  
taxon_name  
Definition:Identifier assigned to each unique organism.Taxonomic name of the organism.
External Measurement Definition, Link: contains measurements of type taxonomic classification
Storage Type:string  
string  
Measurement Type:nominalnominal
Measurement Values Domain:
DefinitionIdentifier assigned to each unique organism.
DefinitionTaxonomic name of the organism.
Missing Value Code:
CodeNA
ExplNot available
CodeNA
ExplNot available
Accuracy Report:    
Accuracy Assessment:    
Coverage:    
Methods:    

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/193/4/a3afe3d06df63ce3b0e779169618232f
Name:taxon_ancillary
Description:Additional info about an organism that does not change frequently (e.g. trophic level). Features that change frequently are probably observations. Ancillary observations are linked through the taxon_id, and one taxon_id may have many ancillary observations about it.
Number of Records:742
Number of Columns:4

Table Structure
Object Name:taxon_ancillary.csv
Size:20812 bytes
Authentication:7815d0cd181734deac82cf61032d0afd Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:taxon_ancillary_id  
taxon_id  
variable_name  
value  
Definition:Identifier of the taxon ancillary information.A reference to a taxon. References the taxon_id field of the taxon table.Name of the measured variable.Value of the measured variable.
External Measurement Definition, Link: contains measurements of type Measurement Type
Storage Type:string  
string  
string  
string  
Measurement Type:nominalnominalnominalnominal
Measurement Values Domain:
DefinitionIdentifier of the taxon ancillary information.
DefinitionA reference to a taxon. References the taxon_id field of the taxon table.
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Codesubfamily
Definitionant subfamily
Source
Code Definition
Codehl
Definitionhead length. We used trait definitions from Del Toro et al. (2015) and filled in missing species’ data with information from Ellison et al.
Source
Code Definition
Coderel
Definitioneye length relative to body size
Source
Code Definition
Coderll
Definitionfemur length relative to body size
Source
Code Definition
Codecolony.size
Definitionsize of colony for each species
Source
Code Definition
Codefeeding.preference
Definitionfeeding preference for each species
Source
Code Definition
Codenest.substrate
Definitionnest substrate
Source
Code Definition
Codeprimary.habitat
Definitionprimary habitat
Source
Code Definition
Codesecondary.habitat
Definitionsecondary habitat associations
Source
Code Definition
Codeseed.disperser
Definitionwhether or not a seed dispersing species
Source
Code Definition
Codeslavemaker.sp
Definitionwhether or not a slavemaking species
Source
Code Definition
Codebehavior
Definitionclassifications based on behavioral interactions with other ants
Source
Code Definition
Codebiogeographic.affinity
Definitionbiogeographic affinity based on available occurrence records
Source
Code Definition
Codesource
Definitionwhere trait information was found. Full citations for literature are as follows: Del Toro, I., R.R. Silva, and A.M. Ellison. 2015. Predicated impacts of climatic change on ant functional diversity and distributions in eastern North American forests. Diversity and Distributions 21:781-791; Ellison, A.M., N.J. Gotelli, G. Alpert, and E.J. Farnsworth. 2012. A field guide to the ants of New England. Yale University Press, New Haven, Connecticut, USA.
Source
DefinitionValue of the measured variable.
Missing Value Code:
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
Accuracy Report:        
Accuracy Assessment:        
Coverage:        
Methods:        

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/193/4/8eba90ee8c1a2707f993d9cca5b352dc
Name:dataset_summary
Description:Summary info about the dataset.
Number of Records:1
Number of Columns:5

Table Structure
Object Name:dataset_summary.csv
Size:125 bytes
Authentication:bffb310a03ecf700aeeea6d78fdd9945 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:package_id  
length_of_survey_years  
number_of_years_sampled  
std_dev_interval_betw_years  
max_num_taxa  
Definition:Identifier of this data package.Number of years the study has been ongoing.Number of years within the period of the study that samples were taken.Standard deviation of the interval between sampling events.Number of unique values in the taxon table.
External Measurement Definition, Link: contains measurements of type number of years contains measurements of type number of years
Storage Type:string  
float  
float  
float  
float  
Measurement Type:nominalratioratioratioratio
Measurement Values Domain:
DefinitionIdentifier of this data package.
UnitnominalYear
Typenatural
Min15 
Max15 
UnitnominalYear
Typenatural
Min14 
Max14 
UnitnominalYear
Typereal
Min0.67 
Max0.67 
Unitnumber
Typenatural
Min53 
Max53 
Missing Value Code:
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
Accuracy Report:          
Accuracy Assessment:          
Coverage:          
Methods:          

Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/193/4/afb9b11828f813f17c18143618690322
Name:variable_mapping
Description:Information linking a variable_name used in a data table to an external definition.
Number of Records:19
Number of Columns:3

Table Structure
Object Name:variable_mapping.csv
Size:650 bytes
Authentication:de42c56e1f9c9e4222ef2d5b11b5202c Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:variable_mapping_id  
table_name  
variable_name  
Definition:Identifier of the variable mapping information.Name of the table containing this variable.Name of the measured variable.
External Measurement Definition, Link: contains measurements of type Measurement Type
Storage Type:string  
string  
string  
Measurement Type:nominalnominalnominal
Measurement Values Domain:
DefinitionIdentifier of the variable mapping information.
DefinitionName of the table containing this variable.
DefinitionName of the measured variable.
Missing Value Code:
CodeNA
ExplNot available
CodeNA
ExplNot available
CodeNA
ExplNot available
Accuracy Report:      
Accuracy Assessment:      
Coverage:      
Methods:      

Non-Categorized Data Resource

Name:create_ecocomDP
Entity Type:unknown
Description:A function for converting knb-lter-hrf.118 to ecocomDP
Physical Structure Description:
Object Name:create_ecocomDP.R
Size:16672 bytes
Authentication:4170a8c23a51ac944100311f5eaf2695 Calculated By MD5
Externally Defined Format:
Format Name:text/plain
Data:https://pasta-s.lternet.edu/package/data/eml/edi/193/4/0e153e5174857545b0cd01f5d50bca85

Data Package Usage Rights

This dataset is released to the public under Creative Commons license CC BY (Attribution). Please keep the designated contact person informed of any plans to use the dataset. Consultation or collaboration with the original investigators is strongly encouraged. Publications and data products that make use of the dataset must include proper acknowledgement.

Keywords

By Thesaurus:
LTER controlled vocabularyabundance, ants, hemlock, hemlock woolly adelgid, species composition
LTER core areapopulations, disturbance
HFR defaultHarvard Forest, HFR, LTER, USA
LTER Controlled Vocabularycommunities, community composition, community dynamics, community patterns, species composition, species diversity, species richness
EDI Controlled VocabularyecocomDP
Darwin Core TermsbasisOfRecord: HumanObservation

Methods and Protocols

These methods, instrumentation and/or protocols apply to all data in this dataset:

Methods and protocols used in the collection of this data package
Description:

This data package conforms to the Ecological Community Data Pattern (ecocomDP) developed by the Environmental Data Initiative. The scripts used to convert the source data package to ecocomDP are published with this derived data package. Any questions about this data package should be directed to the first person listed under contacts.

Description:

A permanent 10 x 10-m grid with 25 equally-spaced sample stations was established near the center of each of the 8 plots in the Hemlock Removal Experiment. Pitfall traps (200-ml cups) are buried flush with the soil surface at each point and capped. Caps are removed for sampling - in June, July, and August in 2003-2005; July and August in 2006; and July only since 2007 - 10 ml of soapy water placed in each cup, and cups left open for 48 hours in dry weather. Trap contents are removed and caps replaced after 48 hours. After pitfall trapping is complete, baits (30 g of crumbled Pecan Sandies cookies on white index cards) are placed at the 25 points and allowed to accumulate ants for 1 hour. Representative individuals are collected from each bait station. Three 3-L litter samples are collected from random locations in the large plots, outside of the sample grid, and sifted in the field. Any ants found in the sifted litter are collected. Plots are walked haphazardly for 1 person-hour and additional foraging ants encountered are collected.

Description:

This method step describes provenance-based metadata as specified in the LTER EML Best Practices.

This provenance metadata does not contain entity specific information.

Data Source
Ant Assemblages in Hemlock Removal Experiment at Harvard Forest since 2003

People and Organizations

Publishers:
Organization:Environmental Data Initiative
Email Address:
info@environmentaldatainitiative.org
Web Address:
https://environmentaldatainitiative.org
Creators:
Individual: Aaron Ellison
Contacts:
Individual: Colin Smith
Organization:Environmental Data Initiative
Email Address:
ecocomdp@gmail.com
Individual: Aaron Ellison
Organization:Harvard Forest
Address:
324 North Main Street,
Petersham, MA 01366 USA
Phone:
(978) 724-3302 (voice)
Email Address:
aellison@fas.harvard.edu

Temporal, Geographic and Taxonomic Coverage

Temporal, Geographic and/or Taxonomic information that applies to all data in this dataset:

Time Period
Begin:
2003
End:
2018
Geographic Region:
Description:Simes Tract (Harvard Forest)
Bounding Coordinates:
Northern:  +42.48Southern:  +42.47
Western:  -72.22Eastern:  -72.21
Altitude Minimum:200Altitude Maximum:240
Taxonomic Range:
Classification:
Rank Name:family
Rank Value:Dolichoderinae
Classification:
Rank Name:family
Rank Value:Formicidae
Classification:
Rank Name:family
Rank Value:Formicinae
Classification:
Rank Name:family
Rank Value:Myrmicinae
Classification:
Rank Name:family
Rank Value:Ponerinae
Classification:
Rank Name:genus
Rank Value:Aphaenogaster
Classification:
Rank Name:species
Rank Value:rudis
Classification:
Rank Name:genus
Rank Value:Tsuga
Classification:
Rank Name:species
Rank Value:canadensis

Project

Other Metadata

Additional Metadata

additionalMetadata
        |___text '\n    '
        |___element 'metadata'
        |     |___text '\n      '
        |     |___element 'additionalClassifications'
        |     |     |___text '\n        '
        |     |     |___element 'researchTopic'
        |     |     |     |___text 'plot'
        |     |     |___text '\n        '
        |     |     |___element 'researchTopic'
        |     |     |     |___text 'community'
        |     |     |___text '\n        '
        |     |     |___element 'status'
        |     |     |     |___text 'ongoing'
        |     |     |___text '\n        '
        |     |     |___element 'studyType'
        |     |     |     |___text 'long-term measurement'
        |     |     |___text '\n      '
        |     |___text '\n    '
        |___text '\n  '

Additional Metadata

additionalMetadata
        |___text '\n    '
        |___element 'metadata'
        |     |___text '\n      '
        |     |___element 'additionalLinks'
        |     |     |___text '\n        '
        |     |     |___element 'url'
        |     |     |     |___text '\n          '
        |     |     |     |___element 'name'
        |     |     |     |     |___text 'http://harvardforest.fas.harvard.edu:8080/exist/xquery/data.xq?id=hf147'
        |     |     |     |___text '\n        '
        |     |     |___text '\n        '
        |     |     |___element 'url'
        |     |     |     |___text '\n          '
        |     |     |     |___element 'name'
        |     |     |     |     |___text 'http://harvardforest.fas.harvard.edu:8080/exist/xquery/data.xq?id=hf119'
        |     |     |     |___text '\n        '
        |     |     |___text '\n        '
        |     |     |___element 'url'
        |     |     |     |___text '\n          '
        |     |     |     |___element 'name'
        |     |     |     |     |___text 'http://harvardforest.fas.harvard.edu:8080/exist/xquery/data.xq?id=hf106'
        |     |     |     |___text '\n        '
        |     |     |___text '\n        '
        |     |     |___element 'url'
        |     |     |     |___text '\n          '
        |     |     |     |___element 'name'
        |     |     |     |     |___text 'http://harvardforest.fas.harvard.edu:8080/exist/xquery/data.xq?id=hf097'
        |     |     |     |___text '\n        '
        |     |     |___text '\n        '
        |     |     |___element 'url'
        |     |     |     |___text '\n          '
        |     |     |     |___element 'name'
        |     |     |     |     |___text 'http://harvardforest.fas.harvard.edu:8080/exist/xquery/data.xq?id=hf065'
        |     |     |     |___text '\n        '
        |     |     |___text '\n      '
        |     |___text '\n    '
        |___text '\n  '

EDI is a collaboration between the University of New Mexico and the University of Wisconsin – Madison, Center for Limnology:

UNM logo UW-M logo