Data Package Metadata

Journey North - Tulip observations by volunteer community scientists across North America (1996-2020)

General Information
Data Package:
Local Identifier:edi.1171.1
Title:Journey North - Tulip observations by volunteer community scientists across North America (1996-2020)
Alternate Identifier:DOI PLACE HOLDER
Abstract:
This data package contains tulip phenology data: 21,148 observational reports submitted from 1996 to 2020 across North America. The data were collected by 6,645 community scientists for Journey North, a crowdsourced participatory science program of the University of Wisconsin-Madison Arboretum. The Journey North Tulip Test Garden Project is an ongoing study of tulip phenology conducted at broad spatial and temporal scales. Since 1996, community scientists have tracked planting, emergence, and blooming of tulips (Tulipa L.) in the United States. Most observations should be of the Red Emperor Tulip, but not all observations can be validated as this species, so researchers are encouraged to read observer comments to confirm tulip species. Observers also provide estimates of the number of tulips sighted. However, observers do not follow standardized counting methods: they do not observe at set times of the day, do not repeat observations regularly, and are not required to report the length of time over which a count was made. It is therefore recommended that this dataset be analyzed for broad phenological information. Volunteer comments provide rich qualitative information about observational reports, and submitted photographs provide further context; researchers are encouraged to consult both. The Journey North Tulip Test Garden Project dataset is hosted by the University of Wisconsin-Madison Shared Web Hosting Service.
Publication Date:2022-07-22
For more information:
Visit: DOI PLACE HOLDER

Time Period
Begin:
1996-01-06
End:
2020-12-24

People and Organizations
Contact:Sheehan, Nancy (University of Wisconsin - Madison Arboretum, Journey North Program, Program Coordinator)
Creator:Sheehan, Nancy (University of Wisconsin - Madison Arboretum, Journey North Program, Program Coordinator)
Creator:Abarca, Maricela (University of Wisconsin - Madison, Data Manager (EDI Fellow))
Associate:Weber-Grullon, Luis (Data Manager (EDI Fellow))
Associate:Smith, Garrett (University of Wisconsin - Madison, Software Developer)
Associate:Isenbarger, Annie (University of Wisconsin - Madison Arboretum, Data Verifier)
Associate:Abarca, Maricela (University of Wisconsin - Madison, Data Manager (EDI Fellow))

Data Entities
Data Table Name:
UWMadisonArb_JNorth_Project_Tulips_1996-2020
Description:
Journey North - Tulip test garden observational data across North America (1996-2020)
Other Name:
UW-MadisonArb.JourneyNorthProgram_Project_Initial-Subset-Cleaning
Description:
This notebook documents initial cleaning steps for each Journey North Project subset, such as changing sighting date format to YYYY-MM-DD, filtering for relevant dates, filtering each species column for relevant subset data, and creating copies of the species column to modify further in the dataPrep process in OpenRefine.
Other Name:
dataPrepProcess_tulips_OpenRefine
Description:
This document outlines the steps taken in OpenRefine to standardize the variations of reporting categories submitted by volunteers to the categories currently implemented by Journey North.
Other Name:
UW-MadisonArboretum.JourneyNorthProgram_finalEDICleaning
Description:
Documentation of final cleaning steps for EDI publishing. These include: capitalizing proper nouns, dropping columns with sensitive information, replacing blank/empty cells with coded values, dropping records with missing coordinates, rounding coordinates for privacy, and filtering out records with coordinates that do not fall into the right geographic scope.

Detailed Metadata

Data Entities


Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/1171/1/77f4e1acf0c50282575daefa09b2b633
Name:UWMadisonArb_JNorth_Project_Tulips_1996-2020
Description:Journey North - Tulip test garden observational data across North America (1996-2020)
Number of Records:21148
Number of Columns:16

Table Structure
Object Name:UWMadisonArb_JNorth_Project_Tulips_1996-2020.csv
Size:6790397 bytes
Authentication:23df10eb2abdd18026854c6a060e5f24 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions

Column Name:id
Definition:A unique identifier for each sighting submitted.
Storage Type:string
Measurement Type:nominal

Column Name:sighting_date
Definition:Date when the sighting was submitted.
Storage Type:dateTime
Measurement Type:dateTime
Format:YYYY-MM-DD

Column Name:species
Definition:Designation for the type of sighting being reported.
Storage Type:string
Measurement Type:nominal
Allowed Values and Definitions:
  Code:Tulip BLOOMED
  Definition:A spring reporting category (January 1-July 31). Journey North specifically tracks the spring blooming of tulips. Observers report blooms using this reporting category.
  Code:Tulips (OTHER Observations)
  Definition:A year-round reporting category. Observer reports include but are not limited to: the health of their tulips (mortality, survival rates, growth rates, etc.), both abiotic and biotic factors in their tulips’ growth (climate, animal interactions, etc.), and other phenological changes.
  Code:Tulips EMERGED
  Definition:A spring reporting category (January 1-July 31). Journey North specifically tracks the spring emergence of tulips. Observers report first emergence using this reporting category.
  Code:Tulips PLANTED
  Definition:A fall reporting category (August-December). Observers report the dates when they first plant tulip bulbs.

Column Name:number
Definition:Number of individuals observed per sighting category. Data review indicated errors in the number field: values were missing/null or "0", and/or the numbers entered did not correspond with numbers reported in the comment field. The initial cleaning process for this field did not include filtering the comments for this data or extracting those reported numbers. It only replaced missing/null values or zeros with a "1". Because some observers reported actual numbers of species sighted, researchers are encouraged to read the comments.
Storage Type:integer
Measurement Type:ratio
Unit:number
Type:integer

Column Name:latitude
Definition:Latitude in decimal degrees specified by the observer, determined using the "report sighting" user interface, in the submitted sighting.
Storage Type:float
Measurement Type:ratio
Unit:decimalDegree
Type:real

Column Name:longitude
Definition:Longitude in decimal degrees specified by the observer, determined using the "report sighting" user interface, in the submitted sighting.
Storage Type:float
Measurement Type:ratio
Unit:decimalDegree
Type:real

Column Name:flag_location
Definition:Flags the method by which each location (latitude and longitude) was obtained, since different methods were used.
Storage Type:string
Measurement Type:nominal
Allowed Values and Definitions:
  Code:0
  Definition:The location (latitude and longitude) was recorded using the zip code of the community scientist when they first registered online with Journey North.
  Code:1
  Definition:The location (latitude and longitude) was recorded by the community scientists using the Google Maps location picker available on the Journey North web interface at the time of submission of the sighting.

Column Name:comments
Definition:Any commentary or detail that the user wishes to include as part of their sighting.
Storage Type:string
Measurement Type:nominal
Missing Value Code:-999999.999
Explanation:The user did not include any value as it is not required.

Column Name:school
Definition:The name of the school reporting the data.
Storage Type:string
Measurement Type:nominal
Missing Value Code:-999999.999
Explanation:The user did not include any value as it is not required.

Column Name:grade
Definition:The student or classroom grade where the data is reported by the school.
Storage Type:string
Measurement Type:nominal
Missing Value Code:-999999.999
Explanation:The user did not include any value as it is not required.

Column Name:image_url
Definition:The journeynorth.org website URL to the image posted with the corresponding sighting.
Storage Type:string
Measurement Type:nominal
Missing Value Code:-999999.999
Explanation:Not all sightings contain an image URL as a photo submission of the sighting is not required.

Column Name:customerid
Definition:A unique identifier of the user who reported the sighting.
Storage Type:string
Measurement Type:nominal

Column Name:number_old
Definition:Original number of individuals observed per sighting, prior to data cleaning.
Storage Type:float
Measurement Type:ratio
Unit:number
Type:real
Missing Value Code:-999999.999
Explanation:The user did not include the number of tulips sighted.

Column Name:flag_number
Definition:Flags sightings whose number value was changed from its original entry. Data review indicated errors in the number field: values were missing/null or "0", and/or the numbers entered did not correspond with numbers reported in the comment field. The initial cleaning process for this field did not include filtering the comments for this data or extracting those reported numbers. It only replaced missing/null values or zeros with a "1". Because some observers reported actual numbers of species sighted, researchers are encouraged to read the comments. This column flags where the number has been modified by the data cleaning process.
Storage Type:string
Measurement Type:nominal
Allowed Values and Definitions:
  Code:0
  Definition:No change.
  Code:1
  Definition:When the value was missing, a 1 was added as a new value.
  Code:2
  Definition:When the value was zero, a 1 was added as a new value.

Column Name:species_old
Definition:Original designation for the reporting category, prior to data cleaning.
Storage Type:string
Measurement Type:nominal
Missing Value Code:-999999.999
Explanation:Some sightings did not have any value as the reporting category.

Column Name:flag_species
Definition:Flags reporting categories whose species value was changed.
Storage Type:string
Measurement Type:nominal
Allowed Values and Definitions:
  Code:0
  Definition:No change.
  Code:1
  Definition:Reclassified to match current reporting categories and/or to correct typos.

Accuracy Report:                                
Accuracy Assessment:                                
Coverage:                                
Methods:                                
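
The column descriptions for this table imply a few checks a reader may want when loading the data: recognizing the missing value code, using flag_number to tell reported counts from filled-in ones, and comparing a sighting date with the seasonal window of its reporting category. The sketch below is illustrative only. The helper names are hypothetical, values are assumed to arrive as strings from a CSV reader, and treating a filled-in count as unknown is a design choice of this example, not something the data package prescribes.

```python
from datetime import date
from typing import Optional

MISSING = "-999999.999"  # missing value code listed in this record

# Reporting windows as (first month, last month), taken from the
# definitions of the species codes.
SEASONS = {
    "Tulip BLOOMED": (1, 7),
    "Tulips EMERGED": (1, 7),
    "Tulips PLANTED": (8, 12),
    "Tulips (OTHER Observations)": (1, 12),
}


def clean_field(value: str) -> Optional[str]:
    """Map the missing value code to None; leave other text unchanged."""
    return None if value.strip() == MISSING else value


def observed_count(number: str, flag_number: str) -> Optional[int]:
    """Return the count only when the observer actually reported it.

    flag_number "0" means no change. Codes "1" and "2" mean a 1 was
    filled in for a missing or zero entry, so this example (an assumption)
    treats the count as unknown.
    """
    return int(number) if flag_number == "0" else None


def in_season(species: str, sighting_date: str) -> bool:
    """Check a YYYY-MM-DD date against the category's reporting window."""
    first, last = SEASONS[species]
    return first <= date.fromisoformat(sighting_date).month <= last
```

For example, `in_season("Tulips PLANTED", "2019-10-12")` is true, while a "Tulip BLOOMED" report dated in October falls outside the spring window and may deserve a look at its comment.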

Non-Categorized Data Resource

Name:UW-MadisonArb.JourneyNorthProgram_Project_Initial-Subset-Cleaning
Entity Type:ipynb
Description:This notebook documents initial cleaning steps for each Journey North Project subset, such as changing sighting date format to YYYY-MM-DD, filtering for relevant dates, filtering each species column for relevant subset data, and creating copies of the species column to modify further in the dataPrep process in OpenRefine.
Physical Structure Description:
Object Name:UW-MadisonArb.JourneyNorthProgram_Project_Initial-Subset-Cleaning.ipynb
Size:25623 bytes
Authentication:ac50a4433d3acdd69adf645e219d6276 Calculated By MD5
Externally Defined Format:
Format Name:ipynb
Data:https://pasta-s.lternet.edu/package/data/eml/edi/1171/1/3afc10ec47980a8394732608b2617593

Non-Categorized Data Resource

Name:dataPrepProcess_tulips_OpenRefine
Entity Type:pdf
Description:This document outlines the steps taken in OpenRefine to standardize the variations of reporting categories submitted by volunteers to the categories currently implemented by Journey North.
Physical Structure Description:
Object Name:dataPrepProcess_tulips_OpenRefine.pdf
Size:122272 bytes
Authentication:d2d71a9ef2435d71cb7bd095db5b73e4 Calculated By MD5
Externally Defined Format:
Format Name:pdf
Data:https://pasta-s.lternet.edu/package/data/eml/edi/1171/1/af5ac99cd6500f9f194625e53c97493c

Non-Categorized Data Resource

Name:UW-MadisonArboretum.JourneyNorthProgram_finalEDICleaning
Entity Type:ipynb
Description:Documentation of final cleaning steps for EDI publishing. These include: capitalizing proper nouns, dropping columns with sensitive information, replacing blank/empty cells with coded values, dropping records with missing coordinates, rounding coordinates for privacy, and filtering out records with coordinates that do not fall into the right geographic scope.
Physical Structure Description:
Object Name:UW-MadisonArboretum.JourneyNorthProgram_finalEDICleaning.ipynb
Size:26450 bytes
Authentication:89c8be8fdeab372b48a24443e024dccb Calculated By MD5
Externally Defined Format:
Format Name:ipynb
Data:https://pasta-s.lternet.edu/package/data/eml/edi/1171/1/dae1bffdbdb70faf441452405b2bede2

Data Package Usage Rights

This information is released under the Creative Commons license - Attribution - CC BY (https://creativecommons.org/licenses/by/4.0/). The consumer of these data ("Data User" herein) is required to cite it appropriately in any publication that results from its use. The Data User should realize that these data may be actively used by others for ongoing research and that coordination may be necessary to prevent duplicate publication. The Data User is urged to contact the authors of these data if any questions about methodology or results occur. Where appropriate, the Data User is encouraged to consider collaboration or co-authorship with the authors. The Data User should realize that misinterpretation of data may occur if used out of context of the original study. While substantial efforts are made to ensure the accuracy of data and associated documentation, complete accuracy of data sets cannot be guaranteed. All data are made available "as is." The Data User should be aware, however, that data are updated periodically and it is the responsibility of the Data User to check for new versions of the data. The data authors and the repository where these data were obtained shall not be liable for damages resulting from any use or misinterpretation of the data. Thank you.

Keywords

By Thesaurus:
(No thesaurus): participatory science, citizen science, North America, tulip, botany, Tulipa
LTER Controlled Vocabulary: phenology, flowers, plants, flowering, plant growth, plant phenology

Methods and Protocols

These methods, instrumentation and/or protocols apply to all data in this dataset:

Methods and protocols used in the collection of this data package
Description:
Data Collection
After registering at https://journeynorth.org/reg/, community scientists (hereinafter volunteers or observers) enter their observations (or “sightings”) by visiting https://journeynorth.org/sightings/ and selecting the phenological event reporting category that best fits their observations. The occurrence of a phenological event can be determined by a single date and time of a life cycle stage. For the Journey North program, the first arrival date of a hummingbird species in the spring is an example of such a phenological event reporting category. (As noted by USA-NPN: The definition of the term “phenological event” has not yet been standardized and varies among scientists. USA-NPN website.)
Once a reporting category has been selected, the volunteers enter additional information about their observations, including:
• location of observations (based on known street address, Google Maps location picker, or known latitude/longitude coordinates) (required field)
• number of individual species observed (optional field)
• date of observations (required field)
• comments describing ecological context (optional field)
• photo voucher of observations (optional field)
Once the information has been entered, volunteers click on “Submit Report” to complete the process. Journey North provides online support for volunteers. Protocols and identification tips are described on the website. Journey North also fields inquiries via a helpline.
Instrument(s):Website for reporting a sighting: https://journeynorth.org/projects
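
The required and optional fields described under data collection can be pictured as a simple record type. This is a hypothetical sketch for illustration only: the class `SightingReport`, its field names, and the `missing_required` helper are assumptions and do not describe the actual Journey North submission system.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class SightingReport:
    """One submitted sighting; field names here are illustrative."""
    category: str                      # selected reporting category
    latitude: Optional[float]          # location (required field)
    longitude: Optional[float]         # location (required field)
    sighting_date: Optional[date]      # date of observation (required field)
    number: Optional[int] = None       # number observed (optional field)
    comments: Optional[str] = None     # ecological context (optional field)
    photo_url: Optional[str] = None    # photo voucher (optional field)


def missing_required(report: SightingReport) -> list[str]:
    """List the required fields that were left empty."""
    problems = []
    if report.latitude is None or report.longitude is None:
        problems.append("location")
    if report.sighting_date is None:
        problems.append("date")
    return problems


report = SightingReport("Tulips EMERGED", 43.042, -89.429, None)
print(missing_required(report))
```

Because number, comments, and photo are optional at submission time, downstream columns for them can legitimately be empty, which is why the published table needs a missing value code.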
Description:
Data verification:
• Expert review by project staff.
• Digital vouchers: photo submissions.
• Contacting participants about unusual reports.
• Participants known and qualified.
• Data triangulation: corroboration from other data sources, e.g., remote sensing data, qualitative data, historical trend data.
• Data quality documentation: metadata documentation.
Description:
Data processing and cleaning:
• Identifying and correcting inconsistencies in formatting for each column of the dataset, while keeping the meaning of the data intact.
• Creating a copy of the original version of any column to be modified, as well as another column to flag the records whose values were changed.
• Filling in a "1" for rows in the number column where the record contained a zero or missing value. The assumption is that observers must observe at least one individual to report a sighting; however, some observers reported the number of individuals sighted in the comments field. The initial cleaning process for this field did not include filtering the comments for this data or extracting those reported numbers.
• Filling gaps in the ‘customerid’ column by using the email of each observational report as a unique identifier, then excluding the emails from the dataset to ensure the privacy of our volunteers.
• Fixing typos in the reporting categories that occurred because some of the older data were entered manually in our system rather than through our current drop-down menu.
• Replacing any empty value with the standard code ‘-999999.999’, given that not all of the fields on the online platform are required.
• Ensuring that each column's data type was consistent with the type of data it contained.
• Rounding longitude and latitude values to 3 decimal places to ensure the privacy of our volunteers.
• Excluding any observational report that seems beyond recovery in terms of its reliability, for reasons such as: a) missing latitude and longitude, b) missing both email and customerid, c) sighting dates that are extremely old (before the founding of Journey North).
• Ensuring that there were no duplicates of any observational report.
Instrument(s):MySQL Server, Jupyter Notebook (Python), OpenRefine, Git & Github.
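
Several of the cleaning steps listed above can be sketched in a few lines of Python. This is not the authors' notebook code: the function names, the order of the steps, and the choice to detect duplicates on the cleaned row are all assumptions made for illustration. The sketch covers only filling and flagging the number column, rounding coordinates to 3 decimal places, dropping records without coordinates, and removing duplicates.

```python
def clean_number(raw: str) -> tuple[int, int]:
    """Return (number, flag_number) using the flag codes in this record."""
    if raw.strip() == "":
        return 1, 1  # missing value replaced with 1
    value = int(float(raw))
    if value == 0:
        return 1, 2  # zero replaced with 1
    return value, 0  # no change


def clean_records(records: list[dict[str, str]]) -> list[dict]:
    """Apply a subset of the documented cleaning steps to raw records."""
    seen = set()
    cleaned = []
    for rec in records:
        # Drop records with missing latitude or longitude.
        if not rec.get("latitude") or not rec.get("longitude"):
            continue
        number, flag = clean_number(rec.get("number", ""))
        row = {
            "sighting_date": rec["sighting_date"],
            "species": rec["species"],
            # Round coordinates to 3 decimal places for privacy.
            "latitude": round(float(rec["latitude"]), 3),
            "longitude": round(float(rec["longitude"]), 3),
            "number": number,
            "number_old": rec.get("number", ""),  # copy of the original entry
            "flag_number": flag,
        }
        key = tuple(row.items())
        if key in seen:  # skip exact duplicates of an earlier report
            continue
        seen.add(key)
        cleaned.append(row)
    return cleaned
```

Keeping `number_old` next to `flag_number` mirrors the copy-then-flag approach described in the list: the original entry stays available even after the value used for analysis has been changed.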

People and Organizations

Publishers:
Organization:Environmental Data Initiative
Email Address:
info@edirepository.org
Web Address:
https://edirepository.org
Id:https://ror.org/0330j0z60
Creators:
Individual: Nancy Sheehan
Organization:University of Wisconsin - Madison Arboretum, Journey North Program
Position:Program Coordinator
Email Address:
nsheehan@wisc.edu
Id:https://orcid.org/0000-0002-2632-0796
Individual: Maricela Abarca
Organization:University of Wisconsin - Madison
Position:Data Manager (EDI Fellow)
Email Address:
abarca.maricela@gmail.com
Id:https://orcid.org/0000-0002-0890-8887
Contacts:
Individual: Nancy Sheehan
Organization:University of Wisconsin - Madison Arboretum, Journey North Program
Position:Program Coordinator
Email Address:
nsheehan@wisc.edu
Id:https://orcid.org/0000-0002-2632-0796
Associated Parties:
Individual: Luis Weber-Grullon
Position:Data Manager (EDI Fellow)
Address:
85281-2443
Email Address:
luisweberg@gmail.com
Id:https://orcid.org/0000-0002-6548-8268
Role:Data Manager (EDI Fellow)
Individual: Garrett Smith
Organization:University of Wisconsin - Madison
Position:Software Developer
Email Address:
garrett.smith@wisc.edu
Role:Software Developer
Individual: Annie Isenbarger
Organization:University of Wisconsin - Madison Arboretum
Position:Data Verifier
Email Address:
annie.isenbarger@wisc.edu
Role:Data Verifier
Individual: Maricela Abarca
Organization:University of Wisconsin - Madison
Email Address:
abarca.maricela@gmail.com
Id:https://orcid.org/0000-0002-0890-8887
Role:Data Manager (EDI Fellow)

Temporal, Geographic and Taxonomic Coverage

Temporal, Geographic and/or Taxonomic information that applies to all data in this dataset:

Time Period
Begin:
1996-01-06
End:
2020-12-24
Geographic Region:
Description:North America
Bounding Coordinates:
Northern:  64.779
Southern:  19.5
Western:  -159.795
Eastern:  -57.57
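
The bounding coordinates lend themselves to a quick scope check on individual records. The sketch below is a minimal illustration that uses the four values listed in this record; the function name `in_bounds` is hypothetical, and a rectangular test like this is only a coarse approximation of "North America", not the filter the authors applied.

```python
# Bounding coordinates listed in this record, in decimal degrees.
NORTH, SOUTH = 64.779, 19.5
WEST, EAST = -159.795, -57.57


def in_bounds(latitude: float, longitude: float) -> bool:
    """Return True when a point lies inside the bounding rectangle."""
    return SOUTH <= latitude <= NORTH and WEST <= longitude <= EAST


print(in_bounds(43.042, -89.429))  # a point in Wisconsin
print(in_bounds(51.5, -0.13))      # a point in London
```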
Taxonomic Range:
General Coverage:Taxa were determined by the common names used in the species column.
Classification:
Rank Name:Kingdom
Rank Value:Plantae
Common Name:plants
Identifier:https://www.itis.gov
ID: 202422
Classification:
Rank Name:Subkingdom
Rank Value:Viridiplantae
Common Name:green plants
Identifier:https://www.itis.gov
ID: 954898
Classification:
Rank Name:Infrakingdom
Rank Value:Streptophyta
Common Name:land plants
Identifier:https://www.itis.gov
ID: 846494
Classification:
Rank Name:Division
Rank Value:Tracheophyta
Common Name:vascular plants
Identifier:https://www.itis.gov
ID: 846496
Classification:
Rank Name:Subdivision
Rank Value:Spermatophytina
Common Name:spermatophytes
Identifier:https://www.itis.gov
ID: 846504
Classification:
Rank Name:Class
Rank Value:Magnoliopsida
Identifier:https://www.itis.gov
ID: 18063
Classification:
Rank Name:Superorder
Rank Value:Lilianae
Common Name:monocots
Identifier:https://www.itis.gov
ID: 846542
Classification:
Rank Name:Order
Rank Value:Liliales
Identifier:https://www.itis.gov
ID: 42612
Classification:
Rank Name:Family
Rank Value:Liliaceae
Identifier:https://www.itis.gov
ID: 42633
Classification:
Rank Name:Genus
Rank Value:Tulipa
Common Name:tulip
Identifier:https://www.itis.gov
ID: 43104

Project

Parent Project Information:

Title:Tulip Test Garden Observations in North America (1999-2020)
Personnel:
Individual: Nancy Sheehan
Organization:University of Wisconsin - Madison Arboretum
Position:Journey North Program Coordinator
Email Address:
nsheehan@wisc.edu
Id:https://orcid.org/0000-0002-2632-0796
Role:Journey North Program Coordinator
Abstract:Launched in 1994, the Journey North program had three goals: 1) To understand how species respond, across spatial and temporal scales, to climate change. 2) To understand the migration patterns and distribution of species during breeding and overwintering, across spatial and temporal scales, to inform land management and conservation decisions (such as critical habitat designations). 3) To build public awareness of habitat requirements of migratory species that span their life cycles to spur conservation efforts.

Additional Award Information:
Funder:Annenberg Foundation
Title:Journey North Program

Maintenance

Maintenance:
Description:This is an ongoing data set that is being maintained by the University of Wisconsin - Madison Arboretum, Journey North program.
Frequency:asNeeded

Other Metadata

Additional Metadata

additionalMetadata
        |___text '\n    '
        |___element 'metadata'
        |     |___text '\n      '
        |     |___element 'importedFromXML'
        |     |        \___attribute 'dateImported' = '2022-07-20'
        |     |        \___attribute 'filename' = 'tulips.xml'
        |     |        \___attribute 'taxonomicCoverageExempt' = 'True'
        |     |___text '\n    '
        |___text '\n  '

Additional Metadata

additionalMetadata
        |___text '\n    '
        |___element 'metadata'
        |     |___text '\n      '
        |     |___element 'emlEditor'
        |     |        \___attribute 'app' = 'ezEML'
        |     |        \___attribute 'release' = '2022.06.04'
        |     |___text '\n    '
        |___text '\n  '

EDI is a collaboration between the University of New Mexico and the University of Wisconsin – Madison, Center for Limnology.