Data Package Metadata   View Summary

NEON distributed initial soil characterization dataset subsetted for statistical analysis of organic carbon and extractable metals in Hall and Thompson (2021)

General Information
Data Package:
Local Identifier:edi.476.1
Title:NEON distributed initial soil characterization dataset subsetted for statistical analysis of organic carbon and extractable metals in Hall and Thompson (2021)
Alternate Identifier:DOI PLACE HOLDER
Abstract:

We compiled National Ecological Observatory Network (NEON) datasets related to the initial distributed soil sampling effort and subsetted them (removed samples with missing values for certain variables, and several samples with extreme values) for use in statistical analyses to describe relationships between soil organic carbon (SOC) and metals measured in several soil chemical extractions. The NEON provisional data products we used were DP1.10047.001 and DP1.10008.001, which were subsequently combined by NEON as a single data product DP1.10047.001, “Soil physical and chemical properties, distributed initial characterization”. These datasets were used for the analyses reported in a manuscript by Hall and Thompson (2021) in the Soil Science Society of America Journal.

Publication Date:2021-09-27

Time Period
Begin:
2020-12-01
End:
2021-08-01

People and Organizations
Contact:Hall, Steven J (Iowa State University) [  email ]
Creator:Hall, Steven J (Iowa State University)
Creator:Thompson, Aaron (University of Georgia)

Data Entities
Data Table Name:
NEON dataset for SOC and metal analysis
Description:
NEON dataset for SOC and metal analysis
Detailed Metadata

Data Entities


Data Table

Data:https://pasta-s.lternet.edu/package/data/eml/edi/476/1/4a74579113c0e664e125ca132cfd1395
Name:NEON dataset for SOC and metal analysis
Description:NEON dataset for SOC and metal analysis
Number of Records:2540
Number of Columns:53

Table Structure
Object Name:NEON_soil_chem_for_stats_plots.csv
Size:969696 bytes
Authentication:3ea20cf2e09e973f3b261cf53a60f835 Calculated By MD5
Text Format:
Number of Header Lines:1
Record Delimiter:\r\n
Orientation:column
Simple Delimited:
Field Delimiter:,
Quote Character:"

Table Column Descriptions
 
Column Name:plotID  
horizonID  
pitID  
domainID  
siteID  
horizonName  
biogeoTopDepth  
biogeoCenterDepth  
biogeoBottomDepth  
soilOrder  
soilSuborder  
soilGreatGroup  
soilSubgroup  
soilFamily  
soilSeries  
Andic  
Spodic  
soilSamplingMethod  
decimalLatitude  
decimalLongitude  
elevation  
sandTotal  
siltTotal  
clayTotal  
silt_plus_clay  
carbonTot  
nitrogenTot  
ctonRatio  
estimatedOC  
phH2o  
phCacl2  
alOxalate  
feOxalate  
siOxalate  
alOxalate_mol_kg  
feOxalate_mol_kg  
siOxalate_mol_kg  
alCitDithionate_mol_kg  
feCitDithionate_mol_kg  
siCitDithionate_mol_kg  
siOxalate_mol_kg_minus_siCitDithionate_mol_kg  
alOxalate_mol_kg_minus_alCitDithionate_mol_kg  
caNh4d  
mgNh4d  
caNh4d_plus_mgNh4d  
alCitDithionate  
feCitDithionate  
feCitDithionate_minus_feOxalate  
NLCD_veg  
grouped_NLCD_veg  
map_minus_pet  
mat  
map  
Definition:Plot identifier (NEON site code_XXXX)An identifier for the soil horizonpitIDUnique identifier of the NEON domainFour letter NEON site codeSoil horizon nameTop depth of the biogeochemistry sampleDepth of the center of the biogeochemistry sampleBottom depth of the biogeochemistry sampleSoil taxonomy at the order levelSoil taxonomy at the suborder levelSoil taxonomy at the great group levelSoil taxonomy at the subgroup levelSoil taxonomy at the family levelSoil taxonomy at the series levelSample was collected from an Andisol or a soil designated with andic propertiesSample was collected from a Spodosol or a soil designated with spodic propertiesThe methodology used for collecting soil at a plot (pit or core)The geographic latitude (in decimal degrees, WGS84) of the geographic center of the reference areaThe geographic longitude (in decimal degrees, WGS84) of the geographic center of the reference areaElevation (in meters) above sea levelTotal sand (0.047-2 mm) content on a weight basis of the <2 mm fractionTotal silt (0.002-0.047 mm) content on a weight basis of the <2 mm fractionTotal clay (<0.002 mm) content on a weight basis of the <2 mm fractionTotal silt plus clay content on a weight basis of the <2 mm fractionTotal carbon concentration of the <2 mm fractionTotal nitrogen concentration of the <2 mm fractionRatio of total Carbon to total Nitrogen of the <2 mm fractionEstimated organic carbon concentration of the <2 mm fractionpH of the <2 mm fraction in waterpH of the <2 mm fraction in CaCl2Total soil Al as estimated by the ammonium oxalate extraction method, reported as weight percent on the <2 mm fractionTotal soil noncrystalline iron as measured by the ammonium oxalate extraction method, reported as weight percent on the <2mm fractionTotal soil silica content as measured by the ammonium oxalate extraction method, reported as a weight percent on a <2 mm baseOxalate-extractable Al, reported as moles per kg on a <2 mm basisOxalate-extractable Fe, reported as moles per kg on a <2 mm basisOxalate-extractable Si, reported as moles per kg on a <2 mm basisCitrate-dithionite extractable Al, reported as moles per kg on a <2 mm basisCitrate-dithionite extractable Fe, reported as moles per kg on a <2 mm basisCitrate-dithionite extractable Si, reported as moles per kg on a <2 mm basisThe difference between oxalate-extractable Si and citrate-dithionite-extractable Si, reported as moles per kg on a <2 mm basisThe difference between oxalate-extractable Al and citrate-dithionite-extractable Al, reported as moles per kg on a <2 mm basisAmmonium acetate extractable calcium from the <2 mm fractionAmmonium acetate extractable magnesium from the <2 mm fractionThe sum of ammonium acetate extractable calcium and magnesium from the <2 mm fractionDithionite citrate extractable aluminum content, reported as weight percent on the <2mm fractionDithionite citrate extractable iron, a general measure of total pedogenic iron, reported as weight percent on the <2mm fractionThe difference between Fe measured in citrate-dithionite and oxalate extraction, reported as weight percent on the <2mm fractionVegetation type according to the NLCD classificationBinned NLCD vegetation typeMean annual precipitation minus potential evapotranspiration (WorldClim)Mean annual temperature (WorldClim)Mean annual precipitation (WorldClim)
Storage Type:string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
string  
float  
float  
float  
float  
float  
float  
float  
float  
float  
string  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
float  
string  
string  
float  
float  
float  
Measurement Type:nominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalnominalratioratioratioratioratioratioratioratiorationominalratioratioratioratioratioratioratioratioratioratioratioratioratioratioratioratioratioratioratiorationominalnominalratioratioratio
Measurement Values Domain:
DefinitionPlot identifier (NEON site code_XXXX)
DefinitionAn identifier for the soil horizon
DefinitionpitID
DefinitionUnique identifier of the NEON domain
DefinitionFour letter NEON site code
DefinitionSoil horizon name
DefinitionTop depth of the biogeochemistry sample
DefinitionDepth of the center of the biogeochemistry sample
DefinitionBottom depth of the biogeochemistry sample
DefinitionSoil taxonomy at the order level
DefinitionSoil taxonomy at the suborder level
DefinitionSoil taxonomy at the great group level
DefinitionSoil taxonomy at the subgroup level
DefinitionSoil taxonomy at the family level
DefinitionSoil taxonomy at the series level
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Code0
Definitionno
Source
Code Definition
Code1
Definitionyes
Source
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Code0
Definitionno
Source
Code Definition
Code1
Definitionyes
Source
DefinitionThe methodology used for collecting soil at a plot (pit or core)
Unitdegree
Typereal
Min17.951583 
Max71.316715 
Unitdegree
Typereal
Min-156.650984 
Max-66.824633 
Unitmeter
Typereal
Min2.1 
Max3500.3 
Unitpercent
Typereal
Min1.2 
Max98.9 
Unitpercent
Typereal
Min0.3 
Max80.7 
Unitpercent
Typereal
Min
Max84.7 
Unitpercent
Typereal
Min1.1 
Max98.8 
UnitgramPerKilogram
Typereal
Min0.5 
Max206 
UnitgramPerKilogram
Typereal
Min
Max16.2 
DefinitionRatio of total Carbon to total Nitrogen of the <2 mm fraction
UnitgramPerKilogram
Typenatural
Min
Max193 
Unitdimensionless
Typereal
Min3.1 
Max10 
Unitdimensionless
Typereal
Min2.6 
Max9.5 
Unitpercent
Typereal
Min0.001 
Max2.87 
Unitpercent
Typereal
Min0.001 
Max2.95 
Unitpercent
Typereal
Min0.001 
Max1.44 
UnitmolePerKilogram
Typereal
Min
Max1.063 
UnitmolePerKilogram
Typereal
Min
Max0.528 
UnitmolePerKilogram
Typereal
Min
Max0.513 
UnitmolePerKilogram
Typereal
Min
Max0.751 
UnitmolePerKilogram
Typereal
Min
Max1.488 
UnitmolePerKilogram
Typereal
Min
Max0.179 
UnitmolePerKilogram
Typereal
Min-0.112 
Max0.419 
UnitmolePerKilogram
Typereal
Min-0.337 
Max0.726 
UnitcentimolePerKilogram
Typereal
Min
Max143.7 
UnitcentimolePerKilogram
Typereal
Min
Max34.8 
UnitcentimolePerKilogram
Typereal
Min
Max155.1 
Unitpercent
Typereal
Min
Max2.03 
Unitpercent
Typereal
Min
Max8.31 
Unitpercent
Typereal
Min
Max7.98 
DefinitionVegetation type according to the NLCD classification
Allowed Values and Definitions
Enumerated Domain 
Code Definition
Code1
Definitionforest
Source
Code Definition
Code2
Definitiongrassland/shrubland
Source
Code Definition
Code3
Definitioncropland
Source
Code Definition
Code4
Definitionwetland
Source
Unitmillimeter
Typeinteger
Min-2084 
Max1382 
Unitcelsius
Typereal
Min-12 
Max26.1 
Unitmillimeter
Typenatural
Min106 
Max2449 
Missing Value Code:
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
CodeNA
Explmissing data
Accuracy Report:                                                                                                          
Accuracy Assessment:                                                                                                          
Coverage:                                                                                                          
Methods:                                                                                                          

Data Package Usage Rights

This information is released under the Creative Commons license - Attribution - CC BY (https://creativecommons.org/licenses/by/4.0/). The consumer of these data ("Data User" herein) is required to cite it appropriately in any publication that results from its use. The Data User should realize that these data may be actively used by others for ongoing research and that coordination may be necessary to prevent duplicate publication. The Data User is urged to contact the authors of these data if any questions about methodology or results occur. Where appropriate, the Data User is encouraged to consider collaboration or co-authorship with the authors. The Data User should realize that misinterpretation of data may occur if used out of context of the original study. While substantial efforts are made to ensure the accuracy of data and associated documentation, complete accuracy of data sets cannot be guaranteed. All data are made available "as is." The Data User should be aware, however, that data are updated periodically and it is the responsibility of the Data User to check for new versions of the data. The data authors and the repository where these data were obtained shall not be liable for damages resulting from any use or misinterpretation of the data. Thank you.

Keywords

By Thesaurus:
(No thesaurus)Macrosystems Biology, NEON, Soil, Soil Organic Matter, Biogeochemistry, Soil Horizon, Soil Carbon, Oxalate, Citrate Dithionite, Poorly Crystalline Mineral, Iron, Aluminum, Silicon, Soil Extraction, Metal

Methods and Protocols

These methods, instrumentation and/or protocols apply to all data in this dataset:

Methods and protocols used in the collection of this data package
Description:

We downloaded the provisional NEON datasets referenced below on December 12, 2020. These samples were collected and analyzed by USDA NRCS staff in collaboration with NEON, and further information about methodology of sample collection and analysis is available in the original datasets cited below. Inspection of the data revealed that metal concentrations reported for the citrate-dithionite extraction had been rounded to a single decimal place, so these values were not readily comparable with the ammonium oxalate extraction. Therefore, we obtained the raw metal concentrations from the USDA NRCS staff who conducted the original analyses (USDA NRCS, personal communication, April 15, 2021); the new citrate-dithionite data reported here were rounded to two decimal places for comparison to the ammonium oxalate extraction data.

Multiple data files from these downloaded NEON datasets (spc_particlesize.csv, spc_perhorizon.csv, spc_perplot.csv, spc_biogeochem.csv) were merged in R (version 3.6.3) to create a single file, using “horizonID” or “pitID” as the common variable for each sample, and columns with extraneous information not immediately relevant to our analyses were removed. Climate data from the WorldClim database (Fick and Hijmans 2017) were added for each sample as described in a previous related publication (Yu et al., 2021). We also added vegetation type data obtained from the R package “geoNEON” (NEON 2020), following the terminology of the National Land Cover Database; these were binned into four broader classes (forest, grassland/shrubland, wetland, cropland) for subsequent use as model predictors, following Yu et al. (2021).

After combining these datasets into a single file, we trimmed the dataset and created additional predictor variables using the following criteria. Horizons classified as organic by NRCS staff (O horizon) were removed, along with any samples with SOC concentrations > 200 mg/g (consistent with the definition of an O horizon). Samples with undetectable SOC were also removed. For use as predictor variables, concentrations of silt- and clay-sized particles were summed, as were concentrations of exchangeable calcium and magnesium. The mass concentrations of Al + 0.5Fe in the oxalate extraction were summed for use as a predictor, as this has traditionally been used as an approximation for the molar sum of poorly-crystalline Fe and Al. Crystalline iron (Fe) phases were estimated as the difference between citrate-dithionite and oxalate extractable Fe; two samples with large negative values (< -1%) were removed, and remaining negative values were set to zero.

We include two versions of this dataset, each used for different purposes. The dataset “NEON_distributed_soil_dataset_for_gamm.csv” was used for statistical analyses (generalized additive mixed models, GAMMs) to test relationships between SOC and extractable metals. Here, 9 samples with oxalate-extractable Al and Fe > 3% were excluded, as they had very high leverage. We also removed any samples with missing values for any predictor used in the statistical analyses: oxalate-extractable Al, Fe or Si, citrate-dithionite-extractable Al or Fe, or silt+clay. This left 2540 soil horizon samples for GAMM analysis. The second dataset “NEON_soil_chem_for_metals_plots.csv” was used to examine pairwise relationships between metals in the citrate-dithionite and ammonium oxalate extractions. No samples were removed from this dataset, for a total of 2,596 samples. We conducted preliminary analyses where we assessed the of ratios of metals (iron, aluminum, silicon) in the oxalate and citrate-dithionite extractions, so zero values for any metal were replaced with a small positive number (0.001%; smallest reported positive values in the dataset were 0.01%), and mass concentrations were converted to mole concentrations for bivariate analyses. Andisols or Spodosols or soils with andic designators in Family or Subgroup names in the USDA soil taxonomy were denoted with a binary categorical variable for analysis purposes (i.e., comparison of all soils, soils with andic/spodic properties, and soils without andic/spodic properties).

<emphasis role="strong">References</emphasis>

Fick, S. E., & Hijmans, R. J. (2017). WorldClim 2: New 1-km spatial resolution climate surfaces for global land areas. International Journal of Climatology, 37(12), 4302–4315. https://doi.org/10.1002/joc.5086

NEON (National Ecological Observatory Network). Soil physical properties, distributed initial characterization (DP1.10047.001). https://data.neonscience.org (accessed December 12, 2020)

NEON (National Ecological Observatory Network). Soil chemical properties, distributed initial characterization (DP1.10008.001). https://data.neonscience.org (accessed December 12, 2020)

National Ecological Observatory Network (2020). geoNEON: Geolocation Data Access for NEON Data. R package version 1.0.0.9100.

Yu, W, Weintraub, S.R., and S.J. Hall. 2021. <link xlink:href="https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1029/2020GB006781">Climatic and geochemical controls on soil carbon at the continental scale: interactions and thresholds</link>. Global Biogeochemical Cycles 35: e2020GB006781.

Description:

This method step describes provenance-based metadata

This provenance metadata does not contain entity specific information.

Data Source
Soil physical and chemical properties, distributed initial characterization (DP1.10047.001)

People and Organizations

Publishers:
Organization:Environmental Data Initiative
Email Address:
info@environmentaldatainitiative.org
Web Address:
https://environmentaldatainitiative.org
Creators:
Individual: Steven J Hall
Organization:Iowa State University
Email Address:
stevenjh@iastate.edu
Id:https://orcid.org/0000-0002-7841-2019
Individual: Aaron Thompson
Organization:University of Georgia
Email Address:
aaront@uga.edu
Id:https://orcid.org/0000-0001-6301-7377
Contacts:
Individual: Steven J Hall
Organization:Iowa State University
Email Address:
stevenjh@iastate.edu
Id:https://orcid.org/0000-0002-7841-2019

Temporal, Geographic and Taxonomic Coverage

Temporal, Geographic and/or Taxonomic information that applies to all data in this dataset:

Time Period
Begin:
2020-12-01
End:
2021-08-01
Geographic Region:
Description:North America
Bounding Coordinates:
Northern:  69Southern:  18
Western:  -150Eastern:  -65

Project

Parent Project Information:

Title:Collaborative Research: MSB-ECA: Resolving controls on lignin decomposition at the continental scale to reconcile classical and modern paradigms of soil organic matter
Personnel:
Individual: Steven J Hall
Organization:Iowa State University
Email Address:
stevenjh@iastate.edu
Id:https://orcid.org/0000-0002-7841-2019
Role:Principal Investigator
Funding: NSF DEB 1802745

Maintenance

Maintenance:
Description:completed
Frequency:
Other Metadata

Additional Metadata

additionalMetadata
        |___text '\n    '
        |___element 'metadata'
        |     |___text '\n      '
        |     |___element 'unitList'
        |     |     |___text '\n        '
        |     |     |___element 'unit'
        |     |     |     |  \___attribute 'id' = 'centimolePerKilogram'
        |     |     |     |  \___attribute 'multiplierToSI' = '100'
        |     |     |     |  \___attribute 'name' = 'centimolePerKilogram'
        |     |     |     |  \___attribute 'parentSI' = 'molePerKilogram'
        |     |     |     |  \___attribute 'unitType' = 'amountOfSubstanceWeight'
        |     |     |     |___text '\n          '
        |     |     |     |___element 'description'
        |     |     |     |     |___text '1/100 of a mole per kilogram'
        |     |     |     |___text '\n        '
        |     |     |___text '\n        '
        |     |     |___element 'unit'
        |     |     |     |  \___attribute 'id' = 'gramPerKilogram'
        |     |     |     |  \___attribute 'multiplierToSI' = '1/1000'
        |     |     |     |  \___attribute 'name' = 'gramPerKilogram'
        |     |     |     |  \___attribute 'parentSI' = 'gramPerGram'
        |     |     |     |  \___attribute 'unitType' = 'massPerMass'
        |     |     |     |___text '\n          '
        |     |     |     |___element 'description'
        |     |     |     |     |___text 'gram per kilogram'
        |     |     |     |___text '\n        '
        |     |     |___text '\n      '
        |     |___text '\n    '
        |___text '\n  '

EDI is a collaboration between the University of New Mexico and the University of Wisconsin – Madison, Center for Limnology:

UNM logo UW-M logo