This data package was submitted to a staging environment for testing purposes only. Use of these data for anything other than testing is strongly discouraged.

Data Package Summary    View Full Metadata

  • Mapping EDI, NEON and DataONE units to the QUDT ontology, 2022
  • Porter, John; University of Virginia
    O'Brien, Margaret; University of California, Santa Barbara
    Frants, Marina; Scripps Institution of Oceanography
    Earl, Stevan; Arizona State University
    Martin, Mary; University of New Hampshire
    Laney, Christine; National Ecological Observatory Network (NEON), Battelle
  • 2024-07-31
  • Porter, J., M. O'Brien, M. Frants, S. Earl, M. Martin, and C. Laney. 2024. Mapping EDI, NEON and DataONE units to the QUDT ontology, 2022 ver 1. Environmental Data Initiative. https://doi.org/DOI_PLACE_HOLDER (Accessed 2024-12-27).
  • In the metadata of digital environmental datasets, automated processing is hindered by the wide variety of representations for unit that may be human-readable, but may not be unambiguous or machine-interpretable, (e.g., grams per square meter, gm/m2, g/m2, gm-2, g/m^2, g.m-2, g m-2 and gramPerMeterSquared). Matching disparate representations of the same unit into a single unit concept from an ontology assists with interpretation and reuse by providing a linkage to a complete unit definitions with label, description, dimensions. Datasets with shared units can be identified during searches, and are more suitable for automating analyses and potential transformation.

    This dataset contains data and code associated with a project to map units in ecological metadata collected between 2013 and 2022 by DataONE, the Environmental Data Initiative and the U.S. National Ecological Observatory Network to the QUDT ontology using successive string transformations. Data entities include

    a) raw metadata as received (355,057 unit instances)

    b) integrated raw data

    c) substitution tables for string transformations

    d) resulting lookup table for 896 distinct units matched to QUDT units

    e) associated R code used for QUDT matching plus a web service and R functions for adding annotation elements to Ecological Metadata Language metadata documents.

    Using these substitutions and code, 91% of unit instances in the raw metadata could be matched to QUDT. Data and results are discussed in “Porter JH, M O’Brien, M Frants, S Earl, M Martin, C Laney. (in review) Using a Units Ontology to Annotate Pre-Existing Metadata. Submitted to Scientific Data.

  • edi.1715.1  (Uploaded 2024-07-31)  
  • This data package is released to the "public domain" under Creative Commons CC0 1.0 "No Rights Reserved" (see: https://creativecommons.org/publicdomain/zero/1.0/). It is considered professional etiquette to provide attribution of the original work if this data package is shared in whole or by individual components. A generic citation is provided for this data package on the website https://portal.edirepository.org (herein "website") in the summary metadata page. Communication (and collaboration) with the creators of this data package is recommended to prevent duplicate research or publication. This data package (and its components) is made available "as is" and with no warranty of accuracy or fitness for use. The creators of this data package and the website shall not be liable for any damages resulting from misinterpretation or misuse of the data package or its components. Periodic updates of this data package may be available from the website. Thank you.
  • DOI PLACE HOLDER
  • Analyze this data package using:           

EDI is a collaboration between the University of New Mexico and the University of Wisconsin – Madison, Center for Limnology:

UNM logo UW-M logo