A Blueprint of Terminology Services in Earth System Science
Slideshow
Overview
abstract
- The exponential growth of data due to technological developments, combined with the increased recognition of research data as a relevant research output over the last decades, highlights fundamental challenges in terms of interoperability, reproducibility and re-use of scientific information. Research in Earth System Science is at its core cross-disciplinary, encompassing diverse fields such as palaeontology, marine science, biodiversity research, atmospheric sciences and molecular biology. Furthermore, different types of data, such as observations and simulations, and a wide range of data sizes (from climate model output to images of specimens) illustrate the challenges of implementing the I and R in FAIR data principles in particular. Different disciplines have developed their own methods and terms for indexing, cataloguing, describing and retrieving scientific data, resulting in a large number of controlled vocabularies, taxonomies and thesauri. However, this volume of disparate semantic artefacts often contains ambiguous, duplicate and inconsistent terms, posing numerous challenges for interoperability and standardisation, particularly for automated data selection. Terminologies can help by enabling researchers and infrastructure providers to realise a machine-processable expression of the information contained in their research data and other scholarly outputs. The BITS project (BluePrints for the Integration of Terminology Services in Earth System Sciences) aims to address the inadequate implementation of encoding semantics by establishing a terminology service (TS) that can serve the entire Earth System Science (ESS) community at national, European and international level. It is part of the existing terminology service of the TIB - Leibniz Information Centre for Science and Technology, which will be maintained on a long-term basis. Within this service, an ESS Collection already contains 40 terminologies relevant to the ESS, with the option to add more. New terminologies for the ESS collection can be suggested at any time via the ESS homepage, and new terms for terminologies hosted on Github can also be suggested and forwarded to the developers of that terminology. The implementation of this TS in two data repositories (World Data Center for Climate at the German Climate Computing Centre and a data collection at Senckenberg - Leibniz Institution for Biodiversity and Earth System Research) will demonstrate the benefits for two very different use cases at WDCC and Senckenberg.
authors
status
publication date
- 2025