abstract
- The diversity of research fields in the engineering sciences has led to numerous individual contributions on metadata for research data management within NFDI4ING [1]. To standardize these contributions and promote the consolidation of metadata schemas, this work describes the development of a Common Information Model (CIM). The CIM will be realized as a comprehensive ontology based on Metadata4Ing, the mid-level ontology for modeling scientific workflows [2]. It will integrate FAIR data concepts such as SciMesh [3] for experimental provenance metadata, including the handling of samples, DataDesc [4] for describing interfaces within computational workflows, and the HPC-Ontology [5] for metadata in high-performance computing workflows. The CIM will serve as the foundation for NFDI4ING's services while also enabling connections to other relevant terminologies. It will encompass all pertinent engineering entities, including datasets, software, actors, methods, tools, instruments and their configurations, as well as mathematical models, building upon existing vocabularies and ontologies both within and outside the consortium. The Metadata4Ing (m4i) ontology serves as a bridge, linking and enriching engineering-related research data with software, methods, instruments, and tools. The m4i ontology is accessible via the NFDI4ING Terminology Service [6]. Its concepts can also be used in the NFDI4ING Metadata Profile Service [7] to create, share, and maintain subject-specific metadata profiles in the form of SHACL shapes. These metadata profiles can in turn be used to structure knowledge graphs in platforms such as Coscine [8]. The CIM aims to preserve this level of service integration while coordinating and aligning with recent developments in related fields, including the MatWerk Ontology [9] and the PMD Core Ontology [10] for materials science, as well as MathModDB [11] for mathematics.
Knowledge Graphs are designed to facilitate seamless information networking; providing them supports the standardized integration of information sources and reduces information silos. The CIM will form the core of the Knowledge Graphs provided by NFDI4ING and will serve as the basis for harmonized, federated queries that enable extensive reuse of research data. NFDI4ING will offer services to integrate the creation of FAIR Digital Objects (FDOs) into existing scientific workflows. During the first funding period, NFDI4ING already made progress in the development of Knowledge Graphs. The second funding period will focus on establishing a federation of Knowledge Graphs and ensuring the availability of FDOs within this framework, including the provision of interfaces for programmatic access. The instantiation of FDOs is planned in close collaboration with the Working Group on Knowledge Graphs within NFDI-MatWerk, the NFDI section on Metadata, Terminologies, and Provenance, and the PID4NFDI base service. The development of a CIM and of a federated Knowledge Graph ecosystem will help harmonize the services developed within NFDI4ING. As a result, the CIM will be consolidated with the NFDI Task Force Metadata and the Working Group on Ontology Harmonization. In addition, the CIM will help integrate NFDI4ING services into the daily work of researchers within the community.