Section (Meta)data, Terminologies, Provenance

About us

To ensure that research data can be found and used efficiently, it must be associated with metadata. Metadata is information about the data, e.g., about the researchers who collected the data or about the license under which the data may be used.
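As a minimal sketch of what such metadata can look like in machine-readable form, the following Python snippet describes a dataset by its title, its creators, and its license. The class `DatasetMetadata`, its field names, and the sample values are assumptions made for illustration only; they do not represent an NFDI metadata format.

```python
from dataclasses import dataclass, asdict
from typing import List
import json


@dataclass
class DatasetMetadata:
    """Hypothetical minimal metadata record; field names are illustrative."""
    title: str
    creators: List[str]  # researchers who collected the data
    license: str         # license under which the data may be used


# Sample values are invented for this example.
record = DatasetMetadata(
    title="Example measurement series",
    creators=["Jane Doe"],
    license="CC-BY-4.0",
)

# Serialize to JSON so that software can read and exchange the record.
record_json = json.dumps(asdict(record), indent=2)
print(record_json)
```

Real metadata standards define many more fields and controlled values; the point here is only that metadata is structured information kept alongside the data itself.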

The section supports processes of the NFDI consortia in jointly developing and establishing common data and metadata standards as a foundation for effective reuse of research data in accordance with FAIR principles.

To link data from different sources across disciplines, semantic technologies (ontologies, knowledge graphs) are to be integrated into NFDI; these make it possible to generate knowledge and information in machine-readable form. Uniform documentation procedures will make the creation and processing context (provenance) of data transparent.

In the topic area of metadata and research data, the section addresses issues of (meta)data harmonization, discoverability of data, general data and metadata standards with a perspective on a possible NFDI core metadata format, as well as format conversions and persistent identifier systems.

In the topic area of terminologies, the section will address cross-community and cross-disciplinary definitions of top-level ontologies and mappings of ontologies, as well as best practices for modeling terminologies, vocabularies, and ontologies, and data integration services based on them.

In the area of provenance, the section will address legal, technical, and cultural aspects of the context of creation of (meta)data and design proposals for uniform and traceable documentation procedures to answer the questions of what, where, when, who, how, and why of data creation and data processing.

The section will work in close cooperation with the sections “Common Infrastructures” and “Ethical, Social and Legal Aspects”, especially on the topics of terminologies and provenance.

Goals

The section includes all consortia and communities and strives to connect and harmonize developments in consortia that work with similar data structures, standards, and tools in the section's topics and thus pursue similar goals:

  • Cooperative definition of top-level ontologies / mapping of ontologies.
  • Concepts and recommendations for the harmonization of (meta)data and terminology services for the NFDI and beyond.
  • Definition of cross-disciplinary minimum information standards / metadata for the scientific disciplines represented by NFDI.
  • Development of concepts for annotating metadata elements with concepts from terminologies.
  • Overview and evaluation of existing approaches and (emerging) best practices, harmonization of interfaces.

The section thus paves the way for the exploitation of synergies in the development of domain-independent metadata standards and use of infrastructure, and creates connecting nodes for the domain-specific integration and validation of basic services.

Working groups

Cookbooks, Guidance and Best Practice

Researchers need guidance in discovering and applying standards and tools for (meta)data, terminologies, and provenance that address their specific needs. In NFDI, each consortium identifies, collects, and develops approaches that might be relevant to the broader research activities in its communities. The Working Group “Cookbooks, Guidance, and Best Practices” aims to identify, collect, exchange, and, where appropriate, harmonize concepts, workflows, and best practices from the consortia on these topics, and will develop support strategies to advance the production and provision of high-quality metadata.

Contact: Susanne Arndt

Terminology Services

Terminologies are at the core of FAIR metadata for research data management (RDM). They have several distinct characteristics: they describe domain knowledge (e.g., chemistry, biology, and information science each use their own sets of terminologies), they evolve constantly, and they need to be accepted and agreed upon (mapped) within their knowledge domain and beyond. The overall goal of the Terminology Services Working Group is to drive common developments, share best practices, and reduce barriers to the use of terminologies.

Contact: Roman Baum

Ontology Harmonization and Mapping

Consistent use of terminologies and the organization of vocabulary items into structured ontologies are prerequisites of FAIR data, in particular for cross-community reuse and machine-actionability of research data. Each scientific discipline and each consortium in NFDI comes with pre-existing terminologies shaped by its own needs. These terminologies tend to overlap conceptually and, as in natural language, there are different ways of expressing the same concept. Harmonization refers to agreeing on a common terminology, whereas mapping refers to an “agreement to disagree” that relies on a well-defined interface between terminologies. This working group will coordinate this essential work for the federation of data that are structured with domain ontologies.
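The difference between harmonization and mapping can be sketched in a few lines of Python. All term identifiers (`chem:Specimen`, `bio:Sample`, `shared:Sample`) and the helper functions below are hypothetical and chosen only for illustration; they are not taken from any real ontology or NFDI service.

```python
from typing import Dict, Optional

# Mapping: each community keeps its own term, and an explicit
# cross-walk acts as the interface between the two terminologies.
MAPPING: Dict[str, str] = {"chem:Specimen": "bio:Sample"}

# Harmonization: both communities agree to use one common term.
HARMONIZED: Dict[str, str] = {
    "chem:Specimen": "shared:Sample",
    "bio:Sample": "shared:Sample",
}


def translate(term: str, mapping: Dict[str, str]) -> Optional[str]:
    """Return the mapped counterpart of a term, or None if none is declared."""
    return mapping.get(term)


def harmonize(term: str, harmonized: Dict[str, str]) -> str:
    """Return the agreed common term, or the term unchanged if none exists."""
    return harmonized.get(term, term)


print(translate("chem:Specimen", MAPPING))
print(harmonize("bio:Sample", HARMONIZED))
```

With a mapping, both original terms stay in use and data must be translated at the boundary; with harmonization, both communities converge on the same identifier and no translation step is needed.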

Contact: Holger Israel

Search and Harvesting

This working group focuses on how to extract, find, and use (meta)data, both for discovery and consumption of data by researchers (search/discovery) and for aggregation and indexing by discovery services (harvesting). The actors can be humans or machines.

Contact: Brigitte Mathiak, Heinrich Widmann

Speaker: Dr. Oliver Koepler

(TIB – Leibniz Information Centre for Science and Technology University Library)

Deputy speaker: Dr. Brigitte Mathiak 

(GESIS – Leibniz Institute for Social Sciences e.V.)

Participating institutions
  • Akademie der Wissenschaften und der Literatur Mainz
  • Akademie der Wissenschaften zu Göttingen
  • Bayerische Akademie der Wissenschaften
  • Bayerische Staatsbibliothek
  • Bundesverband Theatersammlungen in Deutschland (TheSiD) e.V.
  • Christian-Albrechts-Universität zu Kiel
  • Deutsche Nationalbibliothek
  • Deutsche Zentralbibliothek für Medizin (ZB MED) – Informationszentrum Lebenswissenschaften
  • Deutsches Elektronen-Synchrotron DESY
  • Deutsches Klimarechenzentrum GmbH
  • Deutsches Primatenzentrum GmbH – Leibniz-Institut für Primatenforschung
  • Deutsches Zentrum für Hochschul- und Wissenschaftsforschung GmbH (DZHW)
  • Deutsches Zentrum für Luft- und Raumfahrt (DLR)
  • digiCULT-Verbund eG
  • DIPF Leibniz-Institut für Bildungsforschung und Bildungsinformation
  • Eberhard Karls Universität Tübingen
  • FAIR-DI e.V.
  • FernUniversität in Hagen
  • FIZ Karlsruhe – Leibniz-Institut für Informationsinfrastruktur GmbH
  • Forschungszentrum Jülich GmbH
  • Freie Universität Berlin
  • Friedrich-Schiller-Universität Jena
  • GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel
  • Georg-August-Universität Göttingen
  • Germanisches Nationalmuseum – Leibniz-Forschungsmuseum für Kulturgeschichte (GNM)
  • Gesellschaft für Biologische Daten e.V. (GFBio e.V.)
  • Gesellschaft für Informatik e. V.
  • GESIS – Leibniz-Institut für Sozialwissenschaften e.V.
  • Heidelberger Akademie der Wissenschaften
  • Helmholtz Zentrum für Infektionsforschung
  • Helmholtz-Zentrum für Umweltforschung GmbH – UFZ
  • Helmholtz-Zentrum Hereon
  • Herzog August Bibliothek Wolfenbüttel
  • HITS gGmbH
  • Humboldt-Universität zu Berlin
  • IAML – Deutschland e. V.
  • Innovationszentrum für Prozessdatentechnik (Inprodat) e.V.
  • Internationales Quellenlexikon der Musik e. V.
  • Karlsruher Institut für Technologie
  • Klassik Stiftung Weimar
  • Leibniz-Institut DSMZ-Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH
  • Leibniz-Institut für Agrarlandschaftsforschung (ZALF) e.V.
  • Leibniz-Institut für Astrophysik Potsdam (AIP)
  • Leibniz-Institut für Bildungsverläufe (LIfBi) e.V.
  • Leibniz-Institut für Deutsche Sprache (IDS)
  • Leibniz-Institut für Katalyse e.V.
  • Leibniz-Institut für Maritime Geschichte
  • Leibniz-Institut für Pflanzenbiochemie
  • Leibniz-Institut für Photonische Technologien e.V. (IPHT)
  • Ludwig-Maximilians-Universität München
  • Martin-Luther-Universität Halle-Wittenberg
  • Max Weber Stiftung – Deutsche Geisteswissenschaftliche Institute im Ausland
  • Otto-Friedrich-Universität Bamberg
  • Philipps-Universität Marburg
  • Physikalisch-Technische Bundesanstalt
  • Technische Hochschule Köln
  • Technische Universität Braunschweig
  • Technische Universität Darmstadt
  • Technische Universität Dortmund
  • Technische Universität Dresden
  • Technische Universität Kaiserslautern
  • TIB – Leibniz-Informationszentrum Technik und Naturwissenschaften und Universitätsbibliothek
  • Universität Bielefeld
  • Universität Bremen
  • Universität der Künste Berlin
  • Universität Duisburg-Essen
  • Universität Leipzig
  • Universität Paderborn
  • Universität Stuttgart
  • Universitätsmedizin Greifswald KdöR
  • Verbundzentrale des GBV (VZG)
  • Verein für Computergenealogie e.V.
  • Westfälische Wilhelms-Universität Münster (WWU)
  • Zoologisches Forschungsmuseum Alexander Koenig – Leibniz-Institut für Biodiversität der Tiere
Participating persons
  • Volker Adam
  • Ivonne Anders
  • Tobias Arera-Rütenik
  • Christian Aßmann
  • Linda Baldewein
  • Renate Behrens
  • Alexander Berg-Weiß
  • Noemi Betancort
  • Mirjam Blümm
  • Thomas Bocklitz
  • Christian Bonatto Minella
  • David Broneske
  • Torsten Bronger
  • Björn Brötz
  • Jan Bumberger
  • Boyke Bunk
  • Silvia Chiacchiera
  • Andreas Christ
  • Philipp Cimiano
  • Roberto Cozatl
  • Andreas Daniel
  • Silvia Daniel
  • Michael Denker
  • Jürgen Diet
  • Stephan Dörschel
  • Frank Dührkohp
  • Ulrike Ehmig
  • Harry Enke
  • Michael Finkel
  • Juliane Fluck
  • Konrad Förstner
  • Marc Fuhrmans
  • Christoph Garth
  • Eva-Maria Gerstner
  • Luca Ghiringhelli
  • Martin Golebiewski
  • Sandra Göller
  • Peter Grobe
  • Matthias Grönewald
  • Stephan Hachinger
  • Michael Hanke
  • Felix Henninger
  • Christin Henzen
  • Christian Himpe
  • Julian Hniopek
  • Volker Hofmann
  • Timothy Holbrook
  • Dorothea Iglezakis
  • Holger Israel
  • Ariane Jeßulat
  • Angela Kailus
  • Naouel Karam
  • Ashish Karmacharya
  • Bhavin Katabathuni
  • Jürgen Kett
  • Wibke Kleina
  • Paul Klemm
  • Olaf Kletke
  • Oliver Koepler
  • Chantal Köppl
  • Anne Koziolek
  • Angela Kranz
  • Alessandra Kuntz
  • Herbert Lange
  • Elfrun Lehmann
  • Johannes Liermann
  • David Linke
  • Brigitte Mathiak
  • Katrin Moeller
  • Robert Nasarek
  • Steffen Neumann
  • Dennis Niewerth
  • Sven Ohmann
  • Judith Pfeiffer
  • Klaus Pietschmann
  • Renate Polak-Bennemann
  • Etienne Posthumus
  • Andreas Redelbach
  • Stephanie Rehwald
  • Lorenz Reimer
  • Thomas Richter
  • Raphael Ritz
  • Thomas Rose
  • Daniel Röwenstrunk
  • Harald Sack
  • Stefan Sandfeld
  • Kristin Sauerland
  • Björn Schembera
  • Hansjörg Scherberger
  • Gunther Schiefer
  • Sirko Schindler
  • Carsten Schmidt
  • Thomas Schörner
  • Torsten Schrade
  • Daniela Schulz
  • Emanuel Söding
  • Xenia Specka
  • Arnost Stanzel
  • Regine Stein
  • Rainer Stotzka
  • Martin Stricker
  • Dieta Svoboda-Baas
  • Dzulia Terzijska
  • Thorsten Trippel
  • Björn Usadel
  • Frank Weber
  • Dirk Weisbrod
  • Jörg Wettlaufer
  • Heinrich Widmann
  • Rebecca Wilm
  • Dirk Wintergrün
  • Atinkut Alamirrew Zeleke