Consent

This site uses third party services that need your consent.

Logo - NFDI4 DataScience

NFDI4DataScience

Contact
Employee photo Mr. Adamantios Koumpis
Adamantios Koumpis (Dr.) Research and Teaching Coordinator
Project Status
running
Kick-off Date
March 10, 2026

Description

The vision of "Nationale Forschungsdateninfrastruktur for Data Science" (NFDI4DataScience, NFDI4DS) is to support all steps of the complex and interdisciplinary research data lifecycle, including collecting/creating, processing, analyzing, publishing, archiving, and reusing resources in Data Science and Artificial Intelligence. The past years have seen a paradigm shift, with computational methods increasingly relying on data-driven and often deep learning-based approaches, leading to the establishment and ubiquity of Data Science as a discipline driven by advances in the field of Computer Science. Transparency, reproducibility and fairness have become crucial challenges for Data Science and Artificial Intelligence due to the complexity of contemporary Data Science methods, often relying on a combination of code, models and data used for training. NFDI4DS will promote fair and open research data infrastructures supporting all involved resources such as code, models, data, or publications through an integrated approach. The overarching objective of NFDI4DS is the development, establishment, and sustainment of a national research data infrastructure (NFDI) for the Data Science and Artificial Intelligence community in Germany. This will also deliver benefits for a wider community requiring data analytics solutions, within the NFDI and beyond. The key idea is to work towards increasing the transparency, reproducibility and fairness of Data Science and Artificial Intelligence projects, by making all digital artifacts available, interlinking them, and offering innovative tools and services. Based on the reuse of these digital objects, this enables new and innovative research. NFDI4DS intends to represent the Data Science and Artificial Intelligence community in academia, which is an interdisciplinary field rooted in Computer Science. We aim to reuse existing solutions and to collaborate closely with the other NFDI consortia and beyond. In the initial phase, NFDI4DS will focus on four Data Science intense application areas: language technology, biomedical sciences, information sciences and social sciences. The expertise available in NFDI4DS ensures that metadata standards are interoperable across domains and that new ways of dealing with digital objects arise.

Collaboration Partner

ZB MED
info@zbmed.de
Learn More
Fraunhofer-Gesellschaft Learn More
Leibniz University Hannover Learn More
Leibniz Center for Informatics Learn More
TIB - Leibniz Information Centre for Science and Technology and University Library Learn More
GESIS – Leibniz Institute for the Social Sciences Learn More
Technische Universität Berlin (TU Berlin) Learn More
RWTH Aachen University Learn More
Dresden University of Technology Learn More
Leipzig University Learn More
DFKI - German Research Center for Artificial Intelligence
info@dfki.de
Learn More
FIZ Karlsruhe – Leibniz Institute for Information Infrastructure Learn More
ZBW – Leibniz Information Centre for Economics Learn More
Leuphana University Lüneburg Learn More

Publications

ELSA and the Data Scientist: A Qualitative Approach

2024 - Open Access -
Maria Christoforaki, Stephanie von Maltzan, Hannes Oellerich

Data Science (DS) and Artificial Intelligence (AI) are transforming research, industry and society at an unprecedented pace, enabling advances in areas such as healthcare, finance, e-commerce and beyond. Despite their potential, the rapid development and widespread use of DS and AI raise (novel) issues of reliability, accuracy, copyright and data protection, and bias and discrimination, among others, see for instance [1], [2], [3], [4]. It is therefore vital for data scientists to acknowledge the Ethical, Legal and Societal Aspects (ELSA) encountered in DS and AI projects, as this can promote critical thinking and reflection, thereby ensuring that data-driven systems and their underlying technologies are developed, deployed and used responsibly. In the framework of NFDI4DataScience, specifically in the Community and Training task area, we aim to develop ELSA guidelines for data scientists [5]. In order to achieve this objective, we tried to assess the landscape by conducting interviews with researchers and practitioners in the field, aiming to identify and analyse the most common ELSA challenges encountered in DS/AI projects and how to cope with them. The interviews were semi-structured interviews as this form is well suited to our purpose of collecting experiences, reflections and opinions from the participants [6]. A total of 30 were conducted between November 2022 and February 2024. The participants came mainly from academia, but the industry was also well represented. The application domains included a.o., healthcare, finance, engineering and digital humanities. In order to systematically interpret the material for manifest and latent meanings, we used qualitative content analysis [7]. Consequently, a categorisation of the material was developed to provide the basis for this interpretation [8, p. 33]. Initial categories were developed deductively, derived from the interview guide, which itself was based on existing theory and research. Subcategories were created inductively from the interviews following initial coding with the main categories. The categories reflected the key ELSA challenges faced by data scientists, including data protection, but also more specifically fairness, transparency, consent, intellectual property, and data scientists' knowledge (and also attitudes) towards ELSA challenges and how they influence their decision-making processes in the project. The results of our analysis reveal that data scientists are generally aware of ELSA issues, some more acutely than others; for example, legal issues, especially data protection, are more prominent, especially in application domains such as healthcare; bias is considered more during the data collection and less in connection to the model used or the system deployment; issues of transparency and explainability are also crucial although not prevalent. Insight was also provided regarding interdisciplinary cooperation, institutionalised ELSA support, and project documentation. Additionally, we have recorded critical assessments of the practices followed, spanning from issues with the application of laws to the responsibility and accountability of practitioners during a project life cycle. Finally, our findings emphasise the necessity of enhancing ELSA literacy and establishing and providing a strong foundational understanding of ethical and legal principles to data scientists. Developing recommendations/best practices for data scientists was regarded as a positive first step towards this goal

Community and Training in NFDI4DS

2023 - Open Access -
Lorenz, Anna-Lena; Christoforaki, Maria; Hennig, Christine; Kraft, Angelie; von Maltzan, Stephanie; Schimmler, Sonja

Ethical, Legal, and Societal Aspects of Data Science as manifested via a series of Interviews conducted within the framework of NFDI4DataScience, Study guide and anonymised transcripts

2025 - Open Access -
Maria Christoforaki, Stephanie von Maltzan

Dataset and Study Report

The upload consists of the two following files: 

  1. The study report on the Ethical, Legal, and Societal Aspects of Data Science, as manifested through a series of Interviews conducted within the framework of NFDI4DataScience Task Area 1: Community and Training. In the interviews, data science researchers and practitioners were asked to identify and analyse the most common legal and ethical challenges encountered in DS projects and how they address them. The objective was to use the collected information to create ELSA guidelines for Data Scientists. The file consists of the study guide itself and the following supplementary material:

    1. Call for participants email

    2. Call for participants email

    3. Email to the interviewees explaining the process

    4. Consent form template

    5. Anonymity Policy Clarification email

    6. Anonymisation Protocol

  2. The anonymized transcripts of the interviews