
This work package collects use cases based on the requirements of scientific communities. Pilots will run to validate the proposed solutions, which will be trans-national by design and general enough to be extended to other communities with minimal changes.
The pilots will also include resources and cost study, in order to understand their feasibility and propose a viable business model for the resulting services. The selected use-cases are connected to real scientific production and intent to improve the usage of data in a FAIR perspective.
The need for “data FAIRisation” is important, diversified and specific for each community (nanotechnologies, environment -ocean, atmosphere, continental surfaces, health, humanities, biodiversity, solid earth). They are more or less structured and have developed tools and services adapted to their specific needs and constraints. The objective is to prepare scientific communities to be involved in EOSC.
The initial set of scientific use cases will be expanded through an open call for participation, inviting communities and their developers to bring forward thematic services to enhance the portfolio of the project and the EOSC Portal.
The main objectives of this work package are to:
A bottom-up approach will be adopted from use-cases covering several themes. Each use-case will analyse different tools and services for the FAIRisation of data and services and type of governance according to the community needs.
Pilot use-case analysis must take into account the entire chain of data use, from acquisition to scientific exploration.
![]() |
Defining procedures and services to enforce data provenance for thematic communities and beyondTo enable a well-defined data provenance in the scientific experiments workflow, from data production to data usage, this task will elaborate cross-domain, FAIR-oriented procedures and recommendations to enforce data provenance and developing the proper adaptations/extensions on top of existing scientific data services, made available by the participating partners, at the national level Lead: CNR |
![]() |
Agile FAIR data for environment and earth system communitiesThis use case will build a representative set of FAIR-oriented use cases based on Earth System communities engagement (ocean, atmosphere, continental surfaces, solid earth, etc.) supporting services for visualization, navigation and processing of data on demand for involved communities users and service providers Lead: IFREMER |
![]() |
Integration of data repositories into EOSC based on communities approachesBased on existing community data repositories, and specially agri-food, material sciences, heritage data, source code etc, this use case will explore the current limitations and provide concrete recommendations and tools to address interoperability of data repositories within the future EOSC infrastructure. The main goal is to connect and align dataverse and other repositories with EOSC specifications and to reduce fragmentation of the landscape. Lead: INRA |
![]() |
Software source code preservation, reference and accessLeveraging the experience of Software Heritage, this task will design and pilot a solution for the preservation of massive collections of softwares source code (billions of files with links to publications) into EOSC eTDR (European Trusted Digital Repository) service. Lead: INRIA |
![]() |
FAIR principles in data life-cycles for HumanitiesThis case will be a model for linking with other data repositories used in social science and humanities (SSH) communities. It will also liaise with CO-OPERAS GO-FAIR Implementation Network to ensure that SSH needs are correctly taken in account in order to facilitate the integration of SSH in EOSC. Lead: CNRS |
![]() |
Exploring reference data through existing computing services for the bioinformatics communityThe aim of this use case will be to explore the possible interactions between already available Galaxy computing services Lead: INSERM |
![]() |
Suitable data formats for seismological big data provisioning via web servicesThis use case will focus on keeping FAIRness in this community through evaluating new data formats suitable for this use case, testing and implementing tailor made services where possible within the currently defined standards. Finally, service and data quality tests will be performed to assess their quality and usability. Lead: GFZ |
![]() |
Virtual definition of big datasets at seismological data centres according to RDA recommendationsResearch data management practice requires not just describing data collections, but to make them actionable by automated processes to cope with ever increasing volumes of data. The goal of this use case is to provide a production-ready implementation of a system following the Research Data Alliance (RDA) Recommendations from the Research Data Collections Working Group. Lead: GFZ |
![]() |
Integrating heterogenous data on cultural heritageThe management of digital objects remains an area of interest that crosses disciplines, institutions and infrastructures. In this context, the need for building aggregations or collections of such objects has become an essential element. The goal of this use case is to port the TEXTCROWD-a service in EOSC for multiple languages and compare the performance with TEXTCROWD-b developed with a different approach. Lead: INFN |