From 'dark' and 'cold' to FAIR: steps towards a trusted preservation repository for scientific data at EPFL

Abstract

In 2020, EPFL introduced [ACOUA][1], a dedicated service for managing 'cold' datasets, such as supplementary data for articles and doctoral theses, data from completed research projects, or data retained after a lab closes or a researcher leaves. Initially, the project aimed to address both dissemination and long-term preservation needs. Following the tender process, however, the implemented service is oriented solely towards preservation. Consequently, ACOUA is a 'dark' archive, accessible only to authenticated users, which makes it challenging to communicate the full value of the service to researchers.

Overall, ACOUA provides robust, OAIS-compliant workflows, from data ingestion and curation to audit and retrieval, backed by detailed documentation and dedicated support for end users. However, feedback from researchers emphasizes the crucial importance of '[FAIRification][2]' for preserved objects. This poster outlines the challenges encountered, the strategies applied, and the solutions implemented to meet the FAIR principles. Highlights of the current state include:

FINDABLE: A locally developed DataCite v4.4 metadata generator streamlines the process of adding metadata files to preserved objects.

ACCESSIBLE: Each preserved object is assigned a unique DataCite DOI. Pre-signed URLs enable public access to datasets from ACOUA. A specially developed connector facilitates the automatic transfer of datasets from ACOUA to Zenodo.

INTEROPERABLE: Currently, we maintain a basic OAI-PMH repository.

REUSABLE: Licenses and readme files are mandatory in ACOUA. DROID characterization and JHOVE validation of file formats are integrated into the process. A checksum check ensures the integrity of retrieved objects.

Nevertheless, more needs to be done to reach full FAIRness. The poster also introduces new developments being considered for future work. Examples include, but are not limited to:

Use features of the preservation tool to convert files to open formats at ingestion time (SIP), or to prevent obsolescence by migrating file formats on AIP/DIP copies.

Explore existing ontologies for specific data collections within ACOUA.

Integrate the linked data version of PRONOM, when available.

Our ultimate goal is to establish ACOUA as a '[FAIR over time][3]' and trustworthy repository for an enlarged community of re-users.

[1]: https://actu.epfl.ch/news/acoua-a-new-archive-for-preservation-of-research-d/
[2]: https://www.go-fair.org/fair-principles/fairification-process/
[3]: https://zenodo.org/records/5797776
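
As an illustration of the checksum check listed under REUSABLE, the following is a minimal sketch of how the integrity of a retrieved file could be verified. It is not ACOUA's actual implementation: the abstract does not name the hash algorithm, so SHA-256 is an assumption, and the function names `sha256_of` and `verify_retrieved` are hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks.

    SHA-256 is an assumption; the archive may use another algorithm.
    """
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large datasets do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_retrieved(path: Path, expected_hex: str) -> bool:
    """Compare a retrieved file's digest with the checksum recorded at ingestion."""
    return sha256_of(path) == expected_hex.strip().lower()
```

A caller would pass the path of the retrieved file together with the checksum stored alongside the preserved object; a `False` result would indicate that the copy differs from what was ingested.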

Details

Creators
Alessandra Bianchi; Micha d'Ans
Date
2024-09-18 15:30:00 +0100
Keywords
approaches to preservation; start 2 preserve
Publication Type
poster
License
Creative Commons Attribution 4.0 (CC-BY-4.0)