Deliverable 3.1 pertains to Task 3.5 of the SSHOC project and is a report completed under the auspices of Work Package leaders CESSDA/FSD.
First the report presents an inventory of the varied data formats and metadata standards currently used across the full gamut of research infrastructures managed by the main project partners. This information was gleaned from interviews with experts from the partner organisations where a range of data infrastructures are owned or managed across multiple disciplines. Special attention was paid to interoperability aspects.
The report then offers recommendations as to specific data formats and metadata standards designed to increase interoperability. Finally, it lays out the priorities for the provision of conversion services and planning solutions, which will be the topic of much of the future work in the project.
Recommendations include DDI Codebook for the social sciences and CMDI for the language sciences. The writers also recommend providing conversion into Dublin Core or relaxed DataCite to enable interoperability. In terms of data formats, a comprehensive breakdown is categorised by media type and includes flac for audio and CSV for numeric matrices.
Click here to read the full report.