Chem-DCAT-AP: New metadata profile for chemical research data

With the publication of the Chem-DCAT-AP specification (version v0.1.0rc3), a domain-specific metadata profile is now available for the first time; it builds on the established DCAT-AP standard and specifically addresses the requirements of the chemical sciences.

The starting point is the observation that, whilst generic metadata standards such as DCAT enable the interoperable description of datasets, they are only of limited suitability for representing complex scientific contexts. This applies in particular to chemical data, where information on substances, samples, measurement methods or experimental conditions plays a central role.

Chem-DCAT-AP addresses this gap through a targeted extension of DCAT-AP. The profile introduces additional classes and properties that enable chemistry-specific aspects to be captured in a more structured manner. These include, amongst other things, the description of samples, links to analytical methods, and the representation of provenance information. The aim is to improve the traceability and reusability of data without compromising compatibility with existing DCAT-based systems.

The current release version represents an interim stage. It contains a consolidated model, initial technical artefacts, and examples of application. At the same time, the specification is explicitly marked as a ‘Release Candidate’ and will be further developed in consultation with the community. Feedback from practical use will be incorporated into future versions.

For NFDI4Chem, Chem-DCAT-AP is a building block within the work on standardising metadata and interfaces. In the long term, the profile can help to describe data from different sources – such as repositories or electronic lab notebooks – more consistently and link them together more effectively. In doing so, it supports key objectives such as interoperability and FAIR data, without solving them on its own.

Development is taking place in coordination with NFDI4Cat, particularly in the context of the overarching extension DCAT-AP+. Whilst this collaboration provides the conceptual foundations, the focus of Chem-DCAT-AP lies on concrete implementation for chemical use cases. Its practical relevance will be measured by the extent to which the profile is implemented in existing systems and adopted by the community.

The first response came from the DCAT-AP team at SEMIC, who found the work very interesting. It was agreed that we would soon present DCAT-AP+ & ChemDCAT-AP to the DCAT-AP community and propose our DCAT-AP+ approach for reintegration.