FAIR Principles

In 2016, FORCE11 – a group of researchers as well as employees of libraries, archives, publishers and funders – established principles for the handling of research data. The so-called FAIR principles comprise four goals: the findability, accessibility, interoperability and re-usability of data. With the achievement of these goals, the sustainable re-usability of research data is meant to be guaranteed.

Biernacka, Katarzyna, Bierwirth, Maik, Buchholz, Petra, Dolzycka, Dominika, Helbig, Kerstin, Neumann, Janna, … Wuttke, Ulrike. (2020). Train-the-Trainer Concept on Research Data Management (Version 3.0). Zenodo. http://doi.org/10.5281/zenodo.4071471 (p. 36), Creative Commons Attribution 4.0 International

Duration: 4:54 mins

Content: In this video we take a look at FAIR data and the meaning of the individual FAIR principles (findable, accessible, interoperable and reusable).
It also covers how (trusted) data repositories are a key infrastructure that enables FAIR data.

Ghent University Data Stewards (2020). Knowledge clip: FAIR data principles. Available at: https://youtu.be/2uZxFu9SFi8

Licence: CC BY 4.0

What are the FAIR principles and what is FAIR data? The FAIR principles are a set of guiding principles that enable and increase the reuse of data by humans and machines. FAIR is an acronym that stands for Findable, Accessible, Interoperable and Reusable. The FAIR principles originated in the life sciences, but can be applied to all disciplines. They are increasingly gaining traction and are becoming a requirement of many research funders, among others. Let's have a look at what each FAIR principle means.

FINDABLE: To enable its discovery, data should be described with rich metadata, and it should be assigned a persistent identifier, such as a digital object identifier or DOI. These metadata should be available online in a searchable resource such as a data catalog or repository.
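As a minimal sketch of the Findable principle, the Python snippet below describes two data sets with small metadata records and searches a toy in-memory catalog. The DatasetRecord class, its field names, the search helper and the DOIs are illustrative assumptions; they do not follow a particular metadata standard, and the identifiers are made up.

```python
from dataclasses import dataclass, field


@dataclass
class DatasetRecord:
    """Hypothetical metadata record; the field names are illustrative only."""
    doi: str                                   # persistent identifier
    title: str
    creators: list[str]
    keywords: list[str] = field(default_factory=list)
    description: str = ""


def search(catalog: list[DatasetRecord], term: str) -> list[DatasetRecord]:
    """Return records whose title, description or keywords mention the term."""
    needle = term.lower()
    return [
        record for record in catalog
        if needle in record.title.lower()
        or needle in record.description.lower()
        or any(needle in keyword.lower() for keyword in record.keywords)
    ]


catalog = [
    DatasetRecord(
        doi="10.1234/example.5678",            # made-up DOI for illustration
        title="Soil moisture measurements 2019",
        creators=["Doe, Jane"],
        keywords=["soil", "hydrology"],
        description="Hourly soil moisture readings from three field sites.",
    ),
    DatasetRecord(
        doi="10.1234/example.9999",            # made-up DOI for illustration
        title="Interview transcripts",
        creators=["Roe, Richard"],
        keywords=["sociology"],
    ),
]

hits = search(catalog, "hydrology")
print([record.doi for record in hits])
```

The richer the record (keywords, description), the more search terms lead to the data set, which is the point of describing data with rich metadata.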

ACCESSIBLE: Metadata and/or the data themselves should be retrievable via their persistent identifier using a standard communication protocol, such as HTTP or HTTPS. This means that following the persistent identifier should take you to the metadata or data. However, keep in mind that accessible does not mean that data must be open in the sense that there are no access restrictions. It rather means that if data has access conditions, these are clear to both humans and machines. Therefore the protocol for accessing the data should allow for an authentication and authorization procedure where necessary. In addition, metadata should be accessible even if the data themselves are no longer available.
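The access rules described above can be sketched in a few lines of Python. This is a toy model under stated assumptions, not how any particular repository works: the Dataset class, the "open" and "restricted" values, the retrieve function and the DOIs are hypothetical. The only real-world element is the https://doi.org/ resolver prefix, which is how a DOI is typically turned into an HTTPS link.

```python
from dataclasses import dataclass
from typing import Optional

DOI_RESOLVER = "https://doi.org/"   # public DOI resolver, reached over HTTPS


@dataclass
class Dataset:
    """Hypothetical data set entry in a repository."""
    doi: str
    metadata: dict
    access: str                     # "open" or "restricted" (illustrative values)
    payload: Optional[bytes]        # None once the data themselves are gone


def landing_url(doi: str) -> str:
    """Build the link that following the persistent identifier leads to."""
    return DOI_RESOLVER + doi


def retrieve(ds: Dataset, authorized: bool = False) -> dict:
    """Always return the metadata; return the data only if conditions are met."""
    response = {
        "url": landing_url(ds.doi),
        "metadata": ds.metadata,
        "access": ds.access,        # access conditions are stated explicitly
        "data": None,
    }
    if ds.payload is None:
        response["note"] = "data no longer available; metadata retained"
    elif ds.access == "open" or authorized:
        response["data"] = ds.payload
    else:
        response["note"] = "authentication and authorization required"
    return response


restricted = Dataset(
    doi="10.1234/example.5678",     # made-up DOI for illustration
    metadata={"title": "Patient survey 2020"},
    access="restricted",
    payload=b"id,answer\n1,yes\n",
)
withdrawn = Dataset(
    doi="10.1234/example.0002",     # made-up DOI for illustration
    metadata={"title": "Pilot study"},
    access="open",
    payload=None,
)

print(retrieve(restricted)["note"])
print(retrieve(restricted, authorized=True)["data"])
print(retrieve(withdrawn)["metadata"])
```

In all three calls the metadata and the access condition are returned, so both a person and a program can tell why the data were or were not delivered.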

INTEROPERABLE: Whenever possible, metadata and data should use recognized standards. By using formats, terms or vocabularies that a community has agreed upon, we make sure our data is understandable by others, and we also make it possible for data to be exchanged and combined across computer systems. Interoperability also involves providing context by including references to other relevant metadata and data, for example by linking to another data set on which your data set is built.
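To illustrate what agreed vocabularies and references to other data can look like in practice, here is a small Python sketch. The vocabulary, the column names, the record layout and the DOI are assumptions made for this example; the relation name "IsDerivedFrom" is borrowed from the style of relation types used in repository metadata schemas, but no specific schema is implied.

```python
# Hypothetical controlled vocabulary of units that a community might agree on.
UNIT_VOCABULARY = {"celsius", "kelvin", "percent"}


def check_units(columns: dict[str, str]) -> list[str]:
    """Return the names of columns whose unit is not in the vocabulary."""
    return [
        name for name, unit in columns.items()
        if unit.lower() not in UNIT_VOCABULARY
    ]


record = {
    "columns": {
        "air_temp": "celsius",
        "humidity": "percent",
        "soil_temp": "degC",        # not a term from the agreed vocabulary
    },
    # Link to the data set this one is built on (made-up DOI).
    "related_identifiers": [
        {"relation": "IsDerivedFrom", "identifier": "10.1234/example.0001"},
    ],
}

print(check_units(record["columns"]))
```

A check like this flags "degC" because another system that only knows the agreed terms could not interpret it, even though a human reader would.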

REUSABLE: Data should not only be available, but also effectively reusable. To achieve this, data should be abundantly described and documented in accordance with community standards. Metadata and documentation should be able to answer the W-questions, to help others understand what we call the provenance of the data: in other words, where the data came from and what happened to them along the way. All of this is needed when we want others to understand the context of the data and judge how relevant and useful they are. It increases trust and the likelihood of reuse. To make data reusable, we also need to let others know what kinds of reuse are permitted by including a clear data usage license.
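A simple way to picture the documentation needed for reuse is a checklist that reports which questions a metadata record cannot yet answer. The sketch below is only an illustration: the required fields, their mapping to questions and the example values are assumptions, not a community standard. The license value uses an SPDX-style short identifier for CC BY 4.0.

```python
# Hypothetical minimum documentation for reuse; the keys are illustrative.
REQUIRED_FIELDS = {
    "creator": "who produced the data",
    "description": "what the data contain",
    "collected": "when the data were collected",
    "location": "where the data were collected",
    "method": "how the data were produced and processed",
    "license": "what kinds of reuse are permitted",
}


def missing_documentation(metadata: dict) -> dict:
    """Map each missing or empty required field to the question it answers."""
    return {
        key: question
        for key, question in REQUIRED_FIELDS.items()
        if not metadata.get(key)
    }


metadata = {
    "creator": "Doe, Jane",
    "description": "Hourly soil moisture readings",
    "collected": "2019-04-01/2019-09-30",
    "method": "Capacitance probes; outliers removed with a 3-sigma filter",
    "license": "CC-BY-4.0",         # SPDX-style license identifier
}

for key, question in missing_documentation(metadata).items():
    print(f"missing '{key}': {question}")
```

Here the record lacks a location, so a potential reuser could not tell where the measurements were taken.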

Given the multiple aspects of FAIR, data is not either FAIR or UNFAIR. FAIR is a spectrum; in other words, data can be FAIR to a greater or lesser extent. So, how can you make your data FAIR? Unfortunately there is no one-size-fits-all solution, but note that much of the work of making your research data FAIR can be addressed by depositing your data in a trusted data repository. By choosing an appropriate, trusted and preferably domain-specific repository you can score many points in the FAIR game. When you upload your data to a repository you will typically need to provide metadata by filling in a form. The elements of the form comply with a specific metadata standard. Your metadata will then become machine-actionable and searchable in an online resource. The repository should also generate a persistent identifier for your data. It will also provide the possibility to include references to other data or metadata, for example to link to related data sets or your ORCID. In addition, trusted repositories will have authentication and authorization procedures in place to make sure that appropriate access conditions for the data are respected or enforced. Repositories also allow you to choose from machine-readable licenses, enhancing the reusability of your data. Domain-specific repositories tend to make use of discipline standards and controlled vocabularies, increasing the interoperability of your data.
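The idea that FAIR is a spectrum rather than a yes/no property can be illustrated with a toy checklist that reports how many criteria a metadata record meets. This is a deliberately simplified sketch: the four checks, the field names and the resulting score are assumptions made for illustration and are not an official FAIR metric or assessment tool.

```python
# Toy checks, one per principle; each takes a metadata dict and returns a bool.
CHECKS = {
    "findable": lambda m: bool(m.get("identifier")) and bool(m.get("title")),
    "accessible": lambda m: str(m.get("url", "")).startswith(("http://", "https://")),
    "interoperable": lambda m: bool(m.get("metadata_standard")),
    "reusable": lambda m: bool(m.get("license")),
}


def fair_score(metadata: dict) -> tuple[dict, float]:
    """Return the per-principle results and the fraction of checks passed."""
    results = {name: check(metadata) for name, check in CHECKS.items()}
    return results, sum(results.values()) / len(results)


deposit = {
    "identifier": "10.1234/example.5678",               # made-up DOI
    "title": "Soil moisture measurements 2019",
    "url": "https://doi.org/10.1234/example.5678",
    "license": "CC-BY-4.0",
    # no "metadata_standard" given, so the interoperability check fails
}

results, score = fair_score(deposit)
print(results)
print(f"{score:.0%} of the toy checks passed")
```

A deposit like this one sits partway along the spectrum: it has an identifier, a resolvable link and a license, but says nothing about the metadata standard it follows.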

Data repositories are indeed a key infrastructure enabling FAIR data. However, they won't do all the work for you. After all, you are the one who knows the data best, so you are still responsible for providing rich metadata and documentation to make the data understandable. Besides, if a discipline repository requires the data to be in a certain standard format and to use controlled vocabularies, the standardization process is still your job. Therefore, the sooner data is collected and managed in a FAIR way, the easier it will be to keep the data FAIR in the end. This is sometimes referred to as making data FAIR by design. That is why planning for data management even before you start collecting data is essential. So are you ready to make your data FAIR?

Further Information

  • Comment in Nature regarding FAIR Principles

Wilkinson, M. D. et al. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3:160018

  • FAIR-Principles and the European Commission

European Commission. Action Plan for FAIR data recommendations.

The EC expert group on FAIR data

EC/H2020 – Guidelines on FAIR Data Management in Horizon 2020

  • GO FAIR Initiative

GO FAIR is a bottom-up, stakeholder-driven and self-governed initiative that aims to implement the FAIR data principles, making data Findable, Accessible, Interoperable and Reusable (FAIR). It offers an open and inclusive ecosystem for individuals, institutions and organisations working together through Implementation Networks (INs). The INs are active in three activity pillars: GO CHANGE, GO TRAIN and GO BUILD.

https://www.go-fair.org/fair-principles/

  • The GO FAIR Austria office, which is part of the global GO FAIR initiative, networks researchers and service institutions to implement the FAIR principles.

GO FAIR Austria office

  • Training material for “FAIR” in the train-the-trainer program for Research Data Management

Biernacka, Katarzyna, Bierwirth, Maik, Buchholz, Petra, Dolzycka, Dominika, Helbig, Kerstin, Neumann, Janna, … Wuttke, Ulrike. (2020). Train-the-Trainer Concept on Research Data Management (Version 3.0). Zenodo. http://doi.org/10.5281/zenodo.4071471 (p. 38)

  • OPENAIRE

A network of Open Access repositories, archives and journals that support Open Access policies. The OpenAIRE Consortium is a Horizon 2020 (FP8) project aimed at supporting the implementation of the EC and ERC Open Access policies.

https://www.openaire.eu/how-to-make-your-data-fair

  • FAIR-Principles and the Committee on Data for Science and Technology (CODATA)

The Committee on Data for Science and Technology (CODATA) is a Paris-based organization that aims to improve the quality, reliability and accessibility of data of importance to all fields of science and technology.

Hodson, S. (2018). Making FAIR data a reality… and the challenges of interoperability and reusability. Open Science Conference 2018.

Citation

FAIR Data Austria (2021). “FAIR Principles”. In: Research Data Management Open Educational Resources Collection. (https://fair-office.at/index.php/fair-prinzipien/?lang=en).

License: CC BY 4.0 unless otherwise stated.