FAIR principles allied to data access and reuse

analysis of Covid-19 datasets from PubChem repository

Authors

  • Tainá Regly IBICT - UFRJ
  • Viviane Santos de Oliveira Veiga Fiocruz
  • Aline da Silva Alves

Keywords:

Research data, Covid-19, FAIR, PubChem

Abstract

This article aims to verify alignment with FAIR principles in health research data sharing and analyze its potential reuse in the Covid-19 health crisis. It uses a theoretical-descriptive, bibliographic, and exploratory approach to establish criteria for selecting the pilot theme, the data source, and the analysis tool.  The pilot theme was chosen, a drug that has been widely debated in the scientific and media field, for the treatment of Covid-19, Chloroquine. As a source of analysis it uses the chemical disciplinary repository PubChem and FairDataBR as an analysis tool to check the level of adherence of the selected datasets to the FAIR principles in a semi-automated way. As a result, it points out that the Accessible and Reusable principles (scores 8.33 and 8.20) obtained the highest scores, mainly due to ease of access, downloading and sharing. On the other hand, the Findable and Interoperable principles (scores 5.20 and 7.75) performed less well because they do not use persistent identifiers in all the available sets and do not make use of a metadata schema formalized by the chemical community. It concludes that, with respect to FAIR principles, there is partial alignment of the Chloroquine datasets deposited in PubChem. Thus, the analysis shows a reuse potential of (average of scores 7.37), as all principles support reuse. Finally, it reaffirms that for data to be FAIR, the ecosystem needs to be FAIR, so some measures should be taken by PubChem to support data generation, such as the designation or mandatory insertion of persistent identifiers.

References

Schymanski, Emma L., and Evan E. Bolton. "FAIR chemical structures in the Journal of Cheminformatics." Journal of cheminformatics 13.1 (2021): 1-3. https://doi.org/10.1186/s13321-021-00520-4

Simões, Rafael C, , Anjos, Renata Lemos dos & Dias, Guilherme Ataíde. (2021). Análise dos conjuntos de dados disponíveis no repositório COVID-19 Data Sharing/BR à luz dos princípios FAIR. In Princípios FAIR aplicados à gestão de dados de pesquisa (pp. 91–102). Ibict. https://doi.org/10.22477/9786589167242.cap7

REGLY, Tainá. (2022). Dados provenientes da análise do repositório PubChem com a ferramenta FairDataBR [Data set]. Zenodo. https://doi.org/10.5281/zenodo.7071895

Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., ... & Mons, B. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific data, 3, 160018. https://doi.org/10.1038/sdata.2016.18

Published

2024-05-29

How to Cite

Tainá Regly, Santos de Oliveira Veiga, V. ., & da Silva Alves, A. . (2024). FAIR principles allied to data access and reuse: analysis of Covid-19 datasets from PubChem repository. UEM Scientific Journal: Arts and Social Sciences Series , 4(1). Retrieved from http://196.3.97.23/revista/index.php/lcs/article/view/222

Most read articles by the same author(s)