When the COVID-19 pandemic swept through Barcelona's hospitals between March 2020 and August 2021, researchers at the Germans Trias i Pujol Research Institute (IGTP) did something that would ripple far beyond their laboratory walls: they meticulously documented everything. Now, that trove of medical knowledge is available to scientists everywhere, free of charge and ready to reshape how researchers understand COVID-19's patterns and impact.
The DIVINE study database contains anonymized clinical information from 5,813 patients hospitalized with COVID-19 across five hospitals in Barcelona's southern metropolitan area during four waves of the pandemic. This isn't just a spreadsheet—it's a comprehensive record that includes clinical characteristics, risk factors, treatments received, and hospital outcomes collected during patient stays and follow-up care. By publishing the dataset as an R package on CRAN, alongside a GitHub repository and Zenodo record, the Biostatistics Unit at IGTP has made the data immediately accessible and traceable for researchers worldwide.
What makes this move significant is both what it contains and how it's being shared. The dataset has already fueled several studies on in-hospital mortality, long-term sequelae, patient stratification, and the development of predictive models. Now, by opening these records to the broader scientific community, the researchers are creating a foundation for future studies that would otherwise require teams to start from scratch. Instead of redundant data collection efforts, scientists can validate predictive models, test new hypotheses, and explore questions that might not have occurred to the original researchers.
"Making clinical research data openly available is not only an act of transparency, but also an ethical commitment to science and society," said Cristian Tebé, head of the Biostatistics Unit at IGTP. "It enables data reuse, the reproduction of analyses, the expansion of knowledge and the acceleration of research, while avoiding the unnecessary duplication of studies." The insight cuts to the heart of modern research: in an age of pressing health challenges, redundancy is a luxury science cannot afford.
The dataset's utility extends beyond epidemiologists and clinicians. The researchers designed it to serve as a teaching resource for students in biostatistics and epidemiology, transforming real pandemic data into an educational tool. This dual purpose—supporting cutting-edge research while training the next generation of health scientists—multiplies the value of the work that Barcelona's hospitals and researchers conducted during an extraordinary crisis.
The publication in Scientific Data, a journal dedicated to promoting the accessibility and reusability of research datasets, represents a growing shift in how the scientific community approaches information sharing. By anonymizing patient records and structuring the data for easy reuse, IGTP's team has modeled how institutions can honor patient privacy while advancing collective knowledge. As researchers worldwide grapple with questions about COVID-19's long-term effects, optimal treatment strategies, and pandemic preparedness, they now have a robust, documented resource from one of Europe's hardest-hit regions. In opening these doors, Barcelona's researchers have demonstrated that transparency and data sharing aren't obstacles to science—they're accelerants.
