When Natàlia Blay and her team at the GCAT|Genomes for Life project in Catalonia began analyzing the genetic risk of frailty across nearly 5,000 participants, they uncovered a telling pattern: as genetic susceptibility increased, so did the prevalence of obesity and major depressive disorder—especially among women. This discovery wasn’t just a scientific insight; it was a demonstration of the power behind PolyGenie, a new open-source tool designed to unlock the hidden potential of genomic data. Developed by researchers at the Germans Trias i Pujol Research Institute (IGTP), PolyGenie is transforming how scientists explore the complex links between genetics and health, making it easier than ever to reuse and reinterpret existing data.
In an era where genomic datasets are growing rapidly, the real value lies not just in generating data, but in reusing it to answer new questions. That’s where PolyGenie steps in. Built using the Nextflow framework, the tool streamlines phenome-wide association studies (PheWAS), allowing researchers to examine how polygenic risk scores—aggregated indicators of genetic predisposition—relate to hundreds or even thousands of traits and diseases. By applying PolyGenie to the GCAT cohort, a population study of nearly 20,000 adults aged 40 to 65 in Catalonia, the team analyzed 135 different polygenic risk scores alongside 1,483 distinct phenotypes, from clinical diagnoses to lifestyle factors and metabolomic profiles. The result? More than 200,000 associations evaluated in a single, systematic sweep.
What sets PolyGenie apart is its accessibility and adaptability. Unlike many bioinformatics tools that require extensive coding expertise, PolyGenie comes with interactive visualization features and can be tailored to new research cohorts through simple configuration files—no code changes needed. It’s also fully open-source, aligning with the FAIR principles (Findable, Accessible, Interoperable, Reusable) that the GCAT team has championed for years. Now integrated into ELIXIR Spain, the national node of a pan-European infrastructure for biological data, PolyGenie is positioned to support researchers across institutions and borders.
"Although tools already exist to calculate polygenic risk scores and other platforms are available to visualize results, there has so far been a lack of resources that facilitate the systematic application of this type of analysis across different cohorts," says Blay. "PolyGenie fills this gap." Her words capture the essence of a tool that doesn’t just analyze data—it democratizes discovery. By turning complex genomic insights into navigable knowledge, PolyGenie empowers scientists in precision medicine, population genetics, and public health to ask bolder questions. As genomic databases grow, tools like this ensure that no data point is left behind, opening new pathways to understanding the biological roots of human health.
