High-definition likelihood inference of genetic colocalization reveals protein biomarkers for human complex diseases
GigaScience, 2026
Li Y., Zhai R., Yang Z., Li T., Pawitan Y., Shen X.
| Disease area | Application area | Sample type | Products |
|---|---|---|---|
Technical Studies | Data Science | Plasma | Olink Explore 3072/384 |
Abstract
Background
Genetic colocalization analysis is essential for understanding the shared genetic basis between phenotypic traits. Such an analysis is particularly useful for identifying plasma proteins with potential as therapeutic targets or clinical biomarkers. Improvements to existing tools are needed for more accurate inference of potentially causal biomarkers.
Findings
We develop HDL-C, a high-definition likelihood inference method for genetic colocalization analysis. Based on simulations and observed rediscovery rates in real data analyses, we demonstrate that the HDL-C approach outperforms state-of-the-art methods, COLOC, SuSiE, and SharePro, in detecting genetic colocalization, thus enabling a more complete understanding of genetic connections at specific loci. Analyses of the top 50 protein–disease pairs identified by HDL-C in the male and female cohorts of the UK Biobank uncovered 40 previously validated drug-protein–disease combinations with approved drugs matching the phenotypes and 62 combinations with potential drug repurposing opportunities. Additionally, we identified 63 novel protein–disease pairs that suggest promising candidates for future therapeutic interventions.
Conclusion
This research establishes a robust framework for detecting genetic colocalization signals, enabling the prioritization of disease-relevant protein targets and informing therapeutic development strategies.