HDAnalyzeR: Streamlining Data Analysis for Biomarker Research
Bioinformatics Advances, 2026
Antonopoulos K., Johansson E., Kenrick J., Dahl L., Edfors F., Uhlén M., Bueno Álvez M.
| Disease area | Application area | Sample type | Products |
|---|---|---|---|
Oncology Wider Bioinformatics Studies | Data Science | Plasma | Olink Explore 3072/384 |
Abstract
Motivation
Exploration of large-scale biological datasets remains a central challenge in computational biology. While many tools are available, they are often developed in isolation, leading to fragmented workflows, duplicated efforts, and limited reproducibility. There is a pressing need for flexible, standardized solutions that unify exploratory data analysis and biomarker discovery across diverse platforms.
Results
We present HDAnalyzeR, a user-friendly and extensible R package for the streamlined analysis of high-dimensional biological data. HDAnalyzeR provides modular, reproducible workflows that support a range of analyses, from quality control and dimensionality reduction to differential expression and enrichment analysis. The package features built-in visualization, metadata-aware modeling, and seamless integration with interactive apps and learning resources. We also present two case studies, where HDAnalyzeR dramatically reduced analysis time and code complexity while providing biologically meaningful insights, such as classification of blood cancer types with AUC = 1.0 and identification of thousands of solid tumor-associated genes. HDAnalyzeR is designed to support both beginner users and experienced bioinformaticians, promoting transparency, reproducibility, and publication-quality output.
Availability
HDAnalyzeR is freely available both as an open-source R package at https://github.com/kantonopoulos/HDAnalyzeR.