Serum biomarker-based early detection of pancreatic ductal adenocarcinomas with ensemble learning
Communications Medicine, 2023
Nené N., Ney A., Nazarenko T., Blyuss O., Johnston H., Whitwell H., Sedlak E., Gentry-Maharaj A., Apostolidou S., Costello E., Greenhalf W., Jacobs I., Menon U., Hsuan J., Pereira S., Zaikin A., Timms J.
Disease area | Application area | Sample type | Products |
---|---|---|---|
Oncology | Patient Stratification | Plasma | Olink Target 96 |
Abstract
Background
Earlier detection of pancreatic ductal adenocarcinoma (PDAC) is key to improving patient outcomes, as it is mostly detected at advanced stages which are associated with poor survival. Developing non-invasive blood tests for early detection would be an important breakthrough.
Methods
The primary objective of the work presented here is to use a dataset that is prospectively collected, to quantify a set of cancer-associated proteins and construct multi-marker models with the capacity to predict PDAC years before diagnosis. The data used is part of a nested case-control study within the UK Collaborative Trial of Ovarian Cancer Screening and is comprised of 218 samples, collected from a total of 143 post-menopausal women who were diagnosed with pancreatic cancer within 70 months after sample collection, and 249 matched non-cancer controls. We develop a stacked ensemble modelling technique to achieve robustness in predictions and, therefore, improve performance in newly collected datasets.
Results
Here we show that with ensemble learning we can predict PDAC status with an AUC of 0.91 (95% CI 0.75–1.0), sensitivity of 92% (95% CI 0.54–1.0) at 90% specificity, up to 1 year prior to diagnosis, and at an AUC of 0.85 (95% CI 0.74–0.93) up to 2 years prior to diagnosis (sensitivity of 61%, 95% CI 0.17–0.83, at 90% specificity).
Conclusions
The ensemble modelling strategy explored here outperforms considerably biomarker combinations cited in the literature. Further developments in the selection of classifiers balancing performance and heterogeneity should further enhance the predictive capacity of the method.