A Computational Statistics Approach to Evaluate Blood Biomarkers for Breast Cancer Risk Stratification

Kaan Oktay, Ashlie Santaliz-Casiano, Meera Patel, Natascia Marino, Anna Maria Storniolo, Hamdi Torun, Burak Acar, Zeynep Madak-Erdogan

Research output: Contribution to journalArticlepeer-review

20 Citations (Scopus)
21 Downloads (Pure)

Abstract

Breast cancer is the second leading cause of cancer mortality among women. Mammography and tumor biopsy followed by histopathological analysis are the current methods to diagnose breast cancer. Mammography does not detect all breast tumor subtypes, especially those that arise in younger women or women with dense breast tissue, and are more aggressive. There is an urgent need to find circulating prognostic molecules and liquid biopsy methods for breast cancer diagnosis and reducing the mortality rate. In this study, we systematically evaluated metabolites and proteins in blood to develop a pipeline to identify potential circulating biomarkers for breast cancer risk. Our aim is to identify a group of molecules to be used in the design of portable and low-cost biomarker detection devices. We obtained plasma samples from women who are cancer free (healthy) and women who were cancer free at the time of blood collection but developed breast cancer later (susceptible). We extracted potential prognostic biomarkers for breast cancer risk from plasma metabolomics and prote- omics data using statistical and discriminative power analyses. We pre-processed the data to ensure the quality of subse- quent analyses, and used two main feature selection methods to determine the importance of each molecule. After further feature elimination based on pairwise dependencies, we measured the performance of logistic regression classifier on the remaining molecules and compared their biological relevance. We identified six signatures that predicted breast cancer risk with different specificity and selectivity. The best performing signature had 13 factors. We validated the difference in level of one of the biomarkers, SCF/KITLG, in plasma from healthy and susceptible individuals. These biomarkers will be used to develop low-cost liquid biopsy methods toward early identification of breast cancer risk and hence decreased mortality. Our findings provide the knowledge basis needed to proceed in this direction.
Original languageEnglish
Pages (from-to)17-33
Number of pages17
JournalHormones and Cancer
Volume11
Issue number1
Early online date19 Dec 2019
DOIs
Publication statusPublished - 1 Feb 2020

Keywords

  • Breast cancer risk
  • Circulating biomarker
  • Feature selection
  • Liquid biopsy
  • Machine learning

Fingerprint

Dive into the research topics of 'A Computational Statistics Approach to Evaluate Blood Biomarkers for Breast Cancer Risk Stratification'. Together they form a unique fingerprint.

Cite this