Causal Bio-miner: Response Biomarkers Discovery Framework for Microarray Transcriptomics Treatment Subgroups Classification

Ala'a El-Nabawy*, Ossama Alshabrawy, Wai Lok Woo

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review


Abstract

This paper introduces a response biomarker discovery framework based on discriminant analysis and causal inference. The framework has two main stages: causal bio-mining and biomarker validation. In the causal bio-mining stage, significant biomarkers are extracted from the randomized controlled trial (RCT) dataset using several techniques: discriminant analysis, feature ranking, statistical significance, and association scoring. The extracted biomarkers are then assessed with respect to treatment group classification, using causal inference with propensity score matching. When applied to subgroup classification, the causal biomarkers gave higher accuracy while using the minimum possible number of features, provided their causal estimate was higher than 0.15 for both the treated and the control groups. The framework's efficacy was confirmed on two publicly available datasets: LiTMUS (GEO: GSE45484) and Breast Cancer (GEO: GSE20271). Its performance was compared with established techniques, including those based on statistical variance and diagonal linear discriminant analysis (DLDA), and it outperformed these benchmark methods. With 3 features, the Lithium subgroup classification accuracy is 83.33% and the Non-Lithium subgroup classification accuracy is 93.75%, based on a causal score >= 0.2. With 12 features, the FAC×6 subgroup classification accuracy is 81.90%, and with 13 features, the T/FAC subgroup classification accuracy is 92.70%, based on a causal score >= 0.15.
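
The thresholding step described in the abstract can be illustrated with a minimal sketch. It assumes each candidate biomarker already has one causal estimate for the treated group and one for the control group, and keeps only biomarkers that meet the threshold in both. The function name `select_causal_biomarkers`, the gene labels, and the scores are hypothetical and made up for illustration; how the paper actually computes the estimates from propensity score matching is not shown here.

```python
from typing import Dict, List


def select_causal_biomarkers(
    treated_scores: Dict[str, float],
    control_scores: Dict[str, float],
    threshold: float = 0.15,
) -> List[str]:
    """Keep biomarkers whose causal estimate meets the threshold in BOTH groups.

    Assumption: scores are compared as-is (no absolute value), using >=.
    """
    # Only biomarkers scored in both groups can satisfy the joint criterion.
    shared = treated_scores.keys() & control_scores.keys()
    return sorted(
        name
        for name in shared
        if treated_scores[name] >= threshold and control_scores[name] >= threshold
    )


# Hypothetical causal estimates, invented purely for illustration.
treated = {"GENE_A": 0.31, "GENE_B": 0.12, "GENE_C": 0.22}
control = {"GENE_A": 0.18, "GENE_B": 0.40, "GENE_C": 0.09}

selected = select_causal_biomarkers(treated, control)
print(selected)
```

With these made-up scores only `GENE_A` passes at 0.15, and raising `threshold` to 0.2 (the value reported for the Lithium dataset) leaves no biomarker selected, which shows how the cut-off trades feature count against strictness.
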
Original language: English
Article number: 130503
Journal: Expert Systems with Applications
Volume: 302
Early online date: 22 Nov 2025
Publication status: E-pub ahead of print - 22 Nov 2025

Keywords

  • Bio-markers
  • Causal inference
  • Discriminant analysis
  • Response
  • Transcriptomics
  • Treatment
