DNA Methylation Biomarkers Discovery Approach Using Weighted Causal Multi-Outcome Double Machine Learning Estimation

Ala'a El-Nabawy, Ossama Alshabrawy, Wai Lok Woo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper a biomarker discovery approach is proposed, where a causal double machine learning method is utilized for identifying DNA methylation biomarkers. Based on multi-outcome causal estimate, biomarkers are assessed with respect to their integrated target expressed genes, the selected biomarkers are then validated for tumor/ normal classification. The paper proposes various multi-outcome weighted functions to calculate the average multi-outcome causal estimate for a given CpG site; correlation, fold change, discriminant analysis weights, and feature importance. The proposed approach is applied on CPTAC-3 Head&Neck cancer in comparison to the commonly LIMMA approach. The proposed approach provided better classification accuracy, however with minimum possible features, the highest reported accuracy 98.79% with 11 features resulted from causal feature importance weight function, while the LIMMA accuracy is 97.67% with 92,815 features.
Original languageEnglish
Title of host publication2024 Sixth International Conference on Intelligent Computing in Data Sciences (ICDS)
EditorsYouness Oubenaalla, El Habib Nfaoui, Jaouad Boumhidi, Chakir Loqman, Cesare Alippi
Place of PublicationPiscataway, US
PublisherIEEE
Pages1-8
Number of pages8
ISBN (Electronic)9798350351200
ISBN (Print)9798350351217
DOIs
Publication statusPublished - 23 Oct 2024
Event2024 Sixth International Conference on Intelligent Computing in Data Sciences (ICDS) - Marrakech, Morocco
Duration: 23 Oct 202424 Oct 2024
https://www.researchnetwork.ma/icds2024/

Conference

Conference2024 Sixth International Conference on Intelligent Computing in Data Sciences (ICDS)
Abbreviated titleICDS2024
Country/TerritoryMorocco
CityMarrakech
Period23/10/2424/10/24
Internet address

Keywords

  • Causal Inference
  • Multi-outcome
  • Double Ma-chine Learning
  • DNA Methylation
  • Biomarkers

Cite this