Abstract
In this paper a biomarker discovery approach is proposed, where a causal double machine learning method is utilized for identifying DNA methylation biomarkers. Based on multi-outcome causal estimate, biomarkers are assessed with respect to their integrated target expressed genes, the selected biomarkers are then validated for tumor/ normal classification. The paper proposes various multi-outcome weighted functions to calculate the average multi-outcome causal estimate for a given CpG site; correlation, fold change, discriminant analysis weights, and feature importance. The proposed approach is applied on CPTAC-3 Head&Neck cancer in comparison to the commonly LIMMA approach. The proposed approach provided better classification accuracy, however with minimum possible features, the highest reported accuracy 98.79% with 11 features resulted from causal feature importance weight function, while the LIMMA accuracy is 97.67% with 92,815 features.
Original language | English |
---|---|
Title of host publication | 2024 Sixth International Conference on Intelligent Computing in Data Sciences (ICDS) |
Editors | Youness Oubenaalla, El Habib Nfaoui, Jaouad Boumhidi, Chakir Loqman, Cesare Alippi |
Place of Publication | Piscataway, US |
Publisher | IEEE |
Pages | 1-8 |
Number of pages | 8 |
ISBN (Electronic) | 9798350351200 |
ISBN (Print) | 9798350351217 |
DOIs | |
Publication status | Published - 23 Oct 2024 |
Event | 2024 Sixth International Conference on Intelligent Computing in Data Sciences (ICDS) - Marrakech, Morocco Duration: 23 Oct 2024 → 24 Oct 2024 https://www.researchnetwork.ma/icds2024/ |
Conference
Conference | 2024 Sixth International Conference on Intelligent Computing in Data Sciences (ICDS) |
---|---|
Abbreviated title | ICDS2024 |
Country/Territory | Morocco |
City | Marrakech |
Period | 23/10/24 → 24/10/24 |
Internet address |
Keywords
- Causal Inference
- Multi-outcome
- Double Ma-chine Learning
- DNA Methylation
- Biomarkers