TY - JOUR
T1 - Brain tumor classification utilizing pixel distribution and spatial dependencies higher-order statistical measurements through explainable ML models
AU - Akter, Sharmin
AU - Talukder, Md. Simul Hasan
AU - Mondal, Sohag Kumar
AU - Aljaidi, Mohammad
AU - Sulaiman, Rejwan Bin
AU - Alshammari, Ahmad Abdullah
PY - 2024/10/28
Y1 - 2024/10/28
N2 - Brain tumors are among the most fatal and devastating diseases, and they often result in a significant reduction in life expectancy. The devising of treatment plans that can extend the lives of affected individuals hinges on an accurate diagnosis of these tumors. Identifying and analyzing large volumes of magnetic resonance imaging (MRI) data manually proves to be both challenging and time-consuming. As a result, there exists a pressing need for a reliable machine-learning approach to accurately diagnose brain tumors, and numerous methods have already been proposed over the last decade. In this paper, a novel, comprehensive approach is proposed for identifying and classifying a given MR brain image as abnormal. Three common brain diseases, namely glioma, meningioma, and pituitary tumor, are chosen as abnormal brains, and the Figshare MRI brain image dataset was collected from the Kaggle and IEEE websites. The proposed method is initiated by employing 1st-order statistics, 2nd-order statistics, and higher-order transformed (DWT) feature extraction to extract features from images. Then missing data is addressed and handled using KNNImputer, followed by the application of the ExtratreesClassifier and PCA feature selection methods to identify the most relevant features and reduce the dimensions of these features. Subsequently, the reduced features are submitted to seven machine learning models, namely RF, GB, CB, SVM, LGBM, DT, and LR. The strategy of k-fold cross-validation is utilized to enhance the performance of those models. Finally, the models are evaluated using XAI approaches, which ensure transparent decision-making processes and provide insights into the model’s predictions. Remarkably, our approach achieves the highest accuracy, precision, recall, F1 score, MCC, Kappa, AUC-ROC, and R2, as well as the lowest loss, among the seven models evaluated, proving its effectiveness and applicability in multiple analytic applications relying on publicly available datasets.
AB - Brain tumors are among the most fatal and devastating diseases, and they often result in a significant reduction in life expectancy. The devising of treatment plans that can extend the lives of affected individuals hinges on an accurate diagnosis of these tumors. Identifying and analyzing large volumes of magnetic resonance imaging (MRI) data manually proves to be both challenging and time-consuming. As a result, there exists a pressing need for a reliable machine-learning approach to accurately diagnose brain tumors, and numerous methods have already been proposed over the last decade. In this paper, a novel, comprehensive approach is proposed for identifying and classifying a given MR brain image as abnormal. Three common brain diseases, namely glioma, meningioma, and pituitary tumor, are chosen as abnormal brains, and the Figshare MRI brain image dataset was collected from the Kaggle and IEEE websites. The proposed method is initiated by employing 1st-order statistics, 2nd-order statistics, and higher-order transformed (DWT) feature extraction to extract features from images. Then missing data is addressed and handled using KNNImputer, followed by the application of the ExtratreesClassifier and PCA feature selection methods to identify the most relevant features and reduce the dimensions of these features. Subsequently, the reduced features are submitted to seven machine learning models, namely RF, GB, CB, SVM, LGBM, DT, and LR. The strategy of k-fold cross-validation is utilized to enhance the performance of those models. Finally, the models are evaluated using XAI approaches, which ensure transparent decision-making processes and provide insights into the model’s predictions. Remarkably, our approach achieves the highest accuracy, precision, recall, F1 score, MCC, Kappa, AUC-ROC, and R2, as well as the lowest loss, among the seven models evaluated, proving its effectiveness and applicability in multiple analytic applications relying on publicly available datasets.
KW - PCA
KW - CB
KW - KNNImputer
KW - GB
KW - DT
KW - MRI
KW - LR
KW - SVM
KW - ExtratreesClassifier
KW - XAI
KW - RF
KW - DWT
KW - LGBM
U2 - 10.1038/s41598-024-74731-8
DO - 10.1038/s41598-024-74731-8
M3 - Article
SN - 2045-2322
VL - 14
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 25800
ER -