Skip to main navigation Skip to search Skip to main content

A hybrid machine learning model for flood prediction with recursive feature elimination informed by training performance

Liying Gong, Wai Lok Woo, Yue Ivan Wu*, Xiujuan Zheng

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Flooding is a devastating natural disaster that often causes inestimable human and economic loss. With historical data, flood forecasting with machine learning methods is considered an effective way to evaluate the potential risks in the concerned region. In this work, the recursive feature elimination (RFE) using SHapley Additive exPlanations (SHAP) is employed to select informative features, which iteratively evaluates feature importance and eliminates the least important features. With the least significant feature discarded, a one-dimension-reduced feature vector is formed for the next iteration, the proposed machine learning model is trained with the input samples of updated features. After this recursive feature elimination procedure, the selected features are finalized by finding the iteration where the model’s trained performance is optimal or dramatically drops. With the proposed recursive feature elimination, the training time of the conventional machine learning models, such as XGB, RF, and CatBoost, can be reduced by 17.8%, 56.0%, 30.6% while keeping the loss of prediction accuracy below 1%, respectively. Furthermore, a hybrid machine learning model integrating the conventional Catboost model and Random Forest model is proposed to evaluate flood susceptibility in flood-prone areas of Malawi, where numerical experiments showed its superiority over the individual models.

Original languageEnglish
Article number101
Number of pages25
JournalApplied Intelligence
Volume56
Issue number4
Early online date18 Feb 2026
DOIs
Publication statusPublished - 18 Feb 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities

Keywords

  • Ensemble learning
  • Feature selection
  • Flood prediction
  • Hybrid model
  • Machine learning

Fingerprint

Dive into the research topics of 'A hybrid machine learning model for flood prediction with recursive feature elimination informed by training performance'. Together they form a unique fingerprint.

Cite this