A Hybrid LSTM-Attention Approach for Missing Data Imputation in IoT Time Series

Ammara Laeeq*, Usman Adeel, Jie Li, Eleanor Starkey

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

IoT systems generate large volumes of time-series data, but sensor malfunctions often lead to missing values that reduce the effectiveness of machine learning models. We propose a novel hybrid architecture that interleaves Long Short-Term Memory (LSTM) layers with a multihead attention mechanism, where the first LSTM layer captures local temporal dependencies, the attention layer highlights long-range relationships, and the second LSTM layer integrates these features into a coherent sequence. This structured design, unlike conventional orderings, enhances robustness against irregular missingness. Evaluated on six months of soil surface temperature data with simulated missing rates from 10% to 90%, in terms of mean absolute error (MAE), R2 score (R2) and root mean squared error (RMSE). Performance is also compared to a statistical technique k-Nearest Neighbour (KNN) and a deep learning technique Bidirectional Recurrent Imputation for Time Series (BRITS) baselines. Importantly, training with simulated missingness further improved generalization, underscoring the novelty and practical relevance of the proposed approach for real-world IoT scenarios.

Original languageEnglish
Title of host publicationIntelligent Data Engineering and Automated Learning – IDEAL 2025
Subtitle of host publication26th International Conference, Jaén, Spain, November 13–15, 2025, Proceedings, Part I
EditorsLuis Martínez, David Camacho, Hujun Yin, Bapi Dutta, Raciel Yera, Rosa M. Rodríguez Domínguez, Antonio Tallón-Ballesteros
Place of PublicationCham, Switzerland
PublisherSpringer
Pages301-312
Number of pages12
Edition1
ISBN (Electronic)9783032104861
ISBN (Print)9783032104854
DOIs
Publication statusPublished - 6 Nov 2025
Event26th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2025 - Jaén, Spain
Duration: 13 Nov 202515 Nov 2025

Publication series

NameLecture Notes in Computer Science
Volume16238 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference26th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2025
Country/TerritorySpain
CityJaén
Period13/11/2515/11/25

Keywords

  • Data Imputation
  • LSTM
  • Missing Data
  • Multihead Attention
  • Neural Network
  • Time Series

Cite this