Skip to main navigation Skip to search Skip to main content

Optimal transport filtering for robust cross-modal retrieval with open-set noisy labels

Xinliu Liu, Ruitao Pu, Yuan Sun, Yingke Chen, Shudong Huang, Dezhong Peng, Yongsheng Sang*, Ziming Wang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Downloads (Pure)

Abstract

With the rapid growth of the Internet, cross-modal retrieval (CMR) has attracted increasing attention for its ability to bridge heterogeneous modalities. Most existing studies assume that data annotations are completely accurate, but this rarely holds in practice since both human non-expert and machine annotations often introduce noisy labels. To mitigate the effect of label noise, various robust learning approaches have been proposed with promising results. However, these methods are generally developed under a closed-set assumption, where noisy samples remain within the known label space. In real-world scenarios, noisy samples may originate from unseen categories, referred to as open-set noisy labels, under which existing methods often fail. To address this challenge, we propose a novel CMR framework named Optimal Transport Filtering with Open-Set Noisy Labels (OTOS). Specifically, a Discriminative Reinforcement Learning (DRL) module is introduced to enhance instance-level discrimination and reduce multimodal heterogeneity, while an Optimal Transport Filtering (OTF) module leverages geometric distances to identify clean samples, closed-set noise, and open-set noise effectively. Furthermore, specialized learning strategies are designed for different instance types to fully exploit the information contained in both closed-set and open-set data. Extensive experiments on three benchmark datasets demonstrate the superior performance of OTOS, particularly in handling multimodal data with open-set noisy labels.

Original languageEnglish
Article number113377
Number of pages11
JournalPattern Recognition
Volume178
Early online date3 Mar 2026
DOIs
Publication statusE-pub ahead of print - 3 Mar 2026

Keywords

  • Cross-modal retrieval
  • Noisy labels
  • Open-set noise

Fingerprint

Dive into the research topics of 'Optimal transport filtering for robust cross-modal retrieval with open-set noisy labels'. Together they form a unique fingerprint.

Cite this