Conference paper, Year: 2022

Reinforcement Learning Driven Intra-modal and Inter-modal Representation Learning for 3D Medical Image Classification

Abstract

Multi-modality 3D medical images play an important role in clinical practice. Because it can effectively exploit the complementary information among different modalities, multi-modality learning, typically realized with Deep Learning (DL) models, has attracted increasing attention recently. However, it remains a challenging task for two reasons. First, the prediction confidence of a multi-modality learning network cannot be guaranteed when the model is trained with weakly-supervised volume-level labels. Second, it is difficult to effectively exploit the complementary information across modalities while also preserving the modality-specific properties during fusion. In this paper, we present a novel Reinforcement Learning (RL) driven approach to comprehensively address these challenges, in which two Recurrent Neural Network (RNN) based agents are used to choose reliable and informative features within each modality (intra-learning) and to explore complementary representations across modalities (inter-learning) with the guidance of dynamic weights. These agents are trained via Proximal Policy Optimization (PPO) with the confidence increment of the prediction as the reward. We take 3D image classification as an example and conduct experiments on a multi-modality brain tumor MRI dataset. Our approach outperforms other methods when employing the proposed RL-based multi-modality representation learning.
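The abstract states that the agents are trained via PPO with the confidence increment of the prediction as the reward, but it does not give the exact formulas. The following is a minimal sketch under stated assumptions, not the authors' implementation: it assumes "confidence" means the softmax probability of the true class, and it uses the standard PPO clipped surrogate for a single action. The function names, the example logits, and the clipping value eps=0.2 are all hypothetical choices made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def confidence_increment(prev_logits, curr_logits, label):
    """Assumed reward: change in the predicted probability of the true class
    between two consecutive steps of the agent."""
    return softmax(curr_logits)[label] - softmax(prev_logits)[label]


def ppo_clipped_objective(new_logp, old_logp, advantage, eps=0.2):
    """Standard PPO clipped surrogate for one action (to be maximized)."""
    ratio = math.exp(new_logp - old_logp)
    clipped = min(max(ratio, 1.0 - eps), 1.0 + eps)
    return min(ratio * advantage, clipped * advantage)


# Hypothetical classifier logits before and after the agent selects a feature.
reward = confidence_increment([0.0, 0.0], [0.0, 1.0], label=1)

# Hypothetical policy update in which the action became twice as likely.
objective = ppo_clipped_objective(math.log(2.0), 0.0, advantage=reward)
```

In this sketch a selection that raises the probability of the correct class yields a positive reward, and the clipping keeps a single update from moving the policy too far from the one that collected the data.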
Main file
Reinforcement_Learning_Driven_Intra_modal.pdf (636.85 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03790158, version 1 (04-10-2022)

Cite

Zhonghang Zhu, Liansheng Wang, Baptiste Magnier, Lei Zhu, Defu Zhang, et al. Reinforcement Learning Driven Intra-modal and Inter-modal Representation Learning for 3D Medical Image Classification. MICCAI 2022 - The 25th International Conference on Medical Image Computing and Computer Assisted Intervention, Sep 2022, Singapore, Singapore. pp.604-613, ⟨10.1007/978-3-031-16437-8_58⟩. ⟨hal-03790158⟩