Reinforcement Learning Driven Intra-modal and Inter-modal Representation Learning for 3D Medical Image Classification

Zhonghang Zhu; Liansheng Wang; Baptiste Magnier; Lei Zhu; Defu Zhang; Lequan Yu

doi:10.1007/978-3-031-16437-8_58

Communication Dans Un Congrès Année : 2022

Reinforcement Learning Driven Intra-modal and Inter-modal Representation Learning for 3D Medical Image Classification

(1) , (1) , (2) , (3) , (1) , (4)

1
2
3
4

Zhonghang Zhu

Fonction : Auteur

Xiamen University

Liansheng Wang

Fonction : Auteur

Xiamen University

Baptiste Magnier

Fonction : Auteur
PersonId : 170358
IdHAL : baptiste-magnier
ORCID : 0000-0003-3458-0552
IdRef : 232618720

EuroMov - Digital Health in Motion

Lei Zhu

Fonction : Auteur

Hong Kong University of Science and Technology

Defu Zhang

Fonction : Auteur

Xiamen University

Lequan Yu

Fonction : Auteur

The University of Hong Kong

Résumé

Multi-modality 3D medical images play an important role in the clinical practice. Due to the effectiveness of exploring the complementary information among different modalities, multi-modality learning has attracted increased attention recently, which can be realized by Deep Learning (DL) models. However, it remains a challenging task for two reasons. First, the prediction confidence of multi-modality learning network cannot be guaranteed when the model is trained with weakly-supervised volume-level labels. Second, it is difficult to effectively exploit the complementary information across modalities and also preserve the modality-specific properties when fusion. In this paper, we present a novel Reinforcement Learning (RL) driven approach to comprehensively address these challenges, where two Recurrent Neural Networks (RNN) based agents are utilized to choose reliable and informative features within modality (intra-learning) and explore complementary representations across modalities (inter-learning) with the guidance of dynamic weights. These agents are trained via Proximal Policy Optimization (PPO) with the confidence increment of the prediction as the reward. We take the 3D image classification as an example and conduct experiments on a multi-modality brain tumor MRI data. Our approach outperforms other methods when employing the proposed RL-based multi-modality representation learning.

Mots clés

Multi-modality learning 3D medical images Reinforcement learning Classification

Domaines

Informatique [cs] Médecine humaine et pathologie

Fichier principal

Reinforcement_Learning_Driven_Intra_modal.pdf (636.85 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Administrateur IMT - Mines Alès : Connectez-vous pour contacter le contributeur

https://imt-mines-ales.hal.science/hal-03790158

Soumis le : mardi 4 octobre 2022-09:54:27

Dernière modification le : mardi 28 février 2023-15:36:25

Archivage à long terme le : jeudi 5 janvier 2023-18:19:25

Dates et versions

hal-03790158 , version 1 (04-10-2022)

Identifiants

HAL Id : hal-03790158 , version 1
DOI : 10.1007/978-3-031-16437-8_58

Citer

Zhonghang Zhu, Liansheng Wang, Baptiste Magnier, Lei Zhu, Defu Zhang, et al.. Reinforcement Learning Driven Intra-modal and Inter-modal Representation Learning for 3D Medical Image Classification. MICCAI 2022 - The 25th International Conference on Medical Image Computing and Computer Assisted Intervention, Sep 2022, Singapour, Singapore. pp.604-613, ⟨10.1007/978-3-031-16437-8_58⟩. ⟨hal-03790158⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM EM-ALES UNIV-MONTPELLIER EUROMOV-DHM

52 Consultations

122 Téléchargements

Reinforcement Learning Driven Intra-modal and Inter-modal Representation Learning for 3D Medical Image Classification

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager