Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images

Table of Contents

Title: Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images

Authors: Ming Y. Lu, Bowen Chen, Andrew Zhang, Drew F.K. Williamson, Richard J. Chen, Tong Ding, Long Phi Le, Yung-Sung Chuang, Faisal Mahmood

Published: Jun 13, 2023

Summary (Generated by Microsoft Copilot):

Introduction:

The paper presents MI-Zero, a framework for zero-shot transfer in histopathology images using contrastive visual language pretraining.

Challenges:

Data limitations: Lack of large-scale, publicly available paired image-text datasets.
Computational challenges: Handling gigapixel whole slide images (WSIs) that can span up to 100,000 × 100,000 pixels.

Methods:

Multiple Instance Learning (MIL): Reformulates zero-shot transfer to handle large images.
Pretraining: Uses over 550k pathology reports and 33k histopathology image-caption pairs.

Novelties:

Zero-shot transfer: First application in pathology for WSIs.
MI-Zero framework: Utilizes pretrained visual language encoders for diagnostic tasks without additional labels.

Results:

Achieves an average median zero-shot accuracy of 70.2% across three cancer subtyping tasks.

Performances:

Limitations:

Discussion:

Potential for semi-supervised learning workflows and applications in other fields like satellite imaging. Future work includes collecting more datasets and improving sample efficiency.