Salient instance inference-based Multiple Instance Learning (SiiMIL)

This is the offical implementation of our paper: Attention2Minority: A salient instance inference-based multiple instance learning for classifying small lesions in breast cancer whole slide images. Paper

Requirements

Camelyon16 dataset, torch, torchvision, tensorboard, openslide, PIL, pandas, numpy, scikit-learn, tqdm, opencv

Extract foreground patches coordinates

Extract the coordinates of the top-left corner of each patch from CAM16 raw slides:

$ python extraction.py --slidedir <>

Or use your own:

data
    ├── pts
          ├── cam16l1p224s224
                            ├── slide_1.npy
                            ├── slide_2.npy
                            └── ...

Patch encoding using Resnet50

Encoding patches from CAM16 raw slides using Resnet50(pretrained on ImageNet, and truncated at the third block):

$ python encoding_pts.py --slidedir <>

Or use your own:

data
   ├── feats
           ├── cam16res
                      ├── train
                              ├── normal
                                       ├── slide_1.npy
                                       ├── slide_2.npy
                                       └── ...
                              └── tumor
                                      └── ...
                      └── test
                             ├── normal
                                      └── ...
                             └── tumor
                                     └── ...

Representation learning from negative instances

Learn representative negative instances (i.e., Key set)

$ python keyset_lrn.py -t 100

Or download the learned key set.