Key Points

  • Basic strategy: self-supervised learning to reduce reliance on human-labeling
  • Downstream tasks: dense prediction (segmentation, depth estimation, etc.)
  • Dataset: Pascal VOC 2012
  • Baselines: MoCo and SimCLR with semantic segmentation head (a decoder)
  • Main effort: design contrastive pretext tasks for segmentation
  • Possible direction: auto-encoder and energy-based method (why or why not)

    To-do’s

  • [x] Intro to MMSegmentation toolbox

  • Convert MoCo & SimCLR model checkpoints to MMSegmentation
  • Related work about auto-encoder style self-supervised learning