Mask RCNN

Introduction

What is instance segmentation?

Instance segmentation require correct detection of all objects while also precisely segmenting each instance.

So, it therefore combines two task:

Mask Branch
Mask R-CNN extends Faster R-CNN by adding a branch for predicting segmentation mask on each Region of Interest(RoI), in parallel with the existing branch for classification and bounding box regression.
The mask branch has a $Mask RCNN - 图1$ -dimensional output for each RoI, which encodes $Mask RCNN - 图2$ binary masks of resolution $Mask RCNN - 图3$ , one for each of the $Mask RCNN - 图4$ classes. Specifically, this model predict mask from each RoI using an FCN.

And the experiment found that the different branches have mutual promotion effect.

mask rcnn.png

roi align.png

Decoupling and Branch
If a complicated problem can be split into several simple task, we may be able to create different branches to deal with different task. And this will help to decouple a complicated problem.
Mask R-CNN decouple mask and class prediction: predict a binary mask for each class independently, without competition among classes. (Using sigmoid replace softmax)
Why predict mask using FCN?
- Compared to fc, FCN has less parameters, and can reduce calculation.
- FCN retains spatial information.
Replace quantization with linear interpolation.