IOAI ML Notes Computer VisionDeep Learning

R-CNN, Fast R-CNN, and Faster R-CNN

Region-based CNN detectors and how they evolved from R-CNN to Faster R-CNN.

Syllabus Map


Overview


R-CNN

Core idea

Pipeline

How Selective Search Works (Step-by-Step)

Step 1: Initial over-segmentation

Step 2: Compute region similarities

Step 3: Greedy merging

Step 4: Generate proposals at all hierarchy levels

Step 5: Remove duplicates

Step 6: Hand off to CNN stage

Key traits


Fast R-CNN

Core idea

Pipeline

Why “CNN Once Per Image” Is Better

Key traits


Faster R-CNN

Core idea

Pipeline

How RPN Works (Step-by-Step)

Step 1: Shared feature extraction

Step 2: Dense anchor placement

Step 3: Sliding prediction heads

Step 4: Anchor labeling (training)

Step 5: Proposal decoding

Step 6: Proposal filtering

Step 7: Hand-off to detector

Key traits


Mask R-CNN

Core idea

Pipeline

Key traits


How They Differ


Training Objectives (High Level)


When To Use


Practical Notes

Use stable optimization defaults first

Understand training strategy differences

Prioritize compute-efficient architectures in practice

← Back to Blog