Focal Loss for Dense Object Detection

Discount examples which are correctly predicted with high confidence. The strength of the discounting is specified by an adjustable parameter. Used for imbalanced classification problems like object detectors, where the vast majority of training examples are negative.

Introduces RetinaNet, which is a ResNet trained with focal loss.

see also