The problem
CNNs underperform on long-range pathologies in chest X-rays. Labeled medical data is scarce.
The approach
Self-supervised MAE pre-training on 500k unlabeled chest X-rays, followed by supervised fine-tuning with class-balanced focal loss.
Results
Beat the ResNet-50 baseline by 6.2 AUC points on CheXpert and reached radiologist-level F1 on three out of fourteen findings.