Low-Shot Validation: Active Importance Sampling for Estimating Classifier Performance on Rare Categories

Poms, Fait; Sarukkai, Vishnu; Mullapudi, Ravi Teja; Sohoni, Nimit S.; Mark, William R.; Ramanan, Deva; Fatahalian, Kayvon

arXiv:2109.05720 (cs)

[Submitted on 13 Sep 2021]

Abstract: For machine learning models trained with limited labeled training data, validation stands to become the main bottleneck to reducing overall annotation costs. We propose a statistical validation algorithm that accurately estimates the F-score of binary classifiers for rare categories, where finding relevant examples to evaluate on is particularly challenging. Our key insight is that simultaneous calibration and importance sampling enables accurate estimates even in the low-sample regime (< 300 samples). Critically, we also derive an accurate single-trial estimator of the variance of our method and demonstrate that this estimator is empirically accurate at low sample counts, enabling a practitioner to know how well they can trust a given low-sample estimate. When validating state-of-the-art semi-supervised models on ImageNet and iNaturalist2017, our method achieves the same estimates of model performance with up to 10x fewer labels than competing approaches. In particular, we can estimate model F1 scores with a variance of 0.005 using as few as 100 labels.
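
To make the importance-sampling idea in the abstract concrete, here is a minimal Python sketch of a generic importance-weighted F1 estimate. It is not the authors' algorithm: it omits the calibration step and the variance estimator, and the function names (`make_proposal`, `draw_sample`, `estimate_f1`), the `floor` parameter, and the score-proportional proposal are all assumptions made for illustration.

```python
import random


def make_proposal(scores, floor=0.1):
    """Hypothetical proposal: mix score-proportional and uniform sampling.

    The uniform floor keeps every probability positive, which the
    importance weights in estimate_f1 require.
    """
    n = len(scores)
    total = sum(scores)
    if total == 0:
        return [1.0 / n] * n
    return [(1 - floor) * s / total + floor / n for s in scores]


def draw_sample(q, labels, n, seed=0):
    """Draw n indices with replacement from q and look up their labels.

    In practice the labels would come from an annotator; here they are
    read from a list so the sketch is self-contained.
    """
    rng = random.Random(seed)
    idx = rng.choices(range(len(q)), weights=q, k=n)
    return idx, [labels[i] for i in idx]


def estimate_f1(preds, q, sampled_idx, sampled_labels):
    """Importance-weighted F1 estimate (a ratio estimator, not unbiased).

    preds: 0/1 predictions for every pool item (no labels needed).
    q: sampling probability of each pool item (sums to 1).
    sampled_idx, sampled_labels: the labeled draws.
    Uses F1 = 2*TP / (predicted positives + actual positives).
    """
    n = len(sampled_idx)
    pairs = list(zip(sampled_idx, sampled_labels))
    # Weighted estimates of the true-positive and actual-positive counts.
    tp_hat = sum(y * preds[i] / q[i] for i, y in pairs) / n
    pos_hat = sum(y / q[i] for i, y in pairs) / n
    pred_pos = sum(preds)  # known exactly from the classifier output
    denom = pred_pos + pos_hat
    return 2 * tp_hat / denom if denom > 0 else 0.0
```

Sampling in proportion to classifier score concentrates the labeling budget on likely positives, which is where a rare category contributes to the F-score; the weights `1 / q[i]` correct for that bias. With a uniform proposal and every pool item labeled once, the estimate reduces to the exact F1.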

Comments:	Accepted to ICCV 2021; 12 pages, 12 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2109.05720 [cs.CV]
	(or arXiv:2109.05720v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2109.05720

Submission history

From: Fait Poms
[v1] Mon, 13 Sep 2021 06:01:16 UTC (2,267 KB)
