A Framework for Cluster and Classifier Evaluation in the Absence of Reference Labels

Joyce, Robert J.; Raff, Edward; Nicholas, Charles

doi:10.1145/3474369.3486867

Computer Science > Machine Learning

arXiv:2109.11126 (cs)

[Submitted on 23 Sep 2021]

Title:A Framework for Cluster and Classifier Evaluation in the Absence of Reference Labels

Authors:Robert J. Joyce, Edward Raff, Charles Nicholas

View PDF

Abstract:In some problem spaces, the high cost of obtaining ground truth labels necessitates use of lower quality reference datasets. It is difficult to benchmark model performance using these datasets, as evaluation results may be biased. We propose a supplement to using reference labels, which we call an approximate ground truth refinement (AGTR). Using an AGTR, we prove that bounds on specific metrics used to evaluate clustering algorithms and multi-class classifiers can be computed without reference labels. We also introduce a procedure that uses an AGTR to identify inaccurate evaluation results produced from datasets of dubious quality. Creating an AGTR requires domain knowledge, and malware family classification is a task with robust domain knowledge approaches that support the construction of an AGTR. We demonstrate our AGTR evaluation framework by applying it to a popular malware labeling tool to diagnose over-fitting in prior testing and evaluate changes whose impact could not be meaningfully quantified under previous data.

Comments:	to appear in Proceedings of the 14th ACM Workshop on Artificial Intelligence and Security
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Cite as:	arXiv:2109.11126 [cs.LG]
	(or arXiv:2109.11126v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.11126
Related DOI:	https://doi.org/10.1145/3474369.3486867

Submission history

From: Edward Raff [view email]
[v1] Thu, 23 Sep 2021 03:42:01 UTC (952 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
cs.CR

References & Citations

DBLP - CS Bibliography

listing | bibtex

Edward Raff
Charles Nicholas

export BibTeX citation

Computer Science > Machine Learning

Title:A Framework for Cluster and Classifier Evaluation in the Absence of Reference Labels

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Framework for Cluster and Classifier Evaluation in the Absence of Reference Labels

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators