1. Aggregative Quantification
Aggregative quantification refers to methods that estimate the prevalence of classes in a dataset by aggregating the predictions made on individual data items. Except for special-purpose learning methods, all aggregative methods share a common structure involving three main steps:
fit: the model is trained on labeled data.
predict: the trained model generates predictions for each individual item in the unlabeled dataset. These predictions can be either hard labels or soft probabilities.
aggregate: this step uses the results from the prediction phase to estimate the class prevalence distribution in the dataset. The aggregate function takes as input the necessary information from the classification results to perform this aggregation.
This clear separation into fit, predict, and aggregate steps allows a modular implementation and the reuse of standard classifiers as the backbone.
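The three steps can be sketched in plain Python as follows. This is a minimal illustration, not the library's implementation: `ThresholdClassifier`, `aggregate_hard`, and `aggregate_soft` are hypothetical names introduced here, and the toy data is made up. Counting hard labels is the simplest aggregation (commonly called Classify and Count); averaging posterior probabilities is its soft counterpart.

```python
from collections import Counter


class ThresholdClassifier:
    """Toy 1-D binary classifier; a hypothetical stand-in for any fit/predict model."""

    def fit(self, X, y):
        # Put the decision threshold halfway between the two class means.
        mean0 = sum(x for x, label in zip(X, y) if label == 0) / y.count(0)
        mean1 = sum(x for x, label in zip(X, y) if label == 1) / y.count(1)
        self.threshold_ = (mean0 + mean1) / 2
        return self

    def predict(self, X):
        # Hard labels: exactly one class per item.
        return [1 if x > self.threshold_ else 0 for x in X]


def aggregate_hard(labels, classes):
    """Estimate prevalence as the fraction of items predicted in each class."""
    counts = Counter(labels)
    return {c: counts[c] / len(labels) for c in classes}


def aggregate_soft(posteriors, classes):
    """Estimate prevalence as the mean posterior probability of each class."""
    n = len(posteriors)
    return {c: sum(row[i] for row in posteriors) / n for i, c in enumerate(classes)}


# fit: train the classifier on labeled data (toy values, assumed for illustration).
clf = ThresholdClassifier().fit([0.1, 0.2, 0.3, 0.7, 0.8, 0.9], [0, 0, 0, 1, 1, 1])

# predict: one prediction per unlabeled item.
labels = clf.predict([0.05, 0.15, 0.4, 0.95])

# aggregate: turn item-level predictions into a prevalence estimate.
prevalence = aggregate_hard(labels, classes=[0, 1])
print(prevalence)  # {0: 0.75, 1: 0.25}

# The same step with soft probabilities (hand-written posteriors, one row per item).
posteriors = [[0.75, 0.25], [0.75, 0.25], [0.5, 0.5], [0.25, 0.75]]
soft_prevalence = aggregate_soft(posteriors, classes=[0, 1])
```

Because the aggregation functions only see predictions, the toy classifier could be swapped for any model that exposes fit and predict without changing the aggregation code.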
The main aggregative quantification methods are described next, along with their specific approaches and how they differ.