Query StrategiesΒΆ

One of the key components of AL pipelines is a query strategy that specifies what instances are selected for annotation. ALToolbox provides classical and state-of-the-art query strategies for text classification, sequence tagging, and seq2seq tasks. Strategies implemented in our framework are summarized in the following table.

#

Strategy

Citation

1

ALPS

Citation

2

BADGE

Citation

3

BAIT

Citation

4

BALD

Citation

5

BatchBALD

Citation

6

Breaking Ties (BT) (also Maximum Margin)

Citation

7

Contrastive Active Learning (CAL)

Citation

8

Cluster Margin

Citation

9

Coreset

Citation

10

Expected Gradient Length (EGL)

Citation

11

Embeddings KM

Citation

12

Entropy

Citation

13

Least Confidence (LC)

Citation

14

Mahalanobis Distance

Citation

15

Maximum Normalized Log-Probability (MNLP)

Citation

16

Random (No AL)

-