
TypiClust, ProbCover & DCoM Official Code Repository

This is the official implementation for the papers Active Learning on a Budget - Opposite Strategies Suit High and Low Budgets, Active Learning Through a Covering Lens, and DCoM: Active Learning for All Learners.

This code implements TypiClust, ProbCover and DCoM, simple and effective low-budget active learning methods.

TypiClust

Arxiv link, Twitter Post link, Blog Post link

TypiClust first employs a representation learning method, then clusters the data into K clusters, and selects the most Typical (Dense) sample from every cluster. In other words, TypiClust selects samples from dense and diverse regions of the data distribution.
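A minimal sketch of the selection step, assuming self-supervised embeddings (e.g., from SimCLR) are already computed. The function names, the k-nearest-neighbor typicality estimate, and the hyperparameters below are illustrative assumptions, not the repository's API:

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.neighbors import NearestNeighbors

def typicality(embeddings, k=20):
    """Typicality of each point: inverse of the mean distance to its k nearest neighbors."""
    nn = NearestNeighbors(n_neighbors=k + 1).fit(embeddings)
    dists, _ = nn.kneighbors(embeddings)            # dists[:, 0] is the point itself
    return 1.0 / (dists[:, 1:].mean(axis=1) + 1e-8)

def typiclust_select(embeddings, budget):
    """Cluster into `budget` clusters and pick the most typical sample of each cluster."""
    labels = KMeans(n_clusters=budget, n_init=10).fit_predict(embeddings)
    typ = typicality(embeddings)
    selected = []
    for c in range(budget):
        idx = np.where(labels == c)[0]
        if idx.size:                                 # skip rare empty clusters
            selected.append(idx[np.argmax(typ[idx])])
    return np.array(selected)

# e.g. queries = typiclust_select(features, budget=30)
```

In the repository the typicality is estimated within each cluster and already-labeled samples influence the clustering; the sketch above only conveys the dense-and-diverse selection idea.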

Selection of 30 samples on CIFAR-10:

Selection of 10 samples from a GMM:

TypiClust Results summary

ProbCover

Arxiv link, Twitter Post link, Blog Post link

ProbCover also builds on a representation learning method. A ball of radius $\delta$ is placed around every point, and the subset of $b$ (budget) balls that covers the most points is selected; the centers of these balls are the samples to be labeled.
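Maximizing coverage is done greedily in practice. The sketch below assumes a precomputed embedding matrix and an already-chosen $\delta$; the function name and use of scikit-learn are illustrative assumptions, not the repository's implementation:

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

def probcover_select(embeddings, budget, delta):
    """Greedy max coverage: repeatedly pick the point whose delta-ball
    covers the largest number of still-uncovered points."""
    n = len(embeddings)
    nn = NearestNeighbors(radius=delta).fit(embeddings)
    # neighbors[i] = indices of all points inside the delta-ball around point i
    neighbors = nn.radius_neighbors(embeddings, return_distance=False)

    covered = np.zeros(n, dtype=bool)
    selected = []
    for _ in range(budget):
        # coverage gain of each candidate = number of uncovered points in its ball
        gains = np.array([np.count_nonzero(~covered[nb]) for nb in neighbors])
        gains[selected] = -1                      # never pick the same center twice
        best = int(np.argmax(gains))
        selected.append(best)
        covered[neighbors[best]] = True
    return np.array(selected)
```

The greedy rule is the standard approximation for max coverage; the paper builds the $\delta$-ball graph once and removes covered nodes, which is equivalent to the loop above.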

Unfolding selection of ProbCover

ProbCover results in the Semi-Supervised training framework

DCoM

DCoM also employs a representation learning approach. Initially, a ball of radius $\Delta_{\text{avg}}$ is placed around each point; a per-example $\Delta$ list then assigns a specific radius to each labeled example. A subset of $b$ (budget) balls is chosen by how many points they cover, and the centers of these balls are the samples to be labeled. After training the model, the $\Delta$ list is updated according to the purity of the balls, yielding more accurate radii and coverage. DCoM uses this coverage to compute a competence score, which balances typicality and uncertainty.
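The radius-update step can be pictured with a short sketch. Here `preds` are class predictions of the trained model for all points, `center_labels` are the known labels of the labeled examples, purity is taken as the fraction of points in a ball that agree with its center's label, and the shrink/expand rule and threshold are illustrative assumptions rather than the paper's exact procedure:

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

def update_deltas(embeddings, preds, labeled_idx, center_labels, deltas,
                  purity_threshold=0.95, step=0.05):
    """Per-labeled-example radius update (illustrative rule):
    shrink an impure ball, slightly expand a pure one."""
    nn = NearestNeighbors().fit(embeddings)
    new_deltas = deltas.copy()
    for j, i in enumerate(labeled_idx):
        ball = nn.radius_neighbors(embeddings[i:i + 1], radius=deltas[j],
                                   return_distance=False)[0]
        # purity = fraction of points in the ball predicted as the center's label
        purity = np.mean(preds[ball] == center_labels[j])
        new_deltas[j] = deltas[j] * (1 - step) if purity < purity_threshold \
                        else deltas[j] * (1 + step)
    return new_deltas
```

The resulting coverage (fraction of points inside some labeled ball) feeds the competence score that trades off typicality against uncertainty as the budget grows.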

Illustration of DCoM's $\Delta$ updating

DCoM results in the Supervised training framework

DCoM results in the Semi-Supervised training framework

Usage

Please see USAGE for brief instructions on installation and basic usage examples.

Citing this Repository

This repository makes use of two other repositories, SCAN and Deep-AL. Please consider citing their work as well as ours:

@article{hacohen2022active,
  title={Active learning on a budget: Opposite strategies suit high and low budgets},
  author={Hacohen, Guy and Dekel, Avihu and Weinshall, Daphna},
  journal={arXiv preprint arXiv:2202.02794},
  year={2022}
}

@article{yehudaActiveLearningCovering2022,
  title={Active Learning Through a Covering Lens},
  author={Yehuda, Ofer and Dekel, Avihu and Hacohen, Guy and Weinshall, Daphna},
  journal={arXiv preprint arXiv:2205.11320},
  year={2022}
}

@article{mishal2024dcom,
  title={DCoM: Active Learning for All Learners},
  author={Mishal, Inbal and Weinshall, Daphna},
  journal={arXiv preprint arXiv:2407.01804},
  year={2024}
}

License

This toolkit is released under the MIT license. Please see the LICENSE file for more information.
