ColPali Cookbooks 👀

Introduction

With our new model ColPali, we propose leveraging vision language models (VLMs) to construct efficient multi-vector embeddings in the visual space for document retrieval. By feeding the patch embeddings output by the ViT of PaliGemma-3B to a linear projection, we create a multi-vector representation of each document. We train the model to maximize the similarity between these document embeddings and the embeddings of their corresponding queries, following the ColBERT method.
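
To make the ColBERT-style scoring concrete, here is a minimal sketch of late-interaction (MaxSim) scoring between one query and two candidate pages. It uses plain Python lists for readability. The function names and the toy vectors are made up for illustration and are not taken from the ColPali codebase, which operates on batched tensors.

```python
from typing import Sequence

Vector = Sequence[float]


def dot(u: Vector, v: Vector) -> float:
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def late_interaction_score(query: Sequence[Vector], document: Sequence[Vector]) -> float:
    """ColBERT-style MaxSim score.

    For each query token vector, keep the similarity of its best-matching
    document vector, then sum these maxima over all query tokens.
    """
    return sum(max(dot(q, d) for d in document) for q in query)


# Toy data (hypothetical): 2 query token vectors, 3 patch vectors per page.
query = [[1.0, 0.0], [0.0, 1.0]]
page_a = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
page_b = [[0.5, 0.5], [0.2, 0.1], [0.0, 0.3]]

scores = {
    "page_a": late_interaction_score(query, page_a),
    "page_b": late_interaction_score(query, page_b),
}
best_page = max(scores, key=scores.get)  # page with the highest score
```

Because each query token only needs its single best-matching patch, a page can score highly even when most of its patches are irrelevant to the query.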

This repository contains notebooks for learning about, fine-tuning, and adapting ColPali to your multimodal RAG use cases.

Category	Notebook	Description
Interpretability	ColPali: Generate your own similarity maps	Generate your own similarity maps to interpret ColPali's predictions.
Fine-tuning	Coming soon!
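
As a rough idea of what a similarity map is, the sketch below scores every image patch against a single query token and reshapes the result into the patch grid. It is not the notebook's code. The function name, the row-major patch order, and the 2x2 grid are assumptions chosen for illustration.

```python
from typing import List, Sequence

Vector = Sequence[float]


def similarity_map(query_token: Vector, patches: Sequence[Vector],
                   n_rows: int, n_cols: int) -> List[List[float]]:
    """Return an n_rows x n_cols grid of query-token/patch similarities.

    Assumes `patches` is listed in row-major order (left to right, top to bottom).
    """
    if len(patches) != n_rows * n_cols:
        raise ValueError("number of patches must equal n_rows * n_cols")
    # Dot product between the query token and every patch embedding.
    flat = [sum(a * b for a, b in zip(query_token, p)) for p in patches]
    # Cut the flat list into rows of the patch grid.
    return [flat[r * n_cols:(r + 1) * n_cols] for r in range(n_rows)]


# Toy data (hypothetical): one query token and a 2x2 grid of patch embeddings.
token = [1.0, 0.0]
patches = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [0.0, 0.0]]
heatmap = similarity_map(token, patches, n_rows=2, n_cols=2)

# Grid position (row, column) of the patch most similar to the token.
hottest = max(
    ((r, c) for r in range(2) for c in range(2)),
    key=lambda rc: heatmap[rc[0]][rc[1]],
)
```

Overlaying such a grid on the page image, one map per query token, shows which regions of the document drive the match.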

Instructions

Open with Colab

The easiest way to use the notebooks is to open one from the examples directory and click on its Colab button.

This will open the notebook in Google Colab, where you can run the code and experiment with the models.

Run locally

If you prefer to run the notebooks locally, you can clone the repository (tonywu71/colpali-cookbooks) and open the notebooks in Jupyter Notebook or in your IDE.

Citation

ColPali: Efficient Document Retrieval with Vision Language Models

Authors: Manuel Faysse*, Hugues Sibille*, Tony Wu*, Bilel Omrani, Gautier Viaud, Céline Hudelot, Pierre Colombo (* denotes equal contribution)

@misc{faysse2024colpaliefficientdocumentretrieval,
      title={ColPali: Efficient Document Retrieval with Vision Language Models}, 
      author={Manuel Faysse and Hugues Sibille and Tony Wu and Bilel Omrani and Gautier Viaud and Céline Hudelot and Pierre Colombo},
      year={2024},
      eprint={2407.01449},
      archivePrefix={arXiv},
      primaryClass={cs.IR},
      url={https://arxiv.org/abs/2407.01449}, 
}

