This repo contains benchmarks to measure Cody quality for different language models, programming languages and feature flags. This repo is a work-in-progress. More documentation will be added later. In the meantime, reach out to @olafurpg if you want to learn more.
Setting up the virtual environment:
- Install asdf
- Install uv (a Python package installer and resolver): `asdf install` (or `curl -LsSf https://astral.sh/uv/install.sh | sh` to install globally)
- To create a virtual environment: `uv venv`
- To activate the virtual environment: `source .venv/bin/activate`
- To install packages into the virtual environment: `uv pip install -r requirements.txt`
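
The setup steps above can also be collected into one script. The following is a minimal sketch, not something shipped in this repo: the `run` and `setup_env` helpers and the `DRY_RUN` variable are hypothetical names introduced here for illustration. It assumes asdf is already installed, that `asdf install` provides uv for this repo as described above, and that it is run from the repo root. By default it only prints the commands so you can check them first.

```shell
#!/usr/bin/env bash
# Sketch of the setup steps as a single script (helper names are hypothetical).
# Default is a dry run that only prints each command; set DRY_RUN=0 to execute.
set -eu

DRY_RUN="${DRY_RUN:-1}"

# Print a command, then run it unless this is a dry run.
run() {
  echo "+ $*"
  if [ "$DRY_RUN" = "0" ]; then
    "$@"
  fi
}

setup_env() {
  # Assumption: asdf installs the tools configured for this repo, including uv.
  run asdf install
  # Creates the virtual environment in .venv under the current directory.
  run uv venv
  # uv looks for .venv in the current directory, so no activation is needed here.
  run uv pip install -r requirements.txt
}

setup_env
```

A script cannot activate a virtual environment for the shell that called it, so after running it with `DRY_RUN=0` you would still run `source .venv/bin/activate` in your own shell before starting the leaderboard.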
To run the leaderboard: `streamlit run leaderboard.py`