Benchmark

This repository contains code and data used to benchmark different pull request review tools.

Overview

The goal of this benchmark is to simulate realistic scenarios where buggy code is submitted via pull requests. Each test case consists of a buggy version of a repository and a corresponding fixed version. By submitting a pull request from the buggy version to the fixed version, we can evaluate how effectively a review tool detects issues.

Repository Structure

Each folder in the main branch represents a fixed version of a repository where a specific issue has already been resolved.
Folder names follow the format: reponame_id, where:
- reponame is the name of the repository.
- id is a unique identifier for the test case.
For each fixed version, there is a corresponding buggy version located in a branch named: test_<reponame_id>

How It Works

The main branch contains the fixed versions of all test repositories.
The test_<reponame_id> branches contain the buggy versions of the same repositories.
To create a benchmark scenario:
- Create a pull request from the test_<reponame_id> branch to the main branch.
- This simulates a developer submitting a buggy pull request.
Run your pull request review tool on the simulated PR to evaluate its performance.

Example

Given a folder shoppingcart_01 in the main branch:

The fixed version of the shoppingcart repo is in main/shoppingcart_01.
The buggy version is in the branch test_shoppingcart_01.
You create a pull request from test_shoppingcart_01 to main, targeting the shoppingcart_01 folder.

Current Dataset

The current benchmark uses real-world issues extracted from the maniple dataset. Specifically, it focuses on bugs found in the tqdm project.

Test Cases: `tqdm`

Each test case below corresponds to a buggy-to-fixed repository pair.

Tqdm Issues

tqdm_1
- Related issues:
tqdm_3
- Related issues:
  - tqdm/tqdm#353
tqdm_4
- Related issues:
  - Not available
tqdm_5
- Related issues:
  - tqdm/tqdm#574
tqdm_6
- Related issues:
  - tqdm/tqdm#539
tqdm_8
- Related issues:
  - Not available

Thefuck Issues

thefuck_1
- Related issues:
  - nvbn/thefuck#1047
thefuck_2
- Related issues:
  - Not available
thefuck_3
- Related issues:
  - nvbn/thefuck#869
thefuck_4
- Related issues:
  - nvbn/thefuck#807
thefuck_5
Related issues:
- nvbn/thefuck#723
thefuck_7
- Related issues:
  - Not available
thefuck_9
- Related issues:
  - nvbn/thefuck#559
  - nvbn/thefuck#558

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Benchmark

Overview

Repository Structure

How It Works

Example

Current Dataset

Test Cases: `tqdm`

Tqdm Issues

Thefuck Issues

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
thefuck_1		thefuck_1
thefuck_2		thefuck_2
thefuck_3		thefuck_3
thefuck_4		thefuck_4
thefuck_5		thefuck_5
thefuck_7		thefuck_7
thefuck_9		thefuck_9
tqdm_1		tqdm_1
tqdm_3		tqdm_3
tqdm_4		tqdm_4
tqdm_5		tqdm_5
tqdm_6		tqdm_6
tqdm_8		tqdm_8
.gitignore		.gitignore
readme.md		readme.md

blarApp/open-benchmark

Folders and files

Latest commit

History

Repository files navigation

Benchmark

Overview

Repository Structure

How It Works

Example

Current Dataset

Test Cases: tqdm

Tqdm Issues

Thefuck Issues

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Test Cases: `tqdm`

Packages