-
-
Notifications
You must be signed in to change notification settings - Fork 8.5k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CI/Build] Fix torch nightly CI dependencies part 3
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#20378
opened Jul 2, 2025 by
zou3519
Loading…
3 of 4 tasks
[Bugfix] Remove executable flag on a few files related to flash_attn and flashinfer
v1
#20377
opened Jul 2, 2025 by
tlrmchlsmth
Loading…
[Docs] Update EAGLE example
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#20375
opened Jul 2, 2025 by
NickLucche
Loading…
[Misc] add handler HF_TOKEN is emptry string
ready
ONLY add when PR is ready to merge/full CI is needed
#20369
opened Jul 2, 2025 by
lengrongfu
Loading…
1 of 4 tasks
[Structured Outputs][V1] Skipping with models doesn't contain tokenizers
ready
ONLY add when PR is ready to merge/full CI is needed
structured-output
v1
#20365
opened Jul 2, 2025 by
aarnphm
Loading…
[Bugfix] Fix flaky ONLY add when PR is ready to merge/full CI is needed
test_streaming_response
test
ready
#20363
opened Jul 2, 2025 by
NickLucche
Loading…
[PP][V1]: Integrate Token Throttling into vLLM
v1
#20359
opened Jul 2, 2025 by
gty111
Loading…
4 tasks done
[Core] Move multimodal placeholder from chat utils to model definition
deepseek
Related to DeepSeek models
documentation
Improvements or additions to documentation
frontend
llama
Related to Llama models
qwen
Related to Qwen models
#20355
opened Jul 2, 2025 by
DarkLight1337
Loading…
2 of 4 tasks
[Installation] Fix python only installation wheel packaging missing libs
ci/build
#20351
opened Jul 2, 2025 by
yanyongyu
Loading…
[VLM] Add Nemotron-Nano-VL-8B-V1 support (WIP)
frontend
needs-rebase
#20349
opened Jul 2, 2025 by
kylehh
Loading…
[DP] Copy environment variables to Ray DPEngineCoreActors
v1
#20344
opened Jul 2, 2025 by
ruisearch42
Loading…
3 of 4 tasks
[BugFix] [P/D] Handle lookahead token count edge-case with Eagle Spec Decoding and P/D
v1
#20340
opened Jul 1, 2025 by
Pradyun92
Loading…
Add benchmark dataset for mlperf llama tasks
llama
Related to Llama models
perf-benchmarks
performance
Performance-related issues
#20338
opened Jul 1, 2025 by
mgoin
Loading…
Change default model to Qwen3-0.6B
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#20335
opened Jul 1, 2025 by
tlrmchlsmth
Loading…
[Misc] DP : Add ExpertTokensMetadata
#20332
opened Jul 1, 2025 by
varun-sundar-rabindranath
Loading…
[Perf] Optimize Vectorization Utils for Int 8 Quantization Kernels
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
#20331
opened Jul 1, 2025 by
yewentao256
Loading…
[ROCm] warpSize is being made non constexpr in ROCm 7.0
rocm
Related to AMD ROCm
#20330
opened Jul 1, 2025 by
gshtras
Loading…
Add MoE config files for Nvidia Pro 6000 Blackwell Workstation Edition
#20329
opened Jul 1, 2025 by
Chen-zexi
Loading…
1 of 4 tasks
[USAGE] Improve error handling for weight initialization in Unquantized…
documentation
Improvements or additions to documentation
v1
#20321
opened Jul 1, 2025 by
koiker
Loading…
3 of 4 tasks
HF Hub LoRA Resolver
ci/build
documentation
Improvements or additions to documentation
#20320
opened Jul 1, 2025 by
alex-jw-brooks
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.