Pull requests: vllm-project/vllm
Upstream fp8 with static scales gpt oss
gpt-oss
Related to GPT-OSS models
needs-rebase
#30357
opened Dec 9, 2025 by
maleksan85
Draft
[CI][DeepSeek] Add nightly DeepSeek R1 lm_eval tests on H200
ready
ONLY add when PR is ready to merge/full CI is needed
ci/build
deepseek
Related to DeepSeek models
#30356
opened Dec 9, 2025 by
MatthewBonanni
2 of 5 tasks
[Fix] Handle multiple tool calls in Qwen3-MTP tool parser
frontend
qwen
Related to Qwen models
tool-calling
#30353
opened Dec 9, 2025 by
ArkVex
[Bugfix] Cache added_vocab to avoid per-token overhead
#30351
opened Dec 9, 2025 by
scratch-ml
5 tasks
Remove virtual engine handling
codex
kv-connector
needs-rebase
qwen
Related to Qwen models
tpu
Related to Google TPUs
v1
#30350
opened Dec 9, 2025 by
WoosukKwon
[Docs]: adds a new metric vllm:request_prefill_kv_computed_tokens in docs
documentation
Improvements or additions to documentation
#30348
opened Dec 9, 2025 by
googs1025
5 tasks
[cpu][ci] Add CPU Attention Tests for Neon Backend
#30347
opened Dec 9, 2025 by
fadara01
2 tasks
[Core] Major fix catch backend grammar exceptions (xgrammar, outlines, etc) in scheduler
v1
#30346
opened Dec 9, 2025 by
blancsw
Fix typos in comments across multiple files
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#30345
opened Dec 9, 2025 by
wilsonwu
5 tasks
[Bugfix] Fix HunyuanOCR cross-image contamination in batch processing
#30344
opened Dec 9, 2025 by
anker-c2
3 of 5 tasks
[CI] refine more logic when generating and using nightly wheels & indices
ci/build
#30341
opened Dec 9, 2025 by
Harry-Chen
3 of 5 tasks
[CMake][Build]: Remove unused ACL CMake env variables
ci/build
#30339
opened Dec 9, 2025 by
Radu2k
Fix gigachat3 parser + update tests
frontend
tool-calling
#30338
opened Dec 9, 2025 by
ajpqs
3 of 5 tasks
[Bugfix]: Streaming i/o of batch files. Resolves #30268
ci/build
frontend
#30334
opened Dec 9, 2025 by
umgefahren
3 of 5 tasks
[Bugfix] Fix cuda graph sizes when running with speculative decoding
nvidia
#30330
opened Dec 9, 2025 by
PatrykSaffer
[BugFix] Fix hang issue in LMCache mp mode
kv-connector
v1
#30327
opened Dec 9, 2025 by
wz1qqx
5 tasks
[Frontend] [Doc] Exclude log deltas feature
frontend
#30322
opened Dec 9, 2025 by
Catacomba
3 tasks done
[BugFix] Spec decode with VLLM_ENABLE_V1_MULTIPROCESSING=0
v1
#30319
opened Dec 9, 2025 by
heheda12345
5 tasks