-
-
Notifications
You must be signed in to change notification settings - Fork 18.3k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[XPU][CI]Skip v1/spec_decode/test_speculators_correctness.py in intel GPU nightly
ci/build
intel-gpu
Related to Intel GPU
ready
ONLY add when PR is ready to merge/full CI is needed
#46356
opened Jun 22, 2026 by
zxd1997066
Contributor
Loading…
4 tasks done
[Test][KV Offloading] Add unit tests for OffloadingSpecFactory and SecondaryTierFactory
v1
#46355
opened Jun 22, 2026 by
Alex-ai-future
Contributor
Loading…
[CPU][Perf] Accelerate unquantized MoE for AArch64
ci/build
cpu
Related to CPU backends
gpt-oss
Related to GPT-OSS models
performance
Performance-related issues
#46353
opened Jun 22, 2026 by
fadara01
Contributor
Loading…
4 tasks
fix: stream Qwen3 tool call string arguments
qwen
Related to Qwen models
tool-calling
#46351
opened Jun 22, 2026 by
Palaiologos1453
Contributor
Loading…
[Frontend] Add TLS support with certificate/key files
frontend
needs-rebase
rust
#46350
opened Jun 22, 2026 by
badrinatarajan
•
Draft
test: run MoRIIO layout geometry on CPU
kv-connector
v1
#46349
opened Jun 22, 2026 by
fengjikui
Loading…
[Rust Frontend] Align Rust allowed_token_ids validation with Python
rust
#46348
opened Jun 22, 2026 by
reidliu41
Contributor
Loading…
4 tasks
[GDN] Improve kkt kernel of CuteDSL prefill backend
#46346
opened Jun 22, 2026 by
gau-nernst
Contributor
•
Draft
4 tasks
[Frontend] Fix Kimi K2 tool call IDs for required tool choice
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
tool-calling
#46344
opened Jun 22, 2026 by
chaunceyjiang
Collaborator
Loading…
4 tasks
fix: resolve issue #37729
structured-output
v1
#46342
opened Jun 22, 2026 by
KartavyaDikshit
Loading…
[Kernel] TD operand loads for batched MoE GEMM (moe_mmk) on XPU
intel-gpu
Related to Intel GPU
#46340
opened Jun 22, 2026 by
oonyshch
Loading…
[Bugfix] Re-enable FP8 MoE on NVIDIA Thor
bug
Something isn't working
ci/build
nvidia
#46339
opened Jun 22, 2026 by
DarkLight1337
Member
Loading…
4 tasks
[Profiler] Add execution trace capture to torch profiler config
documentation
Improvements or additions to documentation
v1
#46336
opened Jun 22, 2026 by
sachinkademane
Loading…
[Doc] Fix %% rendering in CLI reference for --safetensors-load-strategy
documentation
Improvements or additions to documentation
#46335
opened Jun 22, 2026 by
lizzzcai
Contributor
Loading…
[Bugfix][KV Connector] Mooncake: honor logical->physical block ratio in register_kv_caches
bug
Something isn't working
kv-connector
v1
#46334
opened Jun 22, 2026 by
llying-001
Loading…
3 of 4 tasks
[Bugfix][Reasoning] Fix SeedOSS streaming when start token is omitted
bug
Something isn't working
tool-calling
#46333
opened Jun 22, 2026 by
GodlyDonuts
Loading…
[ROCm][P/D] Support MoRIIO heterogeneous TP fan-in
kv-connector
rocm
Related to AMD ROCm
v1
#46332
opened Jun 22, 2026 by
tanpinsiang
Contributor
•
Draft
[Feature] FlashAttention prefill-context-parallel (PCP) support for GQA
v1
#46330
opened Jun 22, 2026 by
JaredforReal
Contributor
•
Draft
4 tasks
[XPU] update nixl to v1.2.0
ci/build
intel-gpu
Related to Intel GPU
kv-connector
#46327
opened Jun 22, 2026 by
zhenwei-intel
Contributor
Loading…
4 tasks
Add HiSparse MLA decode
ci/build
kv-connector
nvidia
v1
#46326
opened Jun 22, 2026 by
faresobeid
•
Draft
4 tasks
fix(cudagraph): align spec-decode capture sizes for PIECEWISE mode
nvidia
#46324
opened Jun 22, 2026 by
davidzha712
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.