Pull requests: alibaba/rtp-llm

Pull requests list

feat: add RDMA transport for ViT
#1105 opened Jun 15, 2026 by ydshi0
[ROCm] Prefill performance optimization for embedding models
#1102 opened Jun 15, 2026 by liaocz Collaborator
Feature/vit rpc metrics worker status
#1101 opened Jun 15, 2026 by yzyDavid Collaborator
Feat/support dash frontend
#1100 opened Jun 15, 2026 by wanglining97 Collaborator
fix: size flashinfer prefill workspace dynamically
#1099 opened Jun 15, 2026 by Vinkle-hzt Collaborator
generic layer-wise KV cache specs
#1097 opened Jun 15, 2026 by Adrenaline-S Collaborator
feat(xpu): XPU module factories and bindings (4/4)
#1096 opened Jun 13, 2026 by aslanxie
feat(online_optimizer): support theoretical hit rate statistics in flexlb
#1095 opened Jun 12, 2026 by YoungRX Collaborator
Feat/adapt sm120 rtx5000pro merge
#1094 opened Jun 12, 2026 by parkerpang Collaborator
Feature/offline inference scheduling
#1093 opened Jun 11, 2026 by HoniiTro19
perf: remove redundant kv cache update for finished stream
#1092 opened Jun 11, 2026 by zhangjianning-zjn Collaborator
fix: fix 1s stall during polling output of generate stream
#1091 opened Jun 11, 2026 by zhangjianning-zjn Collaborator
fix: fix metric reporting on waiting time of generate stream
#1090 opened Jun 11, 2026 by zhangjianning-zjn Collaborator
perf(rocm): enable FlyDSL fused MoE for MI308X Qwen3.5 decode
#1087 opened Jun 11, 2026 by chengshu-lcc Collaborator
feat(rocm): support Qwen3/3.5 VL model on ROCm
#1086 opened Jun 11, 2026 by liaocz Collaborator
feat: add prompt scoring (per-position logits for input sequences)
#1081 opened Jun 10, 2026 by theNiemand Collaborator
feat: add simple scheduler for ViT
#1079 opened Jun 9, 2026 by ydshi0
feat(omni): add Qwen2.5-Omni multi-stage pipeline support
#1074 opened Jun 7, 2026 by stmatengss
1 of 3 tasks
feat(p2p): decode_entrance P2P support with lease-based race fix
#1067 opened Jun 4, 2026 by ZhihanYan Collaborator
7 tasks