Pull requests: lightseekorg/tokenspeed
Pull requests list
feat(kernel): Tie weight preprocessors to kernels
#480
opened Jun 18, 2026 by
Max191
Contributor
test(agentic): add EvalScope trie benchmark protocol
#466
opened Jun 17, 2026 by
Xiangyi1996
Collaborator
Draft
test(ci): add DeepSeek-V4-Flash MTP AIME25 eval
#461
opened Jun 16, 2026 by
dongjiyingdjy
Contributor
perf(kernel): optimize Qwen vision QKV rotary layout
#456
opened Jun 15, 2026 by
qimcis
Contributor
fix(scheduler): release paged-cache snapshots in ~HybridPrefixCache to avoid teardown use-after-free
#455
opened Jun 15, 2026 by
Sunt-ing
[WIP] EPD: encode-worker path, async embedding receive, E2P row-sharding
#437
opened Jun 12, 2026 by
chenht2022
Contributor
Draft
Fix EP8 DP/TP RSAG init and empty LM head
#416
opened Jun 11, 2026 by
yubofredwang
Contributor
Port mamba2 kernels and runtime from sglang#03c77dc
#412
opened Jun 10, 2026 by
netanel-haber
perf(gdn): fuse causal_conv1d and QKV split for GDN prefill
#382
opened Jun 8, 2026 by
elwhyjay
Contributor
fix(scheduler): publish prefix to radix tree during prefill for non-hybrid models
#381
opened Jun 8, 2026 by
qywu
Collaborator
fix(cache): Coarsely fence the compute stream behind the host loadback stream on.
inactive
#370
opened Jun 6, 2026 by
LorrinWWW
Contributor
[Bugfix] fix smg command argument cannot be used multiple times
inactive
#366
opened Jun 6, 2026 by
lengrongfu
fix(kernel): tile merge_state over heads to avoid >1024 block
inactive
#330
opened Jun 1, 2026 by
elwhyjay
Contributor
perf(model_loader): multi-threaded safetensors weight loading
inactive
#287
opened May 28, 2026 by
yuanqingz
Add Triton sampling backends alongside FlashInfer
#280
opened May 27, 2026 by
FlamingoPg
Contributor