Pull requests: llm-d/llm-d-workload-variant-autoscaler

Pull requests list

feat: add SGLang inference-engine backend support
#1351 opened Jun 27, 2026 by MohanKumar21
3 tasks done
test: cover LWS multi-vendor GPU resources
#1350 opened Jun 27, 2026 by xuhui-lu Contributor
feat: always enable HTTP/2 and set NextProtos for ALPN
#1346 opened Jun 26, 2026 by ugiordan
3 tasks done
docs/proposals: revive the QueueingModelAnalyzer (SLO-driven scaling) area/engine
#1343 opened Jun 25, 2026 by atantawi Collaborator
1 of 3 tasks
fix: exclude KEDA-generated HPAs from the collector's managed-HPA index backport/v0.8
#1341 opened Jun 25, 2026 by ieaves Contributor
3 tasks
Bumping llm-d release to latest version v0.8.0 area/installation ready-for-review
#1337 opened Jun 25, 2026 by dumb0002 Collaborator
3 tasks
feat(config): allow opt-in plain HTTP Prometheus endpoints
#1335 opened Jun 24, 2026 by Goutham-Annem Contributor
1 of 3 tasks
fix(saturation): log actual analyzer mode instead of hardcoded value
#1334 opened Jun 24, 2026 by Goutham-Annem Contributor
1 of 3 tasks
Initial gpu preference design
#1332 opened Jun 24, 2026 by asm582 Collaborator Draft
3 tasks done
feat: Observability: PrometheusRule Alerting Rules area/observability ready-for-review
#1328 opened Jun 24, 2026 by shuynh2017 Collaborator
1 of 3 tasks
Add proposal for hpa based saturation config area/coordinator
#1322 opened Jun 23, 2026 by asm582 Collaborator
deps(actions): bump actions/checkout from 6 to 7 dependencies
#1307 opened Jun 22, 2026 by dependabot Bot
blog: goodput metric.
#1294 opened Jun 18, 2026 by lionelvillard Collaborator Draft
chore(deps): upgrade Go Kubernetes stack
#1292 opened Jun 18, 2026 by yankay Contributor
4 tasks done
chore(deps): bump sigs.k8s.io/lws to v0.9.0
#1291 opened Jun 18, 2026 by yankay Contributor Draft
3 tasks done
test(e2e): KV cache saturation scale-up to maxReplicas=10
#1283 opened Jun 16, 2026 by asm582 Collaborator Draft
3 tasks
docs(proposals): minimal ScalingPolicy CRD core (split from #1194) area/config area/engine lgtm
#1245 opened Jun 7, 2026 by ev-shindin Collaborator
docs(proposals): add priority-weighted rescale proposal area/engine lgtm
#1238 opened Jun 4, 2026 by ev-shindin Collaborator