test: cover LWS multi-vendor GPU resources by xuhui-lu · Pull Request #1350 · llm-d/llm-d-workload-variant-autoscaler

xuhui-lu · 2026-06-27T13:09:49Z

Summary

add LWS GPU counting coverage for supported resource names: NVIDIA, AMD, Intel Gaudi, Intel i915, and Intel Xe
add a mixed-vendor LWS case across leader and worker containers
keep this as a unit-test-only follow-up for the multi-vendor GPU work

Part of #1106.

Testing

go test ./internal/utils/scaletarget
go test ./internal/utils/...

Also tried go test ./...; it is not clean locally for unrelated reasons:

internal/engines/analyzers/throughput: FitITLModel flat-line test currently fails
test/chart: local helm binary is not installed
test/e2e: entered cluster preflight / auth flow and was interrupted

Signed-off-by: Xuhui Lu <swimming.fish06@gmail.com>

Copilot

Pull request overview

This PR extends unit-test coverage for LWSAccessor.GetTotalGPUsPerReplica() to ensure GPU counting works across the full set of supported vendor resource names (NVIDIA, AMD, Intel Gaudi, Intel i915, Intel Xe), including a mixed-vendor leader/worker scenario. This supports the broader multi-vendor GPU support work tracked in #1106 by guarding against regressions in GPU resource parsing for LeaderWorkerSet scale targets.

Changes:

Added a table-driven unit test covering all supported GPU resource names for LWS GPU counting.
Added a mixed-vendor LWS test case that combines different vendor resources across leader and worker containers.
Introduced small test helpers to reduce duplication when building LWS/container GPU requests.
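The table-driven shape described above could look roughly like the sketch below. It is not the code from this PR. It uses a plain `container` struct as a stand-in for the Kubernetes container type, and `gpuContainer`, `countGPUs`, and `checkCases` are hypothetical names. The resource-name strings are the commonly used device-plugin names and are assumed, not confirmed, to match the list the repository supports. The real accessor may also weight worker GPUs by group size, which this sketch does not model.

```go
package main

import "fmt"

// gpuResourceNames is an ASSUMED list of vendor resource names
// (NVIDIA, AMD, Intel Gaudi, Intel i915, Intel Xe).
var gpuResourceNames = []string{
	"nvidia.com/gpu",
	"amd.com/gpu",
	"habana.ai/gaudi",
	"gpu.intel.com/i915",
	"gpu.intel.com/xe",
}

// container is a simplified stand-in for a pod container:
// requests maps a resource name to the requested quantity.
type container struct {
	requests map[string]int64
}

// gpuContainer is a hypothetical helper that builds a container
// requesting n units of a single resource.
func gpuContainer(resource string, n int64) container {
	return container{requests: map[string]int64{resource: n}}
}

// countGPUs sums the requests for every known GPU resource name
// across all given containers. Unknown resources are ignored.
func countGPUs(containers ...container) int64 {
	var total int64
	for _, c := range containers {
		for _, name := range gpuResourceNames {
			total += c.requests[name]
		}
	}
	return total
}

// checkCases runs one case per resource name plus a mixed-vendor
// case and returns a message for every mismatch.
func checkCases() []string {
	type testCase struct {
		name       string
		containers []container
		want       int64
	}
	var cases []testCase
	for _, r := range gpuResourceNames {
		cases = append(cases, testCase{
			name:       r,
			containers: []container{gpuContainer(r, 2)},
			want:       2,
		})
	}
	// Leader and worker containers request GPUs from different vendors.
	cases = append(cases, testCase{
		name: "mixed-vendor leader and worker",
		containers: []container{
			gpuContainer("nvidia.com/gpu", 1),
			gpuContainer("amd.com/gpu", 2),
		},
		want: 3,
	})

	var failures []string
	for _, tc := range cases {
		if got := countGPUs(tc.containers...); got != tc.want {
			failures = append(failures,
				fmt.Sprintf("%s: got %d, want %d", tc.name, got, tc.want))
		}
	}
	return failures
}
```

Generating one case per entry in the resource list keeps the table in step with the list, so a newly supported resource name is covered without a hand-written case.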

ev-shindin

LGTM. See a few minor (non-blocking) comments.


ev-shindin · 2026-06-30T05:18:13Z

/ok-to-test

github-actions · 2026-06-30T05:18:22Z

🚀 Kind E2E (full) triggered by /ok-to-test

View the Kind E2E workflow run

github-actions · 2026-06-30T05:18:28Z

🚀 OpenShift E2E — approve and run (/ok-to-test)

View the OpenShift E2E workflow run

github-actions · 2026-06-30T05:21:22Z

GPU Pre-flight Check ✅

GPUs are available for e2e-openshift tests. Proceeding with deployment.

Resource	Total	Allocated	Available
GPUs	50	27	23

Cluster	Value
Nodes	16 (7 with GPUs)
Total CPU	993 cores
Total Memory	10383 Gi
GPUs required	4 (min) / 6 (recommended)

ev-shindin

Thanks @xuhui-lu! As a follow-up: every existing and new test sets a non-zero GPU request, so the if total == 0 { return 1 } default is never exercised. Change it to return 0 and the whole suite still passes. Please add a CPU-only (no GPU requests) LWS case asserting 1.
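The coverage gap described here can be illustrated with a small sketch. It is not the repository's implementation: `gpusPerReplica` is a hypothetical function, and the `fallback` parameter exists only so the original default (1) and the mutated default (0) can be compared side by side.

```go
package main

// gpusPerReplica sums per-container GPU requests and returns
// fallback when the sum is zero. In the code under review the
// fallback is hard-coded to 1; it is a parameter here (an
// assumption made for illustration) to compare both variants.
func gpusPerReplica(requests []int64, fallback int64) int64 {
	var total int64
	for _, r := range requests {
		total += r
	}
	if total == 0 {
		return fallback
	}
	return total
}

// distinguishes reports whether a given set of requests produces
// different results for the original default (1) and the mutated
// default (0), i.e. whether a test on it would catch the mutation.
func distinguishes(requests []int64) bool {
	return gpusPerReplica(requests, 1) != gpusPerReplica(requests, 0)
}
```

With any non-zero request, `distinguishes` returns false, which is why the existing suite still passes after the mutation. Only a case with no GPU requests, such as `distinguishes(nil)`, returns true, so a CPU-only case asserting 1 is what pins the default.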

Copilot AI review requested due to automatic review settings June 27, 2026 13:09

test: cover LWS multi-vendor GPU resources

fddeeb9

Signed-off-by: Xuhui Lu <swimming.fish06@gmail.com>

Copilot started reviewing on behalf of xuhui-lu June 27, 2026 13:10

xuhui-lu force-pushed the xlu/1106-lws-multivendor-tests branch from d7a5a23 to fddeeb9 June 27, 2026 13:10

Copilot AI reviewed Jun 27, 2026


xuhui-lu marked this pull request as draft June 27, 2026 13:20

xuhui-lu marked this pull request as ready for review June 28, 2026 07:54

ev-shindin previously approved these changes Jun 29, 2026


Comment thread internal/utils/scaletarget/lws_test.go

Comment thread internal/utils/scaletarget/lws_test.go

test: address LWS GPU resource review nits

6b57c8d

Signed-off-by: Xuhui Lu <swimming.fish06@gmail.com>

xuhui-lu dismissed ev-shindin’s stale review via 6b57c8d June 29, 2026 17:04

ev-shindin approved these changes Jun 30, 2026


ev-shindin merged commit 3be8e78 into llm-d:main Jun 30, 2026
19 checks passed

xuhui-lu mentioned this pull request Jul 1, 2026

test: cover CPU-only LWS GPU default #1372

Open

ev-shindin merged 2 commits into llm-d:main from xuhui-lu:xlu/1106-lws-multivendor-tests

3 participants
