Skip to content

Update dsr1-fp8-h200-sglang SGLang image to v0.5.12-cu130#1423

Merged
functionstackx merged 2 commits into
mainfrom
claude/issue-1154-dsr1-fp8-h200-sglang
May 17, 2026
Merged

Update dsr1-fp8-h200-sglang SGLang image to v0.5.12-cu130#1423
functionstackx merged 2 commits into
mainfrom
claude/issue-1154-dsr1-fp8-h200-sglang

Conversation

@Klaud-Cold
Copy link
Copy Markdown
Collaborator

Updates SGLang image for dsr1-fp8-h200-sglang from v0.5.11-cu130 to v0.5.12-cu130.
\nRef #1154

Generated with Claude Code

Ref #1154

Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com>
@github-actions
Copy link
Copy Markdown
Contributor

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

Copy link
Copy Markdown
Contributor

@claude claude Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM — straightforward SGLang image version bump.

Extended reasoning...

Overview

This PR is a simple two-line version bump: lmsysorg/sglang:v0.5.11-cu130v0.5.12-cu130 for the dsr1-fp8-h200-sglang config, plus a corresponding entry appended to perf-changelog.yaml. No runtime code is touched — only declarative YAML config.

Security risks

None. This is a third-party container image tag bump in a config file; there is no auth, crypto, or permission surface affected. The image registry and namespace are unchanged.

Level of scrutiny

Low. Mechanical config-only change following the exact same pattern as the surrounding recent commits (e.g. 40f8c47, 539ea1b, c09c3e2 — all identical image-bump PRs). The full-sweep-enabled label indicates the benchmark sweep will validate the new image works as expected.

Other factors

The bug hunting system flagged a single nit — the new changelog entry uses pr-link: XXX instead of the actual PR URL. That inline comment is already attached; this is changelog metadata only and does not affect runtime behavior, so it is not worth blocking on. The author can fix it in place before merge.

Comment thread perf-changelog.yaml
Comment on lines +2552 to +2556
- config-keys:
- dsr1-fp8-h200-sglang
description:
- "Update SGLang image from v0.5.11-cu130 to v0.5.12-cu130"
pr-link: XXX
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🟡 The new changelog entry for dsr1-fp8-h200-sglang has pr-link: XXX instead of an actual PR URL. Every other entry in this file uses the format https://github.com/SemiAnalysisAI/InferenceX/pull/<number> (e.g. line 2550 uses /pull/1402). It should be https://github.com/SemiAnalysisAI/InferenceX/pull/1423 to match this PR.

Extended reasoning...

What the bug is

The diff appends a new entry to perf-changelog.yaml (lines 2552-2556) recording the SGLang image bump for dsr1-fp8-h200-sglang. The final field is:

  pr-link: XXX

XXX is a placeholder from the PR-generation template (AGENTS.md uses pull/XXX as the placeholder pattern to be filled in). It was never replaced with the real URL before the PR was opened.

Why the convention says this should be a URL

The file contains over 100 entries, and the standard form is a fully qualified GitHub PR URL — for example, the entry immediately above this one (line 2550) reads:

  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1402

Since this PR is #1423, the correct value is https://github.com/SemiAnalysisAI/InferenceX/pull/1423.

Step-by-step proof

  1. Open perf-changelog.yaml and jump to the new entry at line 2552.
  2. Observe line 2556: pr-link: XXX.
  3. Compare to the preceding entry at line 2550: pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1402 — a proper URL.
  4. grep -c "pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/" perf-changelog.yaml shows the URL form is the convention used throughout the file.
  5. The PR number for this change is 1423 (per the PR metadata), so the placeholder resolves to https://github.com/SemiAnalysisAI/InferenceX/pull/1423.

Addressing the refutation

One refutation noted that line 2502 (the minimaxm2.5-fp8-mi355x-vllm entry) also contains pr-link: XXX, and concluded that XXX is therefore an "accepted convention." I disagree: a single prior occurrence among 100+ entries — versus the overwhelmingly consistent URL format used everywhere else — is more naturally explained as another unfilled placeholder that slipped past review than as an accepted convention. The AGENTS.md template uses pull/XXX as the explicit fill-in marker, which is exactly what is left here. The right fix is to replace XXX with the real URL for this PR (and, separately, ideally backfill line 2502 too, though that is out of scope here).

How to fix

Replace line 2556:

-  pr-link: XXX
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1423

Impact

Low — this is changelog metadata, not runtime behavior. But the link will not resolve as written, which defeats the purpose of including it, so it is worth fixing before merge.

@github-actions
Copy link
Copy Markdown
Contributor

1 similar comment
@github-actions
Copy link
Copy Markdown
Contributor

@functionstackx
Copy link
Copy Markdown
Collaborator

/reuse-sweep-run

@functionstackx functionstackx merged commit aa2df95 into main May 17, 2026
14 of 19 checks passed
@functionstackx functionstackx deleted the claude/issue-1154-dsr1-fp8-h200-sglang branch May 17, 2026 07:41
@github-actions
Copy link
Copy Markdown
Contributor

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

Development

Successfully merging this pull request may close these issues.

2 participants