[AMD/ROCm] qwen3.5-fp8-mi355x-atom, Bump image to rocm/atom:rocm7.2.3_ubuntu24.04_py3.12_pytorch_release_2.10.0_atom20260511 #1389

Closed
seungrokj wants to merge 3 commits into SemiAnalysisAI:main from seungrokj:update-qwen35-fp8-mi355x-atom

@seungrokj
Collaborator

Summary

  • Bump ATOM image for qwen3.5-fp8-mi355x-atom from rocm/atom:rocm7.2.2_ubuntu24.04_py3.12_pytorch_release_2.10.0_atom0.1.2.post to rocm/atom-dev:nightly_202605111702
  • TP=4 shows a +2.6% to +16.3% throughput improvement across the 1k1k and 8k1k workloads (concurrency 4-256)
  • Validated via ATOM upstream nightly benchmark run #25686894636 (2026-05-11)

Throughput Comparison (tput/GPU, tok/s)

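| ISL | OSL | Conc | GPUs | InferenceX | ATOM Upstream | Diff % |
|-----:|-----:|-----:|-----:|-----------:|--------------:|-------:|
| 1024 | 1024 | 4 | 4 | 208.99 | 221.58 | +6.0% |
| 1024 | 1024 | 8 | 4 | 373.92 | 407.45 | +9.0% |
| 1024 | 1024 | 16 | 4 | 599.13 | 627.41 | +4.7% |
| 1024 | 1024 | 32 | 4 | 955.48 | 998.01 | +4.5% |
| 1024 | 1024 | 64 | 4 | 1368.61 | 1429.45 | +4.4% |
| 1024 | 1024 | 128 | 4 | 1929.98 | 1992.52 | +3.2% |
| 1024 | 1024 | 256 | 4 | 2861.12 | 2936.57 | +2.6% |
| 8192 | 1024 | 4 | 4 | 811.98 | 944.35 | +16.3% |
| 8192 | 1024 | 8 | 4 | 1496.50 | 1631.33 | +9.0% |
| 8192 | 1024 | 16 | 4 | 2276.05 | 2417.82 | +6.2% |
| 8192 | 1024 | 32 | 4 | 3388.72 | 3619.13 | +6.8% |
| 8192 | 1024 | 64 | 4 | 4498.52 | 4775.50 | +6.2% |
| 8192 | 1024 | 128 | 4 | 5321.91 | 5943.18 | +11.7% |
| 8192 | 1024 | 256 | 4 | 6925.46 | 7330.06 | +5.8% |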
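
The Diff % column appears to be the relative change of the ATOM Upstream throughput over the InferenceX baseline, i.e. `(upstream / baseline - 1) * 100`. A minimal Python sketch for re-deriving it is below. It assumes that formula; the values are copied from the table, and the names `ROWS` and `pct_diff` are illustrative and not part of the repository.

```python
# Rows copied from the throughput table:
# (ISL, OSL, concurrency, InferenceX tok/s/GPU, ATOM Upstream tok/s/GPU), all at 4 GPUs.
ROWS = [
    (1024, 1024, 4, 208.99, 221.58),
    (1024, 1024, 8, 373.92, 407.45),
    (1024, 1024, 16, 599.13, 627.41),
    (1024, 1024, 32, 955.48, 998.01),
    (1024, 1024, 64, 1368.61, 1429.45),
    (1024, 1024, 128, 1929.98, 1992.52),
    (1024, 1024, 256, 2861.12, 2936.57),
    (8192, 1024, 4, 811.98, 944.35),
    (8192, 1024, 8, 1496.50, 1631.33),
    (8192, 1024, 16, 2276.05, 2417.82),
    (8192, 1024, 32, 3388.72, 3619.13),
    (8192, 1024, 64, 4498.52, 4775.50),
    (8192, 1024, 128, 5321.91, 5943.18),
    (8192, 1024, 256, 6925.46, 7330.06),
]


def pct_diff(baseline: float, candidate: float) -> float:
    """Relative change of candidate over baseline, in percent."""
    return (candidate / baseline - 1.0) * 100.0


# One rounded Diff % per table row, in table order.
diffs = [round(pct_diff(base, new), 1) for _, _, _, base, new in ROWS]

for (isl, osl, conc, base, new), d in zip(ROWS, diffs):
    print(f"{isl:>5} {osl:>5} {conc:>4} {base:>9.2f} {new:>9.2f} {d:+.1f}%")

print(f"range: {min(diffs):+.1f}% to {max(diffs):+.1f}%")
```

Run against the numbers above, this reproduces every Diff % entry in the table. The smallest gain is the 1k1k, concurrency-256 row and the largest is the 8k1k, concurrency-4 row.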

Changes

  • .github/configs/amd-master.yaml: Update image tag for qwen3.5-fp8-mi355x-atom
  • perf-changelog.yaml: Add changelog entry

Test plan

  • Verify benchmark runs successfully with new image on MI355X

🤖 Generated with Claude Code

TP=4 shows a +2.6% to +16.3% throughput improvement across the 1k1k and 8k1k
workloads (concurrency 4-256) compared to the current InferenceX baseline
(rocm/atom:rocm7.2.2_ubuntu24.04_py3.12_pytorch_release_2.10.0_atom0.1.2.post).

Validated via ATOM upstream nightly benchmark run #25686894636 (2026-05-11).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Contributor

claude (bot) left a comment

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

seungrokj changed the title from "Bump qwen3.5-fp8-mi355x-atom image to nightly_202605111702" to "[AMD/ROCm] qwen3.5-fp8-mi355x-atom, Bump image to nightly_202605111702" on May 15, 2026
seungrokj changed the title from "[AMD/ROCm] qwen3.5-fp8-mi355x-atom, Bump image to nightly_202605111702" to "[AMD/ROCm] qwen3.5-fp8-mi355x-atom, Bump image to rocm/atom-dev:nightly_202605111702" on May 15, 2026
@seungrokj
Collaborator Author

TODO: switch the Docker image to a public one.

seungrokj changed the title from "[AMD/ROCm] qwen3.5-fp8-mi355x-atom, Bump image to rocm/atom-dev:nightly_202605111702" to "[AMD/ROCm] qwen3.5-fp8-mi355x-atom, Bump image to rocm/atom:rocm7.2.3_ubuntu24.04_py3.12_pytorch_release_2.10.0_atom20260511" on May 16, 2026