feat(ci): G3 perf regression gate + allowlist governance docs by flyingrobots · Pull Request #293 · flyingrobots/echo

flyingrobots · 2026-03-08T08:44:03Z

Summary

- G3 perf regression gate ([P0][PRF] Criterion baseline + regression threshold for materialization path #280): Criterion benchmark output is now compared against a git-tracked perf-baseline.json with a configurable regression threshold (default 15%). A structured perf-report.json artifact is uploaded alongside the raw perf.log. A separate perf-baseline-update.yml workflow auto-generates baseline update PRs on pushes to main that touch Rust sources.
- Allowlist governance (Docs: Document ban-nondeterminism.sh allowlist process in RELEASE_POLICY.md #287): Added a "Determinism Allowlist Governance" section to docs/RELEASE_POLICY.md documenting when exemptions are acceptable, approval requirements, and audit cadence. The section is cross-referenced from the ban-nondeterminism.sh header.
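
To make the threshold rule in the first item concrete, here is a minimal sketch of a percentage-based check. It is not the code from check_perf_regression.cjs: the function name `classify`, the `OK`/`REGRESSION`/`NEW` status strings, and the assumption that timings are plain nanosecond numbers are all hypothetical. Only the 15% default comes from the PR description.

```javascript
// Hypothetical helper; names and statuses are illustrative assumptions.
const DEFAULT_THRESHOLD_PCT = 15; // default stated in the PR summary

function classify(baselineNs, currentNs, thresholdPct = DEFAULT_THRESHOLD_PCT) {
  // No usable baseline entry (for example, the seeded-empty baseline):
  // there is nothing to compare against, so do not flag a regression.
  if (!(baselineNs > 0)) {
    return { status: "NEW", changePct: null };
  }
  // Positive changePct means the current run is slower than the baseline.
  const changePct = ((currentNs - baselineNs) / baselineNs) * 100;
  const status = changePct > thresholdPct ? "REGRESSION" : "OK";
  return { status, changePct };
}

console.log(classify(1000, 1100).status);      // 10% slower than baseline
console.log(classify(1000, 1200).status);      // 20% slower than baseline
console.log(classify(undefined, 1200).status); // no baseline entry
```

Treating a missing baseline entry as non-failing matches the first-run behavior listed in the test plan, where an empty baseline exits 0.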

New files

| File | Purpose |
| --- | --- |
| `scripts/check_perf_regression.cjs` | G3 gate: compare bencher output against baseline, fail on regression |
| `scripts/generate_perf_baseline.cjs` | Generate `perf-baseline.json` from criterion bencher output |
| `.github/workflows/perf-baseline-update.yml` | Auto-PR baseline updates on main push |
| `perf-baseline.json` | Git-tracked performance baseline (seeded empty; first main push populates) |
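
As a rough illustration of what generating a baseline from bencher output could involve, the sketch below parses bencher-style lines of the form `test <name> ... bench: <n> ns/iter (+/- <m>)` into a name-to-timing map. The function name `parseBencher`, the output shape, and the sample benchmark names are assumptions; the actual schema of perf-baseline.json is not shown in this PR page.

```javascript
// Hypothetical parser; the real generate_perf_baseline.cjs may differ.
function parseBencher(text) {
  const results = {};
  // Matches e.g. "test some/bench ... bench:   1,234 ns/iter (+/- 56)".
  const lineRe = /^test (.+?) \.\.\. bench:\s+([\d,]+) ns\/iter \(\+\/- ([\d,]+)\)/;
  for (const line of text.split("\n")) {
    const m = lineRe.exec(line.trim());
    if (!m) continue; // skip anything that is not a benchmark result line
    results[m[1]] = {
      ns: Number(m[2].replace(/,/g, "")),     // reported time per iteration
      spread: Number(m[3].replace(/,/g, "")), // the "+/-" figure
    };
  }
  return results;
}

// Sample input with made-up benchmark names.
const sample = [
  "test materialize/small ... bench:        1200 ns/iter (+/- 35)",
  "test materialize/large ... bench:   1,450,000 ns/iter (+/- 12,000)",
  "some unrelated log line",
].join("\n");

console.log(JSON.stringify(parseBencher(sample), null, 2));
```

Accepting thousands separators is a defensive choice here, since the exact number formatting of the log is not confirmed by the source.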

Test plan

Pre-commit: cargo fmt, clippy, markdown lint — clean
Pre-push: fmt, clippy, tests, rustdoc, patterns, determinism — all passed
check_perf_regression.cjs exits 0 with empty baseline (first-run behavior)
CI: det-gates G3 job runs comparison step
CI: perf-baseline-update.yml creates baseline PR on first main push

Closes #280
Closes #287

- G3 gate now compares criterion bencher output against git-tracked perf-baseline.json, failing on >15% regression (configurable).
- New perf-baseline-update.yml workflow auto-generates baseline update PRs on main pushes that touch Rust sources.
- Added check_perf_regression.cjs (gate) and generate_perf_baseline.cjs (baseline generation from bencher output).
- Added "Determinism Allowlist Governance" section to RELEASE_POLICY.md documenting exemption criteria, approval requirements, and audit cadence.
- Added cross-reference from ban-nondeterminism.sh to governance policy.

Closes #280
Closes #287

coderabbitai · 2026-03-08T08:44:20Z

Warning

Rate limit exceeded

@flyingrobots has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 2 minutes and 42 seconds before requesting another review.

📥 Commits

Reviewing files that changed from the base of the PR and between 78fc92d and 6c039da.

📒 Files selected for processing (8)

.github/workflows/det-gates.yml
.github/workflows/perf-baseline-update.yml
CHANGELOG.md
docs/RELEASE_POLICY.md
perf-baseline.json
scripts/ban-nondeterminism.sh
scripts/check_perf_regression.cjs
scripts/generate_perf_baseline.cjs

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b94868b54f

chatgpt-codex-connector · 2026-03-08T08:48:37Z

+  const report = [];
+  let regressions = 0;
+
+  for (const name of benchNames) {


Fail when baseline benchmarks disappear from current run

Iterating only Object.keys(current) means the gate never checks benchmarks that exist in perf-baseline.json but are missing from the new output, so a renamed/removed benchmark can silently bypass regression enforcement and still report G3: PASSED. This undermines the accuracy of the regression gate in exactly the cases where benchmark coverage changes, so the comparison should include baseline-only entries (at least as a hard failure or explicit review-required state).

- Add step id to "Generate baseline JSON" and wire skip output into "Create baseline PR" if-condition (was dead GITHUB_OUTPUT write).
- Add date prefix to baseline branch name to avoid short-SHA collisions.
- Remove redundant git-diff guard (now handled by step skip logic).
- Warn when baseline benchmarks are missing from current criterion run.
- Remove unused path import from check_perf_regression.cjs.

… run Promote missing-benchmark detection from WARN to FAIL. A benchmark that exists in perf-baseline.json but is absent from the current criterion output now counts as a regression, preventing silent bypass via benchmark renames or removals. Resolution: update perf-baseline.json to remove the stale entry. Addresses chatgpt-codex-connector review comment on PR #293.

flyingrobots · 2026-03-08T09:08:38Z

Fixed in 6c039da: Missing benchmarks now fail G3 gate instead of warn-only. A benchmark present in perf-baseline.json but absent from criterion output counts as a regression with status MISSING. Resolution path: update the baseline to remove the stale entry.
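
The fix described above amounts to iterating over the union of baseline and current benchmark names rather than only the current ones. The following is a minimal sketch of that idea, not the code from 6c039da: the function name `compare`, the name-to-nanoseconds input shape, and the `OK`/`NEW`/`REGRESSION` statuses are assumptions. Only the `MISSING` status and the rule that it counts as a regression come from the comment.

```javascript
// Hypothetical comparison loop; inputs map benchmark name -> ns per iteration.
function compare(baseline, current, thresholdPct = 15) {
  const report = [];
  let regressions = 0;
  // Union of names, so baseline-only entries are checked too.
  const names = new Set([...Object.keys(baseline), ...Object.keys(current)]);

  for (const name of [...names].sort()) {
    const inBaseline = Object.hasOwn(baseline, name);
    const inCurrent = Object.hasOwn(current, name);
    let status;
    if (!inCurrent) {
      status = "MISSING"; // in the baseline but absent from this run
    } else if (!inBaseline) {
      status = "NEW"; // no baseline yet, nothing to compare
    } else {
      const changePct = ((current[name] - baseline[name]) / baseline[name]) * 100;
      status = changePct > thresholdPct ? "REGRESSION" : "OK";
    }
    if (status === "MISSING" || status === "REGRESSION") regressions += 1;
    report.push({ name, status });
  }
  return { report, regressions, passed: regressions === 0 };
}

// Benchmark "b" was renamed to "c": the gate fails instead of passing silently.
const result = compare({ a: 100, b: 200 }, { a: 105, c: 50 });
console.log(result.passed, result.report);
```

With this shape, removing the stale entry from the baseline (the resolution path named in the comment) turns the gate green again.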

flyingrobots · 2026-03-08T09:08:57Z

@coderabbitai review

coderabbitai · 2026-03-08T09:09:06Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

flyingrobots added 2 commits March 8, 2026 01:00

flyingrobots merged commit 971ba5a into main Mar 8, 2026
28 checks passed

flyingrobots deleted the fix/close-280-287 branch March 8, 2026 09:15

flyingrobots merged 3 commits into main from fix/close-280-287
