docs: update compare command references for N-way matrix mode#384

Merged
christso merged 2 commits into main from docs/compare-nway-matrix on Feb 26, 2026

@christso (Collaborator)

Summary

All documentation and skills previously referenced only the two-file pairwise workflow, `agentv compare a.jsonl b.jsonl`. They now present the N-way matrix as the primary workflow, with the --baseline, --candidate, and --targets flags.

Files updated:

  • README.md (root + CLI mirror): matrix output example, baseline/pairwise/two-file commands
  • docs/COMPARISON.md: CI/CD example with --baseline regression gate
  • examples/features/compare/README.md: N-way matrix + pairwise modes
  • examples/features/compare/evals/README.md: full output examples for all modes
  • examples/showcase/multi-model-benchmark/README.md: combined JSONL workflow, updated flow diagram
  • plugins/agentv-dev/skills/agentv-eval-builder/SKILL.md: compare command reference
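
For readers unfamiliar with the idea, the following is a minimal Python sketch of what an N-way comparison matrix with a baseline regression gate amounts to conceptually. It is not agentv's implementation: the record fields (`test_id`, `target`, `score`), the function names, and the sample data are all hypothetical, chosen only for illustration.

```python
import json
from collections import defaultdict


def build_matrix(lines):
    """Group scores as {test_id: {target: score}} from JSONL lines.

    The field names used here are hypothetical, not agentv's schema.
    """
    matrix = defaultdict(dict)
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        record = json.loads(line)
        matrix[record["test_id"]][record["target"]] = record["score"]
    return dict(matrix)


def regressions(matrix, baseline, tolerance=0.0):
    """List (test_id, target, delta) where a target scores below the baseline."""
    found = []
    for test_id, scores in sorted(matrix.items()):
        if baseline not in scores:
            continue  # nothing to compare against for this test
        for target, score in sorted(scores.items()):
            if target == baseline:
                continue
            delta = score - scores[baseline]
            if delta < -tolerance:
                found.append((test_id, target, round(delta, 4)))
    return found


# Hypothetical combined results: one record per (test, target) pair.
sample = [
    '{"test_id": "t1", "target": "model-a", "score": 0.9}',
    '{"test_id": "t1", "target": "model-b", "score": 0.7}',
    '{"test_id": "t1", "target": "model-c", "score": 0.95}',
    '{"test_id": "t2", "target": "model-a", "score": 0.5}',
    '{"test_id": "t2", "target": "model-b", "score": 0.5}',
]
matrix = build_matrix(sample)
print(regressions(matrix, baseline="model-a"))
```

A CI gate built on this idea would fail the job whenever the returned list is non-empty; the actual flags and output format are described in the updated docs listed above.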

Test plan

  • Docs-only change — no code modified
  • Pre-push hooks pass (build, typecheck, lint, test)
  • Output examples match actual agentv compare output from fixtures

Follows up #382, #383.

All references to `agentv compare` previously only documented two-file
pairwise mode. Updated to show the N-way matrix as the primary workflow
with --baseline, --candidate, and --targets flags.

Updated files:
- README.md (root + CLI): matrix output example, baseline/pairwise commands
- docs/COMPARISON.md: CI example with --baseline regression gate
- examples/features/compare/: N-way matrix + pairwise examples with output
- examples/showcase/multi-model-benchmark/: combined JSONL workflow
- plugins/agentv-dev/skills/agentv-eval-builder/: compare command reference

cloudflare-workers-and-pages Bot commented Feb 26, 2026

Deploying agentv with Cloudflare Pages

Latest commit: 0650078
Status: ✅  Deploy successful!
Preview URL: https://41cbee75.agentv.pages.dev
Branch Preview URL: https://docs-compare-nway-matrix.agentv.pages.dev

The evaluators block was renamed to assert in the eval YAML schema.
Update both code examples in COMPARISON.md to use the current syntax.
@christso merged commit 6066ea8 into main on Feb 26, 2026
1 check passed
@christso deleted the docs/compare-nway-matrix branch on February 26, 2026 02:55