test_runner: add --prompt mode driving the Firefox DevTools MCP by msujaws · Pull Request #2197 · mozilla/mozregression

msujaws · 2026-06-29T20:05:48Z

Add an agent-driven verdict mode, the natural-language equivalent of --command (like git bisect run). --prompt "<instruction>" shells out to the claude CLI in headless mode, pointed at the @mozilla/firefox-devtools-mcp server via a generated --mcp-config, to inspect each build and decide good/bad.

The MCP launches the build itself, so AgentTestRunner installs the build to obtain the binary path (passed via --firefox-path) without starting it. Verdicts are parsed as GOOD/BAD from the agent output.

Gating: the Firefox DevTools MCP supports Firefox 100+, so ranges that predate it are rejected. This happens both up front (resolved good/bad range in cli.validate) and per build (mozversion application_version), configurable via --prompt-min-version (default 100).

A pre-run check (Application.check_prerequisites) fails fast before bisecting if claude/npx are missing or if the instruction is not usable for a good/bad determination.

--prompt is mutually exclusive with --command and --launch. Adds --prompt-headless and --prompt-model. New UnsupportedVersionError.

msujaws · 2026-06-29T20:07:34Z

A sample command that can be used to demonstrate this is:

mozregression --prompt "Go to about:preferences and look for an AI controls menu item. Good if it is there, bad if it isn't." --find-fix

Add an agent-driven verdict mode, the natural-language equivalent of --command (like `git bisect run`). `--prompt "<instruction>"` shells out to the `claude` CLI in headless mode, pointed at the @mozilla/firefox-devtools-mcp server via a generated --mcp-config, to inspect each build and decide good/bad. The MCP launches the build itself, so AgentTestRunner installs the build to obtain the binary path (passed via --firefox-path) without starting it. Verdicts are parsed as GOOD/BAD from the agent output. Gating: the Firefox DevTools MCP supports Firefox 100+, so ranges that predate it are rejected. This happens both up front (resolved good/bad range in cli.validate) and per build (mozversion application_version), configurable via --prompt-min-version (default 100). A pre-run check (Application.check_prerequisites) fails fast before bisecting if `claude`/`npx` are missing or if the instruction is not usable for a good/bad determination. --prompt is mutually exclusive with --command and --launch. Adds --prompt-headless and --prompt-model. New UnsupportedVersionError.

msujaws force-pushed the prompt-devtools-mcp-verdict branch from c2624f9 to e5bfacd Compare June 29, 2026 20:50

marco-c mentioned this pull request Jun 30, 2026

Create hackbot agent to automatically try to find a regression range for regression bugs mozilla/bugbug#6259

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

test_runner: add --prompt mode driving the Firefox DevTools MCP#2197

test_runner: add --prompt mode driving the Firefox DevTools MCP#2197
msujaws wants to merge 1 commit into
mozilla:mainfrom
msujaws:prompt-devtools-mcp-verdict

msujaws commented Jun 29, 2026

Uh oh!

msujaws commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

msujaws commented Jun 29, 2026

Uh oh!

msujaws commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant