You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The Code Simplifier workflow has failed 10/10 consecutive scheduled runs on main (daily ~04:48 UTC, 2026-06-10 → 2026-06-19). No successful baseline run exists in that window.
In the latest run the agent executes substantial work (turns=49, 1.85M tokens, 160 firewall requests, 0 blocked) and then the Copilot CLI exits non-zero: [copilot-harness] attempt 1 failed: exitCode=1 failureClass=authentication_failed.
The provider returns HTTP 403: Authentication failed with provider at (172.30.0.30/redacted) (HTTP 403). Check your COPILOT_PROVIDER_API_KEY or COPILOT_PROVIDER_BEARER_TOKEN. The harness treats first-attempt auth failure as non-retryable (authentication failed — not retrying), so the run dies after one attempt with write_actions=0.
Representative (analyzed):§27806068277 — 2026-06-19 04:48 UTC. Failed step Execute GitHub Copilot CLI; failureClass=authentication_failed, HTTP 403 from 172.30.0.30:10002; turns=49, tokens=1,851,074.
Chronic comparators (all failure, main, schedule): 27737401463 (06-18), 27666488465 (06-17), 27594887412 (06-16), 27524707602 (06-15), 27488668377 (06-14), 27456907583 (06-13), 27395179213 (06-12) — workflow is red every day with no successful baseline.
Probable root cause
The Copilot BYOK api-proxy (172.30.0.30:10002) rejects the request with HTTP 403 part-way through the session (the agent had already produced output, hasOutput=true, after 6m25s). This indicates the bearer token / API key presented to the proxy is invalid, expired, or revoked for this workflow — not a model-resolution or quota error (isCAPIError400=false isCAPIQuotaExceededError=false isModelNotSupportedError=false).
The non-retryable first-attempt auth policy means a mid-session 403 (e.g. token TTL expiring during a long 49-turn run) is fatal with no recovery.
Proposed remediation
Verify COPILOT_PROVIDER_API_KEY / COPILOT_PROVIDER_BEARER_TOKEN provisioning for Code Simplifier's scheduled main runs — confirm the credential is present, valid, and not expiring within the run's wall-clock (this run ran 6m25s before the 403).
If the token has a short TTL, refresh/re-mint it before long-running agent sessions, or allow a single token-refresh-and-retry on a mid-session 403 instead of the blanket non-retryable path.
Capture the BYOK proxy's 403 response body into the failure annotation so credential-vs-policy 403s are distinguishable (the harness currently surfaces only the generic Check your COPILOT_PROVIDER_* hint).
Success criteria / verification
A subsequent scheduled Code Simplifier run on main completes the agent step without an authentication_failed HTTP 403.
Code Simplifier produces at least one successful (or correctly-classified, non-auth) run on main within 48h.
A mid-session provider 403 is either recovered via token refresh or surfaced as a distinct classified failure with the proxy response body attached.
AI Moderator§27806212006 failed at Checkout PR branch with Refusing PR checkout: actor 'Jobayer-Q1' has 'read' permission (requires write or higher) — this is the security guard working as intended (external issue_comment actor without write access), not a defect.
Parent report: #39883.
Problem statement
Code Simplifierworkflow has failed 10/10 consecutive scheduled runs onmain(daily ~04:48 UTC, 2026-06-10 → 2026-06-19). No successful baseline run exists in that window.[copilot-harness] attempt 1 failed: exitCode=1 failureClass=authentication_failed.Authentication failed with provider at (172.30.0.30/redacted) (HTTP 403). Check your COPILOT_PROVIDER_API_KEY or COPILOT_PROVIDER_BEARER_TOKEN.The harness treats first-attempt auth failure as non-retryable (authentication failed — not retrying), so the run dies after one attempt withwrite_actions=0.Affected workflow and run IDs
Code Simplifier(.github/workflows/code-simplifier.lock.yml), enginecopilot(GitHub Copilot CLI, BYOK/offline mode), triggerschedule, branchmain.Execute GitHub Copilot CLI;failureClass=authentication_failed, HTTP 403 from172.30.0.30:10002; turns=49, tokens=1,851,074.failure,main,schedule): 27737401463 (06-18), 27666488465 (06-17), 27594887412 (06-16), 27524707602 (06-15), 27488668377 (06-14), 27456907583 (06-13), 27395179213 (06-12) — workflow is red every day with no successful baseline.Probable root cause
172.30.0.30:10002) rejects the request with HTTP 403 part-way through the session (the agent had already produced output,hasOutput=true, after 6m25s). This indicates the bearer token / API key presented to the proxy is invalid, expired, or revoked for this workflow — not a model-resolution or quota error (isCAPIError400=false isCAPIQuotaExceededError=false isModelNotSupportedError=false).Proposed remediation
COPILOT_PROVIDER_API_KEY/COPILOT_PROVIDER_BEARER_TOKENprovisioning forCode Simplifier's scheduledmainruns — confirm the credential is present, valid, and not expiring within the run's wall-clock (this run ran 6m25s before the 403).Check your COPILOT_PROVIDER_*hint).Success criteria / verification
Code Simplifierrun onmaincompletes the agent step without anauthentication_failedHTTP 403.Code Simplifierproduces at least one successful (or correctly-classified, non-auth) run onmainwithin 48h.Correlation note
ERR_CONFIG/ turns=2) — already covered by [aw-failures] [aw] Avenger agent job fails at "Parse agent logs" — ERR_CONFIG "no structured log entries" despite successful age [Content truncated due to length] #40145; not re-filed.Checkout PR branchwithRefusing PR checkout: actor 'Jobayer-Q1' has 'read' permission (requires write or higher)— this is the security guard working as intended (externalissue_commentactor without write access), not a defect.node not available inside AWF chroot) occurred on aworkflow_dispatchof dev branchcopilot/add-share-agentic-workflow(firewall v0.27.2, a stale lock predating the [aw-failures] [aw] Copilot CLI exits 127 — node missing inside AWF chroot (chronic Daily News failure on main) #40074 fix); scheduledmainDaily News runs the same day succeeded. Not a production regression — recompile the branch. [aw-failures] [aw] Copilot CLI exits 127 — node missing inside AWF chroot (chronic Daily News failure on main) #40074 stays closed.cancelled(concurrency/guard); Smoke Copilot ×3safe_outputsfailures on PR/test branches (unresolvable targets, per [aw-failures] [aw] Copilot CLI exits 1 with no classified error — chronic Daily Model Inventory failures #39946).References:
Related to [aw-failures] [aw] Failure Investigation Report — 6h window (2026-06-17 19:34 UTC) #39883