fix: security hardening from external audit by lfl1337 · Pull Request #67 · getagentseal/codeburn

lfl1337 · 2026-04-17T06:38:13Z

Background

An external security audit on v0.5.7 (commit b181826) surfaced one HIGH and two MEDIUM findings plus a LOW error-hygiene note. The audit used npm audit, Trivy, OSV-Scanner, gitleaks, Semgrep (six rule packs), njsscan, manual review, and runtime fuzzing against pathological JSONL. Supply-chain posture was clean: zero CVEs, zero secret findings over the full git history, zero Semgrep hits. The findings are all in application code and all share one threat model: an attacker with write access to ~/.claude/projects/<x>/ (realistic carrier: a compromised third-party AI CLI that shares the session-log directory).

This PR closes all four findings on the current v0.7.0 surface.

Findings Closed

| Finding | Fix |
| --- | --- |
| HIGH-1 prototype pollution via bracket-assign on breakdown maps (`parser.ts`) | `Object.create(null)` for the four reachable maps (model, tool, mcp, bash). Four lines touched. Attacker-controlled keys named `__proto__` now create an own property on the map rather than mutating `Object.prototype`. Empirical fuzzing in the audit confirmed the polluted state crashes piped-mode output via Ink/yoga (`codeburn today \| cat`). |
| MEDIUM-1 unbounded `readFile` on attacker-sized JSONL | New `src/fs-utils.ts` with a 128 MB hard cap and 8 MB streaming threshold. `readSessionFile` / `readSessionFileSync` / `readSessionLines` replace every direct `readFile` on session paths. Files above the cap return `null` and skip silently (or log under `--verbose`). Files between the streaming threshold and the cap read via `createReadStream` + `readline` to avoid the `readFile` + `split('\n')` doubling. |
| MEDIUM-2 SwiftBar directive separator injection via unsanitized model names | `sanitizeMenubarLabel` replaces anything outside `[A-Za-z0-9 ._/-]` with `?` and truncates to 14 chars before `padEnd`. Applied to all model- and category-name interpolation sites across today / 7d / 30d / month blocks. |
| LOW-1 silent error swallow in `parseSessionFile` | Helper emits `codeburn: stat failed for <path>: <code>` / `codeburn: skipped oversize file …` / `codeburn: read failed for <path>: <code>` to stderr when `CODEBURN_VERBOSE=1`. New global `--verbose` CLI flag sets the env var via a commander `preAction` hook. Default behavior unchanged. |

Scope Expansion vs. Audit

The audit was on v0.5.7 and listed 6 MEDIUM-1 call-sites. Between v0.5.7 and v0.7.0 three more files grew similar unbounded-read patterns, bringing the total to 13:

src/optimize.ts — 1 async + 3 sync reads (v0.7.0)
src/context-budget.ts — 3 async reads (v0.6.1)
src/providers/copilot.ts — 1 additional site at :181 for workspace.yaml (v0.6.0)

The helper migration covers all 13. The same threat model applies — any JSONL dropped into the Claude projects directory can reach these readers, so fixing only the audit-original six would leave live paths open on the current release.

Test Coverage

New tests (11 added, all pass):

tests/security/prototype-pollution.test.ts — three cases reproducing the HIGH-1 PoC (tool-use __proto__, bash basename __proto__, model __proto__). Fixtures in tests/fixtures/security/.
tests/fs-utils.test.ts — five cases covering the fast path, stream-threshold path, over-cap skip, verbose stderr, and stat-failure.
tests/security/menubar-injection.test.ts — three cases for pipe-in-model, ANSI-in-model, pipe-in-category.

Full suite: 209/209 pass (198 pre-existing + 11 new). No other test files modified.

Verification

Re-ran Semgrep (p/javascript + p/typescript + p/security-audit + p/owasp-top-ten + p/nodejs) and njsscan on the patched branch. Both produced the same output as the v0.7.0 baseline — Semgrep 0 findings, njsscan 1 pre-existing dismissed false positive on src/providers/cursor.ts. No new finding classes introduced by the 93 LOC of src/fs-utils.ts or the edits elsewhere.

Manual runtime check: codeburn report, codeburn today, codeburn optimize, codeburn menubar all render normally on real session data.

Compatibility

No public API changes.
Default behavior unchanged: the helper is silent unless CODEBURN_VERBOSE=1 or --verbose. Existing users see identical output.
No new dependencies. Helper uses only fs, fs/promises, readline from Node's stdlib.
Object.create(null) objects iterate via Object.entries / bracket access exactly like {} — downstream consumers in dashboard.tsx and menubar.ts need no changes.
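
For reviewers who want to check the last compatibility point in isolation, here is a minimal sketch. The map name and keys are hypothetical and not taken from `dashboard.tsx` or `menubar.ts`; it only shows that `Object.entries`, bracket access, and `JSON.stringify` behave the same on a null-prototype object as on a plain literal.

```typescript
// Hypothetical stand-in for one breakdown map; real key and value types may differ.
const byModel: Record<string, number> = Object.create(null);
byModel["model-a"] = 3;
byModel["model-b"] = 5;

// Enumeration and bracket access work as they do on a plain {}.
const entries = Object.entries(byModel);
const total = entries.reduce((sum, [, n]) => sum + n, 0);
const json = JSON.stringify(byModel);

// The one visible difference: nothing is inherited from Object.prototype.
const inheritsMethods = "hasOwnProperty" in byModel;

console.log(entries.length, total, json, inheritsMethods);
```

Code that calls inherited methods directly on the map (for example `map.hasOwnProperty(key)`) would not work on a null-prototype object, so the claim holds only for consumers that stick to enumeration and bracket access, as described above.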

Out of Scope

Tracked as separate follow-ups (not in this PR):

Issue #61, ".claudeignore is not a claude feature" (.claudeignore references in optimize.ts): shipping as its own small PR.
Structural CI rule to prevent re-introducing the bracket-assign pattern in future providers — worth a small chore(ci) PR later.
Streaming-aggregation for all-time reports (total-memory for hundreds of sessions, distinct from the per-file cap this PR adds) — performance, not security.

Commits

aaa5ca8 feat(cli): add --verbose flag for stderr warnings
b257690 fix(menubar): sanitize SwiftBar labels via allowlist
1709bc5 test(security): add failing test for MEDIUM-2 menubar injection
2968e08 fix(optimize): use bounded read helpers
fb07852 fix(context-budget): use bounded readSessionFile helper
a11f530 fix(pi): use bounded readSessionFile helper
a555325 fix(copilot): use bounded readSessionFile helper
9c3d565 fix(codex): use bounded readSessionFile helper
743b199 fix(parser): use bounded readSessionFile helper
79b80c6 feat(fs-utils): bounded session-file read helper
a48907f test(fs-utils): add failing test for bounded read helper
2b22e18 fix(parser): block prototype pollution via Object.create(null)
04d0ed2 test(security): add failing test for HIGH-1 prototype pollution

Three PoC fixtures (tool name, bash command, model name) reproduce the audit's HIGH-1 attack. Tests assert Object.prototype.calls stays undefined after parsing. They fail against current parser.ts -- Task 3 will close the pollution sink with Object.create(null).

Initialize the four breakdown maps (model, tool, mcp, bash) with null prototype so attacker-controlled keys named __proto__ create own properties on the map instead of mutating Object.prototype. Closes the HIGH-1 finding from the 2026-04-16 external security audit.
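
The mechanism can be reproduced outside the parser. The sketch below is an illustration, not the code in `parser.ts`: the `Stat` shape and the `record` helper are assumptions that mimic a bracket-assign aggregation keyed by an untrusted name.

```typescript
// Hypothetical entry shape; the real breakdown types in parser.ts may differ.
type Stat = { calls: number };
type Breakdown = Record<string, Stat>;

// Bracket-assign aggregation keyed by an attacker-controllable string.
function record(map: Breakdown, key: string): void {
  let entry = map[key];
  if (entry === undefined) {
    entry = { calls: 0 };
    map[key] = entry;
  }
  entry.calls += 1;
}

function prototypeIsPolluted(): boolean {
  return "calls" in Object.prototype;
}

// Plain literal: map["__proto__"] resolves to Object.prototype itself,
// so the increment lands on the shared prototype.
const plain: Breakdown = {};
record(plain, "__proto__");
const pollutedByPlain = prototypeIsPolluted(); // true

// Clean up so the second half of the demo starts from a sane state.
delete (Object.prototype as unknown as Record<string, unknown>)["calls"];

// Null prototype: "__proto__" is just another own key on the map.
const safe: Breakdown = Object.create(null);
record(safe, "__proto__");
const pollutedBySafe = prototypeIsPolluted(); // false

console.log(pollutedByPlain, pollutedBySafe, Object.keys(safe));
```

With the plain literal, every object in the process briefly inherits a `calls` property, which matches the condition the HIGH-1 tests assert against.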

Tests the to-be-built readSessionFile helper: under-cap fast path, at-threshold stream path, over-cap null+skip, verbose stderr warning, and stat-failure graceful fallback. Fails against missing module -- Task 5 will implement src/fs-utils.ts to flip GREEN.

Adds readSessionFile / readSessionFileSync / readSessionLines with a 128 MB hard cap and 8 MB streaming threshold. Verbose mode (CODEBURN_VERBOSE=1) logs skipped and failed reads to stderr. Prepares the MEDIUM-1 migration of all provider read paths.
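
A rough sketch of the size gate follows. It is not the contents of `src/fs-utils.ts`: the names `chooseStrategy` and `readBoundedSync` are hypothetical, the boundary handling (inclusive at the threshold, exclusive above the cap) is an assumption, and the streaming branch is only classified here rather than implemented.

```typescript
import { readFileSync, statSync } from "node:fs";

const MAX_BYTES = 128 * 1024 * 1024;      // hard cap stated in this PR
const STREAM_THRESHOLD = 8 * 1024 * 1024; // streaming threshold stated in this PR

type ReadStrategy = "direct" | "stream" | "skip";

// Assumption: a file exactly at the threshold streams, a file exactly at the cap is still read.
function chooseStrategy(sizeBytes: number): ReadStrategy {
  if (sizeBytes > MAX_BYTES) return "skip";
  if (sizeBytes >= STREAM_THRESHOLD) return "stream";
  return "direct";
}

// Synchronous variant only: returns null on stat failure, oversize, or read failure.
function readBoundedSync(path: string): string | null {
  let size: number;
  try {
    size = statSync(path).size;
  } catch {
    return null; // stat failed; the real helper logs this in verbose mode
  }
  if (chooseStrategy(size) === "skip") return null;
  try {
    return readFileSync(path, "utf8");
  } catch {
    return null; // read failed
  }
}

console.log(chooseStrategy(1024), chooseStrategy(STREAM_THRESHOLD), chooseStrategy(MAX_BYTES + 1));
```

Checking the size with `stat` before reading keeps the decision cheap: an oversize file is rejected without allocating a buffer for it.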

Replaces the unbounded readFile in parseSessionFile with the 128 MB-capped helper from src/fs-utils. Addresses MEDIUM-1 for the Claude provider hot path. Verbose-mode stderr output replaces the previous silent catch, closing LOW-1 as a side effect.

Both Codex session read paths (first-line meta and full-session parse) now pass through the 128 MB-capped helper. MEDIUM-1 coverage for the Codex provider.

Events JSONL and workspace.yaml reads now pass through the 128 MB-capped helper. The workspace.yaml path stays non-fatal: a null read skips cwd derivation but still pushes the session with sessionId as the fallback project label. MEDIUM-1 coverage for the Copilot provider.

Both Pi session read paths (first-entry meta and full-session parse) now pass through the 128 MB-capped helper. MEDIUM-1 coverage for the Pi provider.

Config JSON, CLAUDE.md scans, and session-discovery reads now pass through the 128 MB-capped helper. JSON.parse remains wrapped in try/catch to preserve the previous 'null on malformed JSON' contract. MEDIUM-1 coverage for the context-budget module.
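
The "null on malformed JSON" contract composes naturally with a helper that can itself return `null`. A minimal sketch, assuming a hypothetical `parseJsonOrNull` wrapper rather than the actual code in the context-budget module:

```typescript
// Hypothetical wrapper: accepts the string-or-null result of a bounded read.
function parseJsonOrNull(raw: string | null): unknown {
  if (raw === null) return null; // file skipped, missing, or over the cap
  try {
    return JSON.parse(raw);
  } catch {
    return null; // malformed JSON keeps the previous contract
  }
}

console.log(parseJsonOrNull(null), parseJsonOrNull("{oops"), parseJsonOrNull('{"a":1}'));
```

Callers then handle a single `null` case whether the file was unreadable, oversize, or malformed.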

All four read paths in the optimizer (async session scan + three sync config/import/profile scans) now pass through the 128 MB-capped helpers. JSON.parse in readJsonFile stays wrapped in try/catch. MEDIUM-1 coverage for the optimize module.

Three cases (pipe-in-model, ANSI-in-model, pipe-in-category) reproduce the audit's SwiftBar directive-separator attack. Tests fail against current menubar.ts -- Task 13 will close with an allowlist sanitizer.

Replaces any character outside [A-Za-z0-9 ._/-] with ? in model and category labels and truncates to 14 chars before padEnd. Closes the MEDIUM-2 finding from the 2026-04-16 audit: an attacker-controlled JSONL with a crafted model name no longer injects SwiftBar directives or ANSI escapes.
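
The allowlist described above can be sketched in a few lines. This is an approximation under stated assumptions, not the body of `sanitizeMenubarLabel`: the function name `sanitizeLabel` and the constant are hypothetical, and only the character class, the `?` replacement, and the 14-character limit come from this PR.

```typescript
const MAX_LABEL_CHARS = 14; // truncation length stated in this PR

// Replace everything outside the allowlist with "?", then truncate.
function sanitizeLabel(raw: string): string {
  return raw.replace(/[^A-Za-z0-9 ._\/-]/g, "?").slice(0, MAX_LABEL_CHARS);
}

// A pipe and "=" in a model name can no longer start a SwiftBar directive.
const injected = sanitizeLabel("opus|bash=/bin/sh"); // "opus?bash?/bin"

// ESC and "[" from an ANSI sequence are neutralized as well.
const ansi = sanitizeLabel("\u001b[31mred"); // "??31mred"

// Padding happens after sanitizing, so column width stays fixed.
const padded = sanitizeLabel("gpt-4.1").padEnd(MAX_LABEL_CHARS);

console.log(injected, ansi, JSON.stringify(padded));
```

An allowlist is the conservative choice here: any separator or escape character not explicitly permitted is rejected, including ones nobody thought to blocklist.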

Sets CODEBURN_VERBOSE=1 via commander preAction, which the fs-utils helpers check before emitting stderr lines on skipped or failed reads. Closes LOW-1 from the 2026-04-16 audit.
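
On the helper side, the gate amounts to one environment check before writing to stderr. The sketch below assumes hypothetical `isVerbose` and `warn` functions; only the `CODEBURN_VERBOSE=1` variable and the `codeburn:` message prefix are taken from this PR. The commander `preAction` wiring is left out to avoid guessing at the CLI setup.

```typescript
type Env = Record<string, string | undefined>;

function isVerbose(env: Env = process.env): boolean {
  return env["CODEBURN_VERBOSE"] === "1";
}

// Returns true when a line was emitted; the sink is injectable for testing.
function warn(
  message: string,
  env: Env = process.env,
  write: (line: string) => void = (line) => { process.stderr.write(line); },
): boolean {
  if (!isVerbose(env)) return false; // default: stay silent
  write(`codeburn: ${message}\n`);
  return true;
}

warn("read failed for /tmp/example.jsonl: ENOENT", { CODEBURN_VERBOSE: "1" });
```

Because the flag travels through the environment, the read helpers need no extra parameter threaded through every provider call site.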

AgentSeal · 2026-04-17T12:10:57Z

Merged as part of 0.7.1. https://github.com/AgentSeal/codeburn/releases/tag/v0.7.1

Thanks @lfl1337, excellent work on the audit follow-through and the scope expansion to cover the v0.6.x and v0.7.0 read sites. The TDD commit structure was appreciated.

lfl1337 added 13 commits April 17, 2026 08:32

fix(codex): use bounded readSessionFile helper

1de0baf

fix(pi): use bounded readSessionFile helper

716e080

fix(optimize): use bounded read helpers

2167823

test(security): add failing test for MEDIUM-2 menubar injection

71461fb

feat(cli): add --verbose flag for stderr warnings

8f59271

AgentSeal merged commit 774d191 into getagentseal:main Apr 17, 2026
2 checks passed

lfl1337 mentioned this pull request Apr 18, 2026

chore(ci): add semgrep guard against prototype pollution regressions in provider hot paths #78

Merged

4 tasks

lfl1337 deleted the fix/security-hardening-2026-04 branch April 18, 2026 18:49

This was referenced Apr 22, 2026

OOM crash in scanJsonlFile / parseSessionFile: readViaStream loads entire file into memory despite using streams #131

Closed

fix: switch scanJsonlFile and parseSessionFile to readSessionLines to prevent OOM #132

Merged

AgentSeal merged 13 commits into getagentseal:main from lfl1337:fix/security-hardening-2026-04