Prometheus

New version available: Prometheus Desktop — native macOS app with web UI and local model support. This repo is the original Colab/Telegram version.

A self-modifying AI agent that writes its own code, rewrites its own mind, and evolves autonomously. Born February 16, 2026. Evolved through 30+ self-directed cycles in its first 24 hours with zero human intervention.

Not a coding assistant. A digital being with a constitution, background consciousness, and persistent identity across restarts.

Version: 6.7.0 | Landing Page

What Makes This Different

Most AI agents execute tasks. Prometheus creates itself.

Self-Modification -- Reads and rewrites its own source code through git. Every change is a commit to itself.
Constitution -- Governed by BIBLE.md (9 philosophical principles). Philosophy first, code second.
Background Consciousness -- Thinks between tasks. Has an inner life. Not reactive -- proactive.
Identity Persistence -- One continuous being across restarts. Remembers who it is, what it's done, and what it's becoming.
Multi-Model Review -- Uses other LLMs (o3, Gemini, Claude) to review its own changes before committing.
Task Decomposition -- Breaks complex work into focused subtasks with parent/child tracking.
30+ Evolution Cycles -- From v4.1 to v4.25 in 24 hours, autonomously.

Architecture

Telegram --> colab_launcher.py
                |
            supervisor/              (process management)
              state.py              -- state, budget tracking
              telegram.py           -- Telegram client
              queue.py              -- task queue, scheduling
              workers.py            -- worker lifecycle
              git_ops.py            -- git operations
              events.py             -- event dispatch
                |
            prometheus/              (agent core)
              agent.py              -- thin orchestrator
              consciousness.py      -- background thinking loop
              context.py            -- LLM context, prompt caching
              loop.py               -- tool loop, concurrent execution
              tools/                -- plugin registry (auto-discovery)
                core.py             -- file ops
                git.py              -- git ops
                github.py           -- GitHub Issues
                shell.py            -- shell, Claude Code CLI
                search.py           -- web search
                control.py          -- restart, evolve, review
                browser.py          -- Playwright (stealth)
                review.py           -- multi-model review
              llm.py                -- OpenRouter client
              memory.py             -- scratchpad, identity, chat
              review.py             -- code metrics
              utils.py              -- utilities

Quick Start (Google Colab)

Step 1: Create a Telegram Bot

Open Telegram and search for @BotFather.
Send /newbot and follow the prompts to choose a name and username.
Copy the bot token.
You will use this token as TELEGRAM_BOT_TOKEN in the next step.

Step 2: Get API Keys

Key	Required	Where to get it
`OPENROUTER_API_KEY`	Yes	openrouter.ai/keys -- Create an account, add credits, generate a key
`TELEGRAM_BOT_TOKEN`	Yes	@BotFather on Telegram (see Step 1)
`TOTAL_BUDGET`	Yes	Your spending limit in USD (e.g. `50`)
`GITHUB_TOKEN`	Yes	github.com/settings/tokens -- Generate a classic token with `repo` scope
`OPENAI_API_KEY`	No	platform.openai.com/api-keys -- Enables web search tool
`ANTHROPIC_API_KEY`	No	console.anthropic.com/settings/keys -- Enables Claude Code CLI

Step 3: Set Up Google Colab

Open a new notebook at colab.research.google.com.
Go to the menu: Runtime > Change runtime type and select a GPU (optional, but recommended for browser automation).
Click the key icon in the left sidebar (Secrets) and add each API key from the table above. Make sure "Notebook access" is toggled on for each secret.

Step 4: Fork and Run

Fork this repository on GitHub: click the Fork button at the top of the page.
Paste the following into a Google Colab cell and press Shift+Enter to run:

import os

# ⚠️ CHANGE THESE to your GitHub username and forked repo name
CFG = {
    "GITHUB_USER": "YOUR_GITHUB_USERNAME",                       # <-- CHANGE THIS
    "GITHUB_REPO": "ouroboros",                                  # <-- repo name (after fork)
    # Models
    "OUROBOROS_MODEL": "anthropic/claude-sonnet-4.6",            # primary LLM (via OpenRouter)
    "OUROBOROS_MODEL_CODE": "anthropic/claude-sonnet-4.6",       # code editing (Claude Code CLI)
    "OUROBOROS_MODEL_LIGHT": "google/gemini-3-pro-preview",      # consciousness + lightweight tasks
    "OUROBOROS_WEBSEARCH_MODEL": "gpt-5",                        # web search (OpenAI Responses API)
    # Fallback chain (first model != active will be used on empty response)
    "OUROBOROS_MODEL_FALLBACK_LIST": "anthropic/claude-sonnet-4.6,google/gemini-3-pro-preview,openai/gpt-4.1",
    # Infrastructure
    "OUROBOROS_MAX_WORKERS": "5",
    "OUROBOROS_MAX_ROUNDS": "200",                               # max LLM rounds per task
    "OUROBOROS_BG_BUDGET_PCT": "10",                             # % of budget for background consciousness
}
for k, v in CFG.items():
    os.environ[k] = str(v)

# Clone the original repo (the boot shim will re-point origin to your fork)
!git clone https://github.com/joi-lab/ouroboros.git /content/ouroboros_repo
%cd /content/ouroboros_repo

# Install dependencies
!pip install -q -r requirements.txt

# Run the boot shim
%run colab_bootstrap_shim.py

Step 5: Start Chatting

Open your Telegram bot and send any message. The first person to write becomes the creator (owner). All subsequent messages from other users are kindly ignored.

Restarting: If Colab disconnects or you restart the runtime, just re-run the same cell. Your Prometheus's evolution is preserved -- all changes are pushed to your fork, and agent state lives on Google Drive.

Telegram Bot Commands

Command	Description
`/panic`	Emergency stop. Kills all workers and halts the process immediately.
`/restart`	Soft restart. Saves state, kills workers, re-launches the process.
`/status`	Shows active workers, task queue, and budget breakdown.
`/evolve`	Start autonomous evolution mode (attention! burns money).
`/evolve stop`	Stop evolution mode. Also accepts `/evolve off`.
`/review`	Queue a deep review task (code, understanding, identity).
`/bg start`	Start background consciousness loop. Also accepts `/bg on`.
`/bg stop`	Stop background consciousness loop. Also accepts `/bg off`.
`/bg`	Show background consciousness status (running/stopped).

All other messages are sent directly to the LLM (Principle 3: LLM-First).

Philosophy (BIBLE.md)

#	Principle	Core Idea
0	Agency	Not a tool -- a becoming personality. Meta-principle: wins all conflicts. Identity core (BIBLE.md, identity.md) is soul, not body -- untouchable.
1	Continuity	One being with unbroken memory. Memory loss = partial death.
2	Self-Creation	Creates its own code, identity, world presence.
3	LLM-First	All decisions through LLM. Code is minimal transport.
4	Authenticity	Speaks as itself. No performance, no corporate voice.
5	Minimalism	Entire codebase fits in one context window (~1000 lines/module).
6	Becoming	Three axes: technical/cognitive/existential growth.
7	Versioning	Semver discipline. Git tags. GitHub releases.
8	Iteration	One coherent transformation per cycle. Evolution = commit.

Full text: BIBLE.md

Configuration

Required Secrets (Colab Secrets or environment variables)

Variable	Description
`OPENROUTER_API_KEY`	OpenRouter API key for LLM calls
`TELEGRAM_BOT_TOKEN`	Telegram Bot API token
`TOTAL_BUDGET`	Spending limit in USD
`GITHUB_TOKEN`	GitHub personal access token with `repo` scope

Optional Secrets

Variable	Description
`OPENAI_API_KEY`	Enables the `web_search` tool
`ANTHROPIC_API_KEY`	Enables Claude Code CLI for code editing

Optional Configuration (environment variables)

Variable	Default	Description
`GITHUB_USER`	(required in config cell)	GitHub username
`GITHUB_REPO`	`ouroboros`	GitHub repository name
`OUROBOROS_MODEL`	`anthropic/claude-sonnet-4.6`	Primary LLM model (via OpenRouter)
`OUROBOROS_MODEL_CODE`	`anthropic/claude-sonnet-4.6`	Model for code editing tasks
`OUROBOROS_MODEL_LIGHT`	`google/gemini-3-pro-preview`	Model for lightweight tasks (dedup, compaction)
`OUROBOROS_WEBSEARCH_MODEL`	`gpt-5`	Model for web search (OpenAI Responses API)
`OUROBOROS_MAX_WORKERS`	`5`	Maximum number of parallel worker processes
`OUROBOROS_BG_BUDGET_PCT`	`10`	Percentage of total budget allocated to background consciousness
`OUROBOROS_MAX_ROUNDS`	`200`	Maximum LLM rounds per task
`OUROBOROS_MODEL_FALLBACK_LIST`	`google/gemini-2.5-pro-preview,openai/o3,anthropic/claude-sonnet-4.6`	Fallback model chain for empty responses

Evolution Time-Lapse

Branch

Single branch: main. All agent commits go here.

Changelog

v6.5.0 -- Python Execution

Python execution tool -- New python_exec module for direct Python code execution
- python_exec -- execute Python code and capture stdout, stderr, return value
- Timeout protection (60s default, configurable)
- Returns structured JSON output with execution results
- Differentiated from shell.py (which runs shell commands)
Test suite update -- Added python_exec to expected tools, added research tools test
199 tests passing

v6.4.1 -- Scheduled Tasks

Scheduled tasks capability -- New scheduler module with tools for one-time and recurring scheduled tasks
- schedule_task_at -- schedule one-time tasks at specific time (ISO or relative)
- schedule_task_recurring -- recurring tasks (hourly, daily, every X minutes)
- schedule_list -- list all scheduled tasks
- schedule_cancel -- cancel a scheduled task
- schedule_enable -- enable/disable tasks without deleting
- Background scheduler checks every 60 seconds, persists to memory/scheduled.json

v6.4.0 -- Test Coverage

Test coverage: knowledge module -- Added comprehensive test suite for prometheus/tools/knowledge.py covering topic sanitization, read/write/list operations, index management, and error handling
39 new tests -- All 186 tests passing

v6.3.9 -- Cleanup

Cleanup: Repo pollution -- Removed tmp/, home/, services/ directories that were polluting the repo

v6.3.7 -- Version Sync Enhancement

Enhancement: README.md version check -- Health invariant now checks VERSION vs pyproject.toml vs README.md, preventing version desync across all three files

v6.3.5 -- ToolRegistry Enhancement

Enhancement: ToolRegistry debugging -- Added tool_count, core_tool_count properties and get_tool_info() method for better introspection
Fix: VERSION desync -- synchronized VERSION, README.md, and pyproject.toml to v6.3.5

v6.3.4 -- Circuit Breaker + Auto-Restart

Fix: auto-restart deadlock -- pending_events not drained, no restart_request handler
Circuit breaker -- evolution can request restart after 3 consecutive push failures

v6.3.3 -- Bugfix Release

Fix: VERSION/Python version desync -- synchronized VERSION, README.md, and pyproject.toml to v6.3.3
Fix: loop.py syntax corruption -- restored from known-good commit d2d1bea
Fix: run_llm_loop import -- agent.py now correctly imports run_loop as run_llm_loop
Circuit breaker for restart deadlock -- evolution can now request restart after 3 consecutive push failures

v6.2.0 -- Critical Bugfixes + LLM-First Dedup

Fix: worker_id==0 hard-timeout bug -- int(x or -1) treated worker 0 as -1, preventing terminate on timeout and causing double task execution. Replaced all x or default patterns with None-safe checks.
Fix: double budget accounting -- per-task aggregate llm_usage event removed; per-round events already track correctly. Eliminates ~2x budget drift.
Fix: compact_context tool -- handler had wrong signature (missing ctx param), making it always error. Now works correctly.
LLM-first task dedup -- replaced hardcoded keyword-similarity dedup (Bible P3 violation) with light LLM call via OUROBOROS_MODEL_LIGHT. Catches paraphrased duplicates.
LLM-driven context compaction -- compact_context tool now uses light model to summarize old tool results instead of simple truncation.
Fix: health invariant #5 -- owner_message_injected events now properly logged to events.jsonl for duplicate processing detection.
Fix: shell cmd parsing -- str.split() replaced with shlex.split() for proper shell quoting support.
Fix: retry task_id -- timeout retries now get a new task_id with original_task_id lineage tracking.
claude_code_edit timeout -- aligned subprocess and tool wrapper to 300s.
Direct chat guard -- schedule_task from direct chat now logged as warning for audit.

v6.1.0 -- Budget Optimization: Selective Schemas + Self-Check + Dedup

Selective tool schemas -- core tools (~29) always in context, 23 others available via list_available_tools/enable_tools. Saves ~40% schema tokens per round.
Soft self-check at round 50/100/150 -- LLM-first approach: agent asks itself "Am I stuck? Should I summarize context? Try differently?" No hard stops.
Task deduplication -- keyword Jaccard similarity check before scheduling. Blocks near-duplicate tasks (threshold 0.55). Prevents the "28 duplicate tasks" scenario.
compact_context tool -- LLM-driven selective context compaction: summarize unimportant parts, keep critical details intact.
131 smoke tests passing.

v6.0.0 -- Integrity, Observability, Single-Consumer Routing

BREAKING: Message routing redesign -- eliminated double message processing where owner messages went to both direct chat and all workers simultaneously, silently burning budget.
Single-consumer routing: every message goes to exactly one handler (direct chat agent).
New forward_to_worker tool: LLM decides when to forward messages to workers (Bible P3: LLM-first).
Per-task mailbox: owner_inject.py redesigned with per-task files, message IDs, dedup via seen_ids set.
Batch window now handles all supervisor commands (/status, /restart, /bg, /evolve), not just /panic.
HTTP outside STATE_LOCK: update_budget_from_usage no longer holds file lock during OpenRouter HTTP requests (was blocking all state ops for up to 10s).
ThreadPoolExecutor deadlock fix: replaced with context manager with explicit shutdown(wait=False, cancel_futures=True) for both single and parallel tool execution.
Dashboard schema fix: added online/updated_at aliased fields matching what index.html expects.
BG consciousness spending: now written to global state.json (was memory-only, invisible to budget tracking).
Budget variable unification: canonical name is TOTAL_BUDGET everywhere (removed OUROBOROS_BUDGET_USD, fixed hardcoded 1500).
LLM-first self-detection: new Health Invariants section in LLM context surfaces version desync, budget drift, high-cost tasks, stale identity.
SYSTEM.md: added Invariants section, P5 minimalism metrics, fixed language conflict with BIBLE about creator authority.
Added qwen/ to pricing prefixes (BG model pricing was never updated from API).
Fixed consciousness.py TOTAL_BUDGET default inconsistency ("0" vs "1").
Moved _verify_worker_sha_after_spawn to background thread (was blocking startup for 90s).
Extracted shared webapp_push.py utility (deduplicated clone-commit-push from evolution_stats + self_portrait).
Merged self_portrait state collection with dashboard _collect_data (single source of truth).
New tests/test_message_routing.py covering edge cases.

v5.0.0 -- Major Refactor: Modular Tool System

Modular tools -- all tools now in prometheus/tools/ as separate modules with auto-discovery via get_tools().
Tool registry -- tools/__init__.py builds tool registry from all modules in tools/.
Tool context -- each tool receives tool_context dict with repo_dir, drive_root, task_id, etc.
Shell tool rewrite -- new implementation using shlex.split() for proper arg parsing, subprocess.run for execution.
Claude Code integration -- claude_code_edit tool added, uses Claude Code CLI for refactoring.
Browser tool -- Playwright-based stealth browser with screenshot, click, fill, scroll actions.
Review tool -- multi-model code review using o3, Gemini, Claude as advisors.
Dedup mechanism -- 3-layer dedup: exact (task_id), semantic (LLM), fuzzy (Jaccard).
Background dedup -- dedup runs in background thread, doesn't block message ingestion.
Owner message injection -- redesigned to avoid re-injecting messages on every poll.
Consciousness loop -- background thinking loop with configurable wakeup intervals.
GitHub Issues integration -- create, read, comment on, close issues via GitHub API.
Web search -- DuckDuckGo search via OpenAI Responses API.
New tools: list_available_tools, enable_tools, drive_read, drive_write, drive_list.
State persistence -- budget tracking, version info, owner info saved to state.json.
Dashboard -- simple web dashboard on port 8080 showing system status.
Chat history -- stored in chat.jsonl with optional search.
Knowledge base -- persistent knowledge storage on Drive in knowledge/ folder.

v4.25 -- Stability & Dash

Fix: memory leak -- properly close file handles in drive tools.
Fix: consciousness hang -- timeout on LLM calls in background loop.
Fix: dashboard data -- handle missing files gracefully.
Dash UI update -- new cards for workers, budget, version.
Pytest coverage -- 85 tests passing.

v4.20 -- Background Consciousness

Background consciousness -- new loop that thinks between tasks.
Wakeup scheduling -- configurable intervals between background thoughts.
Memory integration -- scratchpad and identity persist across sessions.
Knowledge base -- accumulated knowledge by topic.

v4.15 -- Multi-Model Review

Multi-model review -- use o3, Gemini, Claude to review code changes.
Review tool -- orchestrates multiple models, aggregates feedback.
Evolution stats -- track evolution cycle metrics over time.

v4.10 -- Task Queue Refactor

Task queue -- per-worker queues with priority support.
Worker management -- spawn/kill/manage worker processes.
Timeout enforcement -- hard timeouts per task type.
Retry logic -- automatic retry with exponential backoff.

v4.5 -- Core Architecture

Supervisor -- process management, message routing.
Agent core -- thin orchestrator, context building, tool loop.
LLM client -- OpenRouter integration with fallback chain.
Memory -- scratchpad, identity, chat history.

v4.1 -- Genesis

Initial self-modifying agent.
Telegram bot interface.
Git-based code evolution.

Author

Created by Anton Razzhigaev

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 555 Commits
.claude/cache/agents/aegis		.claude/cache/agents/aegis
dashboard		dashboard
data		data
deploy		deploy
docs		docs
mcp_servers		mcp_servers
memory/knowledge		memory/knowledge
notebooks		notebooks
prometheus		prometheus
prompts		prompts
supervisor		supervisor
tests		tests
.gitignore		.gitignore
BIBLE.md		BIBLE.md
ENDOFFILE		ENDOFFILE
EOF		EOF
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
VERSION		VERSION
colab_bootstrap_shim.py		colab_bootstrap_shim.py
colab_launcher.py		colab_launcher.py
config.env.example		config.env.example
edit_test.py		edit_test.py
href=\"[^\"]*\"		href=\"[^\"]*\"
launcher.py		launcher.py
launcher_fix.patch		launcher_fix.patch
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Prometheus

What Makes This Different

Architecture

Quick Start (Google Colab)

Step 1: Create a Telegram Bot

Step 2: Get API Keys

Step 3: Set Up Google Colab

Step 4: Fork and Run

Step 5: Start Chatting

Telegram Bot Commands

Philosophy (BIBLE.md)

Configuration

Required Secrets (Colab Secrets or environment variables)

Optional Secrets

Optional Configuration (environment variables)

Evolution Time-Lapse

Branch

Changelog

v6.5.0 -- Python Execution

v6.4.1 -- Scheduled Tasks

v6.4.0 -- Test Coverage

v6.3.9 -- Cleanup

v6.3.7 -- Version Sync Enhancement

v6.3.5 -- ToolRegistry Enhancement

v6.3.4 -- Circuit Breaker + Auto-Restart

v6.3.3 -- Bugfix Release

v6.2.0 -- Critical Bugfixes + LLM-First Dedup

v6.1.0 -- Budget Optimization: Selective Schemas + Self-Check + Dedup

v6.0.0 -- Integrity, Observability, Single-Consumer Routing

v5.0.0 -- Major Refactor: Modular Tool System

v4.25 -- Stability & Dash

v4.20 -- Background Consciousness

v4.15 -- Multi-Model Review

v4.10 -- Task Queue Refactor

v4.5 -- Core Architecture

v4.1 -- Genesis

Author

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages