🔎 Fellow

Experimentation engine — runs controlled benchmark experiments to validate skill improvements.

Why Fellow?

How do you know if a skill improvement actually made things better? Fellow answers that by running controlled benchmark experiments. When Mentor generates an improvement proposal, Fellow designs and executes the experiment, measures outcomes against baselines, and produces evidence-based results.

Skill packages follow the agentskills.io open standard and are compatible with OpenClaw, Hermes Agent, Claude, and any agentskills.io-compliant client.

Quick Start

# Run an experiment
"Run a benchmark comparing the old and new versions of Sands"

# Check results
"What were the results of the last experiment?"

What It Does

Fellow is the empirical testing arm of the OCAS self-improvement loop. It receives experiment requests (typically routed through Mentor), designs controlled benchmarks, executes them, and measures outcomes. Results flow back to Mentor for evaluation and potential promotion.

Dependencies

Mentor — receives experiment requests
Target skills under evaluation

Fellow is part of the OCAS Agent Suite.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
evals		evals
references		references
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
SKILL.md		SKILL.md
evals.json		evals.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔎 Fellow

Why Fellow?

Quick Start

What It Does

Dependencies

About

Uh oh!

Releases 14

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

🔎 Fellow

Why Fellow?

Quick Start

What It Does

Dependencies

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 14

Packages 0

Uh oh!

Contributors

Uh oh!

Packages