Run the prompt
Open Claude Code in the clonedagent-platform repo and paste:
- Pick an agent. Either the one you just built, or one you point it at.
- Pull recent container logs. Reads
docker compose logsto see runs, sessions, and tool calls. - Identify failure modes. Reads through the logs and flags weak prompts, bad tool selections, missing instructions, hallucinations.
- Propose changes. Edits the agent’s instructions, adds or removes tools, tightens the prompt, adds guardrails.
- Re-run and verify. Tests the changed agent against the same prompts, compares before/after, asks if you want to keep the changes.
Why this works
Three things have to be true for a coding agent to improve another agent:| Requirement | Where it lives |
|---|---|
| The container logs | docker compose logs, visible to Claude Code in the terminal. |
| The agent code | agents/*.py in this repo. |
| A way to test changes | The running platform, hot-reloads on file change. |
When to run it
- After a batch of real usage (not just your own testing).
- Before deploying a major change.
- When users report that an agent is missing the point.
- Periodically as part of routine maintenance.
What changes, what doesn’t
Claude Code edits agent code in place: instructions, tools, guardrails, model choice. It does not touch:- The platform code (
app/,db/). - Other agents not under review.
- Your
.envor any secrets.
git diff and revert.