Next we’re going to improve our agents using Claude Code. The biggest advantage of having a unified agent platform is that coding agents can read logs to iteratively improve our agents. The codebase comes with two prompts:Documentation Index
Fetch the complete documentation index at: https://agno-v2-ab-home-page-updates-5-16.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
docs/improve-agent.md. Claude derives probes from the agent’s instructions, judges responses, and edits until they pass. Autonomous.docs/extend-agent.md. Add a tool, refine a prompt, fix a bug. User-driven.
agents/<slug>.py directly. Hot-reload picks up the change in ~2s, so the test → judge → edit cycle is fairly tight.
Improve: autonomous probe-and-judge
Open Claude Code in youragent-platform directory and paste:
INSTRUCTIONS and derives 8–12 probes across four categories: golden path, edge cases, tool selection, and adversarial. For each probe, it cURLs the live container, reads tool calls from the logs, and judges PASS or FAIL against what the instructions promise.
For every failure, Claude picks a lever and edits: tighten a rule, add a rule, swap a tool, bump num_history_runs. Claude re-runs only the failed probes. Caps at five iterations.
Extend: user-driven changes
When you have a specific change in mind, paste:When to run each
- Just created an agent and want to harden it before deploying. Use improve.
- Users report the agent is missing the point. Use improve.
- You want to add a new tool or knowledge base. Use extend.
- You hit a specific bug. Use extend.
- You just extended an agent and want to make sure nothing regressed. Use improve again.