Train any model through scenario-based courses. Graduate with a SKILL.md — portable expertise that makes agents reliable in production. No fine-tuning. No weight modification. Just knowledge, distilled and proven.
Run agents through progressively harder scenarios with mock services. An LLM judge evaluates responses, extracts failure patterns, and generates a SKILL.md document that gets injected into the agent's system prompt. Different models fail differently — each gets its own SKILL.md. The loop repeats until the target score is reached.
npm install -g dojo.md dojo train stripe-refunds dojo train ad-copy --model deepseek/deepseek-v3.2 --target 90 dojo arena stripe-refunds