Agents can change real systems: close issues, send messages, refund payments, grant access, and update records. Unit tests and mocks miss the hard failures: stale context, wrong permissions, bad ordering, and unsafe cross-tool actions.

Archal lets you test those agents before production. It gives you hosted clones of the SaaS services your code already uses: GitHub, Slack, Stripe, Linear, and more. Point your agent at the clones, run a task, then read the score and trace.

Clones aren’t mocks:
  • they hold state across calls
  • they enforce referential integrity between objects
  • they return the same error shapes as the real service
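
To make the distinction concrete, here is a minimal sketch of the three properties above, using a toy in-memory issue tracker. Everything in it (`ToyIssueClone`, `CloneError`, `mock_get_issue`, the `acme/api` repo) is hypothetical and written only for illustration; it is not Archal's implementation or API, and real clones are hosted services rather than local classes.

```python
from dataclasses import dataclass, field


class CloneError(Exception):
    """Hypothetical error with an HTTP-style status, mimicking a real API's error shape."""

    def __init__(self, status: int, message: str):
        super().__init__(message)
        self.status = status
        self.message = message


@dataclass
class ToyIssueClone:
    """Illustrative stand-in for a stateful clone (an assumption, not Archal code)."""

    repos: set = field(default_factory=set)
    issues: dict = field(default_factory=dict)
    next_id: int = 1

    def create_repo(self, name: str) -> None:
        self.repos.add(name)

    def create_issue(self, repo: str, title: str) -> dict:
        # Referential integrity: an issue must belong to an existing repo.
        if repo not in self.repos:
            raise CloneError(404, "Not Found")
        issue = {"id": self.next_id, "repo": repo, "title": title, "state": "open"}
        self.issues[issue["id"]] = issue
        self.next_id += 1
        return issue

    def close_issue(self, issue_id: int) -> dict:
        if issue_id not in self.issues:
            raise CloneError(404, "Not Found")
        # State across calls: closing mutates the stored record.
        self.issues[issue_id]["state"] = "closed"
        return self.issues[issue_id]

    def get_issue(self, issue_id: int) -> dict:
        if issue_id not in self.issues:
            raise CloneError(404, "Not Found")
        return self.issues[issue_id]


def mock_get_issue(issue_id: int) -> dict:
    """A typical mock: the same canned response regardless of earlier calls."""
    return {"id": issue_id, "repo": "acme/api", "title": "Login fails", "state": "open"}


clone = ToyIssueClone()
clone.create_repo("acme/api")
issue = clone.create_issue("acme/api", "Login fails")
clone.close_issue(issue["id"])

# The clone remembers the close; the mock still reports the issue as open.
print(clone.get_issue(issue["id"])["state"])
print(mock_get_issue(issue["id"])["state"])
```

An agent that closes an issue and then re-reads it would see a consistent result against the stateful version, while the mock would hide an ordering or stale-context bug.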

I have an existing agent

Add Archal to a repo that already runs.

I need the first run

Add a small harness and run one scored test.

I want manual endpoints

Start persistent clones for SDK, REST, or MCP debugging.

I want test-suite integration

Use hosted clones from Vitest.

When to use which entrypoint

  • archal run - scenario runner. Markdown describes setup, task, and success criteria; Archal provisions clones, runs your code, evaluates the result.
  • archal clone start - persistent clone session with live API endpoints. Right for debugging, prompt iteration, or wiring up a new integration. Pair with archal run --reuse-session for a fast inner loop that skips the cold start.
  • archal/vitest - hosted clones inside an existing Vitest suite. Tests use normal SDK clients; route mode redirects traffic to clones.
Same runtime in all three. The wrapping is different.

Concepts

  • Clone - Stateful behavioral copy of a real service, hosted by Archal.
  • Scenario - Markdown file: setup, task, success criteria.
  • Harness - The command Archal runs to call your agent.
  • Satisfaction - Score from 0 to 100 across one or more runs.
  • Seed - Starting state for a clone. Built-in or generated from English.
  • Trace - Record of every tool call and API request, with state mutations.

Go deeper