Overview
Archal runs in any CI environment. Set your auth token, pick an output format, and set a pass threshold. If the agent’s satisfaction score drops below the threshold, the build fails.
Secrets
The only required secret is your Archal token.
GitHub Actions
GitLab CI
Useful flags
| Flag | What it does |
|---|---|
| `--pass-threshold <score>` | Exit 1 if satisfaction is below this (0-100) |
| `-o json` | Machine-readable JSON output |
| `-q` | Suppress non-error output |
| `-n, --runs <count>` | Run the scenario multiple times for a real satisfaction score |
| `--tag <tag>` | Only run scenarios with a matching tag (exits 0 if no match) |
| `--preflight-only` | Validate config and exit without running |
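
As a rough illustration of how the flags in the table above combine, the following Python sketch turns keyword options into an argument list that a CI wrapper script could append to its Archal command. The helper name `build_flags` and its parameter names are hypothetical and not part of Archal; only the flag spellings come from the table.

```python
from typing import List, Optional


def build_flags(
    pass_threshold: Optional[int] = None,
    runs: Optional[int] = None,
    tag: Optional[str] = None,
    json_output: bool = False,
    quiet: bool = False,
    preflight_only: bool = False,
) -> List[str]:
    """Translate keyword options into CLI flags (hypothetical helper)."""
    flags: List[str] = []
    if pass_threshold is not None:
        # The table documents the threshold as a score from 0 to 100.
        if not 0 <= pass_threshold <= 100:
            raise ValueError("pass_threshold must be between 0 and 100")
        flags += ["--pass-threshold", str(pass_threshold)]
    if runs is not None:
        if runs < 1:
            raise ValueError("runs must be at least 1")
        flags += ["--runs", str(runs)]
    if tag is not None:
        flags += ["--tag", tag]
    if json_output:
        flags += ["-o", "json"]
    if quiet:
        flags.append("-q")
    if preflight_only:
        flags.append("--preflight-only")
    return flags


print(build_flags(pass_threshold=60, runs=3, json_output=True))
# prints ['--pass-threshold', '60', '--runs', '3', '-o', 'json']
```

Rejecting out-of-range values in the wrapper is a design choice for this sketch; per the exit-code table, the CLI itself reports bad flags with exit code 2.
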
Exit codes
| Code | Meaning |
|---|---|
| 0 | Score met the threshold (or scenario skipped by `--tag`) |
| 1 | Score below threshold or runtime error |
| 2 | Validation error (bad flags, missing scenario, invalid config) |
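
A wrapper script may want to log why a step failed rather than only propagating the code. The sketch below is one way to encode the table above in Python; the names `EXIT_MEANINGS`, `describe_exit_code`, and `is_config_problem` are hypothetical, and the handling of undocumented codes is an assumption of this example.

```python
# Meanings copied from the exit-code table; anything else is undocumented.
EXIT_MEANINGS = {
    0: "score met the threshold, or the scenario was skipped by --tag",
    1: "score below threshold, or runtime error",
    2: "validation error (bad flags, missing scenario, invalid config)",
}


def describe_exit_code(code: int) -> str:
    """Return a human-readable meaning for an exit code (hypothetical helper)."""
    return EXIT_MEANINGS.get(code, f"undocumented exit code {code}")


def is_config_problem(code: int) -> bool:
    """True when the failure points at the pipeline setup, not the agent."""
    return code == 2


for code in (0, 1, 2):
    print(code, "->", describe_exit_code(code))
```

Separating code 2 from code 1 lets a pipeline flag configuration mistakes differently from a genuine drop in the satisfaction score.
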
Running multiple scenarios
If you have multiple scenario files, run them in a loop or use a suite in `.archal.json`.
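
The loop approach can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the base command `archal run <scenario>` is a guess at the invocation and is not confirmed by this page, and `run_all` and `run_command` are hypothetical helpers. Substitute the command your project actually uses.

```python
import subprocess
from typing import Callable, Iterable, List, Sequence

# ASSUMPTION: hypothetical base command; replace with your real invocation.
BASE_COMMAND = ["archal", "run"]


def run_command(argv: Sequence[str]) -> int:
    """Run one command and return its exit code."""
    return subprocess.run(list(argv)).returncode


def run_all(
    scenarios: Iterable[str],
    extra_flags: Sequence[str] = (),
    runner: Callable[[List[str]], int] = run_command,
) -> int:
    """Run every scenario and return the highest exit code seen."""
    worst = 0
    for scenario in scenarios:
        code = runner([*BASE_COMMAND, scenario, *extra_flags])
        print(f"{scenario}: exit {code}")
        # Keep going after a failure so CI reports on every scenario.
        worst = max(worst, code)
    return worst
```

A CI step could call `sys.exit(run_all(paths, ["--pass-threshold", "60", "--runs", "3"]))` so the job fails if any scenario fails. Taking the maximum means a validation error (2) takes precedence over a low score (1) in the final status. The `runner` parameter exists so the loop can be exercised without the CLI installed.
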
Tips
- Start with `--pass-threshold 60` and tighten as your agent improves.
- Use `--runs 3` or higher. A single run can be noisy. Multiple runs give you a real satisfaction score.
- Use `-o json` when you want to parse the output or store it as an artifact.
- Set `--timeout` to something reasonable for CI. The default is 180 seconds per run.
