Eval Setup
This page covers the minimum setup for repeatable eval execution.1) Ensure project prerequisites
skill-project.tomlexists at project root- Skills are installed (
fastskill install) - Chosen eval agent is available on
PATH
2) Validate eval configuration
Run validation before any suite execution:--agent to confirm the runtime is available in the current environment.
3) Define a stable output layout
Use a predictable output root (for local and CI):--output-dir in eval run.
4) Normalize run filters
Use tags and case ids consistently so teams can run:- fast feedback: smoke tags
- release checks: full suite
- focused debugging: single case id
5) Define fail policy
Default behavior should fail on quality regression. Use--no-fail only for exploratory runs.
Setup checklist
-
fastskill eval validatepasses - Agent binary is available
- Output directory is versioned in CI artifacts policy
- Team agreed on tag naming (
smoke,regression,release)