
Overview

fastskill eval runs skill quality checks using the prompts and rubrics defined under [tool.fastskill.eval] in skill-project.toml. It has four subcommands: validate, run, report, and score.
eval run requires the agent for the chosen --agent key (for example codex, claude, gemini, opencode, or agent) to be available on PATH. Run eval validate --agent <key> to confirm that it is.
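
As a rough local pre-check before calling eval validate, the PATH requirement above can be sketched in POSIX shell. The helper names agent_on_path and available_agents are hypothetical, and the sketch assumes the agent key is also the name of the executable, which this page does not confirm. eval validate --agent <key> remains the supported check.

```shell
# Return 0 if an executable named after the agent key is found on PATH.
# Assumption: the agent key (e.g. "codex") matches the executable name.
agent_on_path() {
    command -v "$1" >/dev/null 2>&1
}

# Print the candidate agent keys that resolve on PATH, one per line.
available_agents() {
    for agent in "$@"; do
        if agent_on_path "$agent"; then
            printf '%s\n' "$agent"
        fi
    done
}

# Usage: list which of the example keys from this page are installed locally.
#   available_agents codex claude gemini opencode agent
```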

Usage

fastskill eval <SUBCOMMAND> [OPTIONS]

eval validate

Checks the eval configuration and files. With --agent, also verifies that the given agent is available.
fastskill eval validate
fastskill eval validate --agent codex
fastskill eval validate --json
Option               Description
--agent <AGENT>      Verify the given agent is available
--format <FORMAT>    table, json, grid, or xml
--json               Shorthand for --format json
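
One way to use eval validate is as a gate in front of eval run, so a missing agent or a broken configuration is caught before any cases execute. The sketch below uses only the flags listed on this page. The function name validate_then_run is hypothetical, and it assumes eval validate exits non-zero when a check fails, which is not stated here.

```shell
# Validate the eval configuration and the agent, then run the suite only if
# validation succeeded.
# Assumption: `eval validate` exits non-zero when a check fails.
validate_then_run() {
    agent=$1
    output_dir=$2
    if ! fastskill eval validate --agent "$agent"; then
        echo "validation failed for agent '$agent'; skipping run" >&2
        return 1
    fi
    fastskill eval run --agent "$agent" --output-dir "$output_dir"
}

# Usage (requires fastskill and the agent to be installed):
#   validate_then_run codex ./eval-runs
```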

eval run

Runs eval cases against an agent; writes artifacts under --output-dir.
fastskill eval run --agent codex --output-dir ./eval-runs
fastskill eval run --agent claude --output-dir /tmp/evals --case smoke-1
fastskill eval run --agent codex --output-dir ./evals --tag smoke --model gpt-4o
fastskill eval run --agent codex --output-dir ./evals --json --no-fail
Option               Description
--agent <AGENT>      Required. Agent key to run
--output-dir <DIR>   Required. Directory for run output
--model <MODEL>      Optional model override passed to the agent
--case <ID>          Run only this case id
--tag <TAG>          Run only cases with this tag
--format <FORMAT>    Output format
--json               Shorthand for --format json
--no-fail            Exit zero even when the suite fails
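
The options above can be combined to run the same tagged cases against several agents. The following is a minimal sketch rather than a recommended workflow. The function name run_tag_for_agents and the per-agent subdirectory layout are hypothetical. It also assumes that, without --no-fail, eval run exits non-zero when the suite fails, which the --no-fail description implies but does not state outright.

```shell
# Run the cases tagged "$1" once per agent, each into its own subdirectory of
# "$2", and return non-zero at the end if any agent's run failed.
# Assumption: without --no-fail, `eval run` exits non-zero on suite failure.
run_tag_for_agents() {
    tag=$1
    out_root=$2
    shift 2
    failed=0
    for agent in "$@"; do
        if ! fastskill eval run --agent "$agent" \
                --output-dir "$out_root/$agent" --tag "$tag"; then
            echo "suite failed for agent: $agent" >&2
            failed=1
        fi
    done
    return "$failed"
}

# Usage (requires fastskill and the agents to be installed):
#   run_tag_for_agents smoke ./evals codex claude
```

Deferring the failure to the end lets every agent finish before the overall status is reported. Passing --no-fail to each run instead would hide which agents failed from the exit code.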

eval report

Summarizes a completed run directory.
fastskill eval report --run-dir ./eval-runs/2026-04-01T14-32-10Z
fastskill eval report --run-dir /tmp/evals/latest --json
Option               Description
--run-dir <DIR>      Required. Run directory to summarize
--format <FORMAT>    Output format
--json               Shorthand for --format json

eval score

Re-scores saved artifacts without invoking the agent again.
fastskill eval score --run-dir ./eval-runs/2026-04-01T14-32-10Z
fastskill eval score --run-dir /tmp/evals/run-1 --no-fail
Option               Description
--run-dir <DIR>      Required. Run directory
--format <FORMAT>    Output format
--json               Shorthand for --format json
--no-fail            Exit zero on suite failure
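
Because eval score and eval report both take a --run-dir, they chain naturally once a run directory is known. The sketch below picks the most recent run and re-scores it before summarizing. The helper names latest_run_dir and rescore_latest are hypothetical. It assumes each run is a subdirectory named by a sortable timestamp, as the example path ./eval-runs/2026-04-01T14-32-10Z suggests, and this page does not confirm that layout.

```shell
# Print the lexicographically last subdirectory of "$1"; fail if there is none.
# Assumption: run directories are named by sortable timestamps, as in the
# example path ./eval-runs/2026-04-01T14-32-10Z.
latest_run_dir() {
    latest=
    for dir in "$1"/*/; do
        [ -d "$dir" ] && latest=${dir%/}
    done
    [ -n "$latest" ] && printf '%s\n' "$latest"
}

# Re-score the most recent run without failing on suite failure, then
# summarize the same directory.
rescore_latest() {
    run_dir=$(latest_run_dir "$1") || return 1
    fastskill eval score --run-dir "$run_dir" --no-fail &&
        fastskill eval report --run-dir "$run_dir"
}

# Usage (requires fastskill and at least one completed run):
#   rescore_latest ./eval-runs
```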

See also