critiqor runs prints a summary table of every finalized Critiqor evaluation run found in the local runs directory. For each run it shows the trust score, total tool call count, hallucination risk band, and the primary failure type identified by the diagnosis engine. Use it to get a quick overview of agent reliability across sessions, or to find a run_id before opening a specific dashboard.
Syntax
Options
| Option | Default | Description |
|---|---|---|
| --runs-dir TEXT | runs | Directory containing run artifacts |
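For scripted use, the command can be called from Python. The following is a minimal sketch, not part of Critiqor itself: the helper names `build_runs_command` and `list_runs` are hypothetical, and it assumes only that the `critiqor` executable is on the PATH and accepts the `--runs-dir` option listed above. It makes no assumption about the layout of the printed table and returns the output as plain text.

```python
import shutil
import subprocess
from typing import List, Optional


def build_runs_command(runs_dir: str = "runs") -> List[str]:
    """Build the argument list for `critiqor runs` (hypothetical helper).

    Only the --runs-dir option documented in the table above is used;
    its default value is "runs".
    """
    return ["critiqor", "runs", "--runs-dir", runs_dir]


def list_runs(runs_dir: str = "runs") -> Optional[str]:
    """Run the command and return its printed table as text.

    Returns None when the critiqor executable is not installed.
    """
    if shutil.which("critiqor") is None:
        return None
    result = subprocess.run(
        build_runs_command(runs_dir),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout


if __name__ == "__main__":
    table = list_runs("runs")
    print(table if table is not None else "critiqor is not installed")
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when the runs directory path contains spaces.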
Example output
Only finalized runs appear in the table. A run is finalized with critiqor finalize.
Understanding the columns
| Column | Description |
|---|---|
| Trust score | 0–100 weighted reliability score across 6 dimensions. Higher is better. |
| Tool Calls | Total tool calls captured and verified in the run |
| Hallucination Risk | Low (trust ≥ 75) · Review (trust 60–74) · High (trust < 60) |
| Primary diagnosis | The most impactful detected failure type, or No Major Issue if none was found |
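The Hallucination Risk thresholds in the table can be sketched as a small function. This is an illustration of the documented bands, not Critiqor's actual implementation: the function name `hallucination_risk_band` is hypothetical, and treating fractional scores between 74 and 75 as Review is an assumption, since the table only lists whole-number ranges.

```python
def hallucination_risk_band(trust_score: float) -> str:
    """Map a 0-100 trust score to the risk band described in the table.

    Assumption: any score of at least 60 but below 75 counts as "Review",
    including fractional values such as 74.5.
    """
    if not 0 <= trust_score <= 100:
        raise ValueError("trust score must be between 0 and 100")
    if trust_score >= 75:
        return "Low"
    if trust_score >= 60:
        return "Review"
    return "High"


if __name__ == "__main__":
    # Check the boundary values on each side of the two thresholds.
    for score in (75, 74, 60, 59):
        print(score, hallucination_risk_band(score))
```

Checking the thresholds from highest to lowest keeps each comparison to a single lower bound, so the boundary values 75 and 60 fall unambiguously into Low and Review.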
Workflow
critiqor runs is the fastest way to find a run ID for further inspection: