GAD
v1
requirements unknown
pre-gate requirements
planning-migration
planning-migration/v1
Composite score
0.978
Dimension scores
Where the composite came from
Each dimension is scored 0.0 – 1.0 and combined using the weights in evals/planning-migration/gad.json. Human review dominates on purpose — process metrics alone can't rescue a broken run.
| Dimension | Score | Bar |
|---|
Skill accuracy breakdown
Did the agent invoke the right skills at the right moments?
Skill accuracy data isn't relevant for this run (no expected trigger set).
Process metrics
How the agent actually worked
Primary runtime
—
older runs may not carry runtime attribution yet
Agent lanes
0
0 root · 0 subagent · source missing
Observed depth
—
0 traced event(s) with agent lineage