AI Demo Evaluation Worksheet

Score each run from 1 (needs work) to 5 (excellent).

Dimension     What good looks like
Clarity       Anyone can understand the answer quickly.
Usefulness    The output leads to a concrete next action.
Accuracy      The response aligns with source facts and context.
Trust         Limits, assumptions, and risks are clearly disclosed.
Consistency   Similar inputs produce stable, reliable outputs.

Debrief prompts