The 60 evaluators shipped across Phases 1–6 are organized below by category, with their key,
implementing class, and (where applicable) the Foundry evaluator they fork from.
The authoritative source for each evaluator's full metadata — score formula, severity, pass
threshold, expected inputs, recommended visualisation, and external compatibility — is the
matching EvaluatorCard JSON under src/AgentEval.Evals.Agentic/EvaluatorCards/<key>.json.
At runtime, Mission Control loads these into the GraphQL evaluators query (see
Mission Control Getting Started), and the
Portal-Ready Evaluators guide documents
the schema for new evaluator authors. Phase 6 evaluators (memory, multi-turn, reasoning,
calibration, UX, adversarial, efficiency) are listed in Cost Guidance.
Detailed per-evaluator scoring rubrics, input contracts, and calibration guidance live in
the EvaluatorCard JSON files. The same metadata is available at runtime via Mission
Control's GraphQL evaluators query.