Architecture Decision Records (ADRs)

This folder contains Architecture Decision Records documenting significant technical decisions made in the AgentEval project.

What is an ADR?

An Architecture Decision Record (ADR) captures an important architectural decision along with its context and consequences.

ADR Template

Each ADR follows this structure:

  1. Title - Short descriptive title
  2. Status - Proposed, Accepted, Deprecated, Superseded
  3. Context - The situation and forces that led to this decision
  4. Decision - What we decided to do
  5. Consequences - The results of the decision (positive and negative)
  6. Alternatives Considered - Other options we evaluated

Index

ADR Title Status Date
001 Metric Naming Prefixes Proposed 2026-01-07
002 Result Directory Structure Proposed 2026-01-07
003 CLI Review Commands Proposed 2026-01-07
004 Trace Recording and Replay Accepted 2026-01-07
005 Model Comparison and Stochastic Testing Architecture Accepted 2026-01-08
006 Service-Based Architecture & DI Accepted 2026-01-09
007 Metrics Taxonomy Accepted 2026-01-10
008 Calibrated Judge for Multi-Model LLM Evaluation Accepted 2026-01-12
009 Benchmark Strategy Accepted 2026-01-13

Template based on Michael Nygard's ADR format