Architecture Decision Records (ADRs)
This folder contains Architecture Decision Records documenting significant technical decisions made in the AgentEval project.
What is an ADR?
An Architecture Decision Record (ADR) captures an important architectural decision along with its context and consequences.
ADR Template
Each ADR follows this structure:
- Title - Short descriptive title
- Status - One of Proposed, Accepted, Deprecated, or Superseded
- Context - The situation and forces that led to this decision
- Decision - What we decided to do
- Consequences - The results of the decision (positive and negative)
- Alternatives Considered - Other options we evaluated
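
For tooling that needs to generate or validate ADRs, the structure above could be modeled roughly as follows. This is a minimal sketch, not part of AgentEval: the names `Status`, `ADR`, and `render`, the `number` field, the zero-padded numbering, and the plain-text layout are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class Status(Enum):
    """The four status values listed in the template."""
    PROPOSED = "Proposed"
    ACCEPTED = "Accepted"
    DEPRECATED = "Deprecated"
    SUPERSEDED = "Superseded"


@dataclass
class ADR:
    """Hypothetical in-memory form of one ADR; one field per template section."""
    number: int  # assumed: matches the three-digit ADR column of the index
    title: str
    status: Status
    context: str
    decision: str
    consequences: str
    alternatives: List[str] = field(default_factory=list)

    def render(self) -> str:
        """Return the ADR as plain text, one section per template entry."""
        alternatives = "\n".join(f"- {alt}" for alt in self.alternatives)
        sections = [
            f"ADR {self.number:03d}: {self.title}",
            f"Status\n{self.status.value}",
            f"Context\n{self.context}",
            f"Decision\n{self.decision}",
            f"Consequences\n{self.consequences}",
            f"Alternatives Considered\n{alternatives or '- None recorded'}",
        ]
        return "\n\n".join(sections)


# Placeholder content only; this does not describe any real ADR in the index.
example = ADR(
    number=42,
    title="Example Decision",
    status=Status.PROPOSED,
    context="Placeholder context.",
    decision="Placeholder decision.",
    consequences="Placeholder consequences.",
    alternatives=["Option A", "Option B"],
)
print(example.render())
```

Using an enum for the status means a misspelled or unsupported value fails when the record is constructed rather than surfacing later in the index.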
Index
| ADR | Title | Status | Date |
|---|---|---|---|
| 001 | Metric Naming Prefixes | Proposed | 2026-01-07 |
| 002 | Result Directory Structure | Proposed | 2026-01-07 |
| 003 | CLI Review Commands | Proposed | 2026-01-07 |
| 004 | Trace Recording and Replay | Accepted | 2026-01-07 |
| 005 | Model Comparison and Stochastic Testing Architecture | Accepted | 2026-01-08 |
| 006 | Service-Based Architecture & DI | Accepted | 2026-01-09 |
| 007 | Metrics Taxonomy | Accepted | 2026-01-10 |
| 008 | Calibrated Judge for Multi-Model LLM Evaluation | Accepted | 2026-01-12 |
| 009 | Benchmark Strategy | Accepted | 2026-01-13 |
The template is based on Michael Nygard's ADR format.