How Evals Help Build Reliable AI

I finished 2025 with a negative AI post, so let’s start 2026 with a positive one.

I’m leading a project to build an AI-based tool, and we are experimenting heavily with eval-driven development. Technical readers may immediately associate this with test-driven development, and …