Why AI needs to dream: a guide to long-term memory
Agentic memory is an area of AI where even the benchmarks are vague.
How do you really measure “memory”? Retention? Relevance? Recall quality? Personality consistency?
It is surprisingly difficult to find concise, practical information on the topic.
Sure, there are tons of …
10 Lessons learned when building Charlie
Last week, we launched Charlie, our first AI agent, designed specifically for financial institutions.
While designing and building it over the past few months, the pace was intense. Now I finally have the space to step back and document the core insights we gained while developing this …
How Evals Help Build Reliable AI
I finished 2025 with a negative AI post, so let’s start 2026 with a positive one.
I’m leading a project for an AI-based tool, and we are experimenting heavily with eval-driven development. Technical readers may immediately associate this with test-driven development, and …