Tag
Eval
Every post tagged "Eval" · articles, case studies, guides.
posts02rss feed→
- 01→
Build an LLM Eval Harness in 200 Lines of TS
Frameworks are great until they get in the way. Here is a 200-line TS eval harness that runs in CI, blocks regressions and prints a diff.
AI solutions - 02→
LLM evals-as-code · the CI gate we run on every RAG deploy
An eval that's not in CI is not an eval. Here's the evals-as-code workflow we run on every RAG project.
AI solutions
Liked what you saw? Let's build yours.
Short email or a 30-min call · 24h reply.
Start a project