template · Python · Markdown · 11 KB
RAG eval harness starter · Python + Markdown
The eval framework scaffold we drop into RAG projects in week one. Giskard + promptfoo + custom metrics.
Format: Python · Markdown · Size: 11 KB · Service: AI solutions
description
Starter scaffold for RAG system evaluation: 5 metric classes (faithfulness, context precision, answer relevance, bias, injection resistance), CI integration, diff reporting. MIT-licensed.
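To make the idea of a metric class concrete, here is a minimal sketch of what one of them, context precision, could look like. It is not the template's actual code: `RagSample`, `Metric`, `ContextPrecision`, and `evaluate` are hypothetical names, and the metric assumes hand-labelled relevant chunk ids are available for each sample.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Protocol, Sequence


@dataclass(frozen=True)
class RagSample:
    """One evaluated interaction (hypothetical schema, not the template's)."""
    question: str
    answer: str
    retrieved_ids: Sequence[str]   # chunk ids returned by the retriever
    relevant_ids: frozenset        # hand-labelled ids that support the answer


class Metric(Protocol):
    """Assumed common interface: a name plus a per-sample score in [0, 1]."""
    name: str

    def score(self, sample: RagSample) -> float: ...


class ContextPrecision:
    """Fraction of retrieved chunks that are labelled relevant."""
    name = "context_precision"

    def score(self, sample: RagSample) -> float:
        if not sample.retrieved_ids:
            return 0.0  # nothing retrieved: treat as zero precision
        hits = sum(1 for cid in sample.retrieved_ids if cid in sample.relevant_ids)
        return hits / len(sample.retrieved_ids)


def evaluate(samples: Sequence[RagSample], metrics: Sequence[Metric]) -> dict[str, float]:
    """Mean score per metric over all samples (0.0 when there are no samples)."""
    if not samples:
        return {m.name: 0.0 for m in metrics}
    return {m.name: sum(m.score(s) for s in samples) / len(samples) for m in metrics}
```

Under this assumed interface, the other four metrics (faithfulness, answer relevance, bias, injection resistance) would be further classes with the same `name` and `score` members, so `evaluate` can run them all in one pass.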
What's inside · 5 items
- 01 · 5 metric classes with Python code
- 02 · Giskard + promptfoo integration
- 03 · Injection-resistance eval suite (80+ prompts)
- 04 · CI step YAML (GitHub Actions + GitLab CI)
- 05 · Diff-report generator per build
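As an illustration of the per-build diff report, the sketch below compares a baseline set of metric scores with the current build's and renders a Markdown table. It is an assumption about how such a generator might work, not the template's implementation: the function name `diff_report`, the `tolerance` parameter, and the rule that every metric is higher-is-better are all hypothetical.

```python
from __future__ import annotations


def diff_report(
    baseline: dict[str, float],
    current: dict[str, float],
    tolerance: float = 0.02,
) -> tuple[str, bool]:
    """Return (markdown_table, passed).

    Assumes higher scores are better for every metric. A metric regresses
    when it drops by more than `tolerance`, or disappears from `current`.
    """
    rows = [
        "| metric | baseline | current | delta | status |",
        "|---|---|---|---|---|",
    ]
    passed = True
    for name in sorted(set(baseline) | set(current)):
        old, new = baseline.get(name), current.get(name)
        if old is None:
            # Metric added in this build: nothing to compare against.
            rows.append(f"| {name} | n/a | {new:.3f} | n/a | new |")
        elif new is None:
            # Metric vanished from this build: flag it rather than ignore it.
            rows.append(f"| {name} | {old:.3f} | n/a | n/a | missing |")
            passed = False
        else:
            delta = new - old
            status = "regressed" if delta < -tolerance else "ok"
            passed = passed and status == "ok"
            rows.append(f"| {name} | {old:.3f} | {new:.3f} | {delta:+.3f} | {status} |")
    return "\n".join(rows), passed
```

In a CI step, one option is to print the table into the job log or a build comment and exit non-zero when `passed` is `False`, so that a regression fails the build.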
Want a custom version?
A tailored audit or template delivered in 2 weeks · DField Solutions, Budapest.