Miriam Kümmel & Mathis Lucka•2026-04-26•10 min read
Hallucination Detection Comparison
What's the best tool for hallucination detection? We put 7 of them to the test.
Read MoreLatest insights, tutorials, and news about AI governance and guardrails
What's the best tool for hallucination detection? We put 7 of them to the test.
Read More
We created PlaceboBench, a challenging pharmaceutical RAG benchmark based on real clinical questions and official EMA documents. Twelve state-of-the-art LLMs show hallucination rates between 24% and 64%.
Read More
Our LLM OCR Cost Calculator allows AI builders to compare PDF parsing costs across different LLM providers and models
Read More
We enhanced the RAGTruth benchmark by finding 10x more hallucinations through automated detection and human review.
Read More
The gap between hallucination benchmarks and production reality and what bears have to do with it
Read More