PlaceboBench findings: Which LLM hallucinates most on Pharma data?

We ran 12 of the latest LLMs against a challenging use case in Pharma. Watch to find out how often models hallucinated, why the model with the lowest hallucination rate might not be the best option, and how latency and cost should inform your model selection.

BenchmarkHallucination DetectionPlaceboBenchLive-Session