
LLM Hallucinations

Last reviewed: 2026-05-04

An LLM hallucination is a confident, fluent output from a large language model that is not grounded in fact. Hallucinations range from subtle inaccuracies to entirely fabricated citations, product features, or policies — and they are the single biggest blocker to enterprise deployment of generative AI.


Why LLM hallucinations matter

  • The #1 blocker to enterprise AI deployment. Hallucination risk is why most regulated enterprises will not deploy unconstrained LLMs in customer-facing roles.
  • Compliance exposure. A hallucinated answer about medication, credit eligibility, or a telecom contract can create real legal liability.
  • Trust erosion. Even one hallucinated answer in a high-stakes interaction erases customer trust.
  • Silent failure mode. Hallucinations often look fluent and confident — they are hard to detect without specific evaluation.
  • Model-dependent severity. Different LLMs hallucinate at different rates, and the landscape shifts every quarter.
  • Cascade risk. A hallucinated fact early in a conversation can propagate through subsequent turns.

How it works

LLM hallucinations happen for four main reasons:

  • Training gaps. The model was not trained on the information being asked about.
  • Outdated knowledge. The model’s training data is older than the fact being requested.
  • Weak retrieval. In RAG setups, the wrong or insufficient context was retrieved.
  • Overconfidence from training. LLMs are rewarded for fluent responses, which pushes them toward confident answers even when uncertain.

How to measure

  • Hallucination rate — percentage of responses containing unsupported claims (see the sketch after this list).
  • Groundedness score — percentage of claims traceable to a source in the context or knowledge base.
  • Citation accuracy — percentage of cited sources that actually support the claim.
  • Compliance-turn accuracy — percentage of regulated responses judged correct.
  • Customer-reported errors — tracked and triaged systematically.
  • Red-team catch rate — percentage of deliberately crafted adversarial prompts that trigger hallucinations.
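
The first two metrics are easy to compute once each response has been labelled, by a human reviewer or an automated judge, with how many of its claims are supported by a source. Here is a minimal sketch in Python; the EvaluatedResponse shape is a hypothetical illustration, not a standard format:

```python
# Aggregate hallucination rate and groundedness score from per-response
# labels. The labelling step (human or LLM judge) is assumed to have
# already happened; only the bookkeeping is shown here.
from dataclasses import dataclass

@dataclass
class EvaluatedResponse:
    claims_supported: int    # claims traceable to the context or knowledge base
    claims_unsupported: int  # claims with no supporting source

def hallucination_rate(responses: list[EvaluatedResponse]) -> float:
    """Share of responses containing at least one unsupported claim."""
    flagged = sum(1 for r in responses if r.claims_unsupported > 0)
    return flagged / len(responses)

def groundedness_score(responses: list[EvaluatedResponse]) -> float:
    """Share of all claims that are traceable to a source."""
    supported = sum(r.claims_supported for r in responses)
    total = supported + sum(r.claims_unsupported for r in responses)
    return supported / total

sample = [EvaluatedResponse(3, 0), EvaluatedResponse(2, 1)]
print(f"hallucination rate: {hallucination_rate(sample):.0%}")  # 50%
print(f"groundedness score: {groundedness_score(sample):.0%}")  # 83%
```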

How to improve performance

  • Use 100% output control on compliance turns. For regulated content, do not rely on generative output at all — use deterministic responses (sketched after this list).
  • Ground with retrieval-augmented generation. RAG significantly reduces hallucinations for informational turns.
  • Evaluate groundedness continuously. Automated scoring plus sampled human review on high-stakes turns.
  • Use smaller, well-scoped prompts. Large open-ended prompts give the model too much rope.
  • Cite sources in responses. Makes hallucinations easier for humans and evaluators to detect.
  • Red-team regularly. Structured adversarial testing surfaces failure modes that normal traffic never will.
  • Keep LLMs swappable. Different models hallucinate in different ways — you need the option to move.
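
To make the first item concrete, here is a minimal sketch of deterministic routing on compliance turns. It assumes an upstream intent classifier (not shown) and a placeholder llm.complete interface; the intent names and response texts are invented for illustration, not taken from any real deployment:

```python
# Pre-approved, legally vetted responses for regulated intents. Any turn
# matching one of these intents never reaches the LLM, so it cannot
# hallucinate by construction. All names and texts are hypothetical.
APPROVED_RESPONSES = {
    "early_termination_fee": "Your early termination fee is set out in section 4 of your contract.",
    "credit_eligibility": "Eligibility decisions are made by our credit team, not in this chat.",
}

def respond(intent: str, user_message: str, llm) -> str:
    if intent in APPROVED_RESPONSES:
        # Compliance turn: deterministic, pre-approved text only.
        return APPROVED_RESPONSES[intent]
    # Informational turn: generative output is acceptable here, ideally
    # grounded with retrieval (see the RAG sketch in the FAQ below).
    return llm.complete(user_message)
```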

The Teneo perspective on LLM Hallucinations

Teneo was built specifically to make LLM hallucinations a non-issue in enterprise contact centers. Four principles:

  • 100% output control via TLML: compliance-sensitive turns use deterministic responses that cannot hallucinate by definition.
  • LLM-independence by design: you can move to whichever model has the lowest hallucination rate at any given time.
  • The best integrations engine in the category: retrieval and grounding connect to the real knowledge bases enterprises actually maintain.
  • A focus on resolved interactions, not deflected calls: a hallucinated answer is not a resolution, even if it sounds fluent.

Explore the Teneo Agentic AI platform or read our guide on conversational AI for the enterprise.

FAQ

What is an LLM hallucination?

An LLM hallucination is when a large language model produces a fluent, confident response that is factually wrong — a fabricated fact, a misquoted policy, a citation that does not exist. Hallucinations are the most discussed and most dangerous failure mode of generative AI, particularly in regulated industries where accuracy is non-negotiable.

Why do LLMs hallucinate?

LLMs are trained to produce plausible-sounding text, not to be correct. When a question falls outside their training data, concerns facts newer than that data, or is ambiguous, they generate a fluent answer anyway. The same training that makes them useful — producing coherent language — also makes them prone to confident fabrication when they do not actually know the answer.

Can LLM hallucinations be eliminated?

Not entirely, but they can be reduced to manageable levels for enterprise use. The strategy is layered: retrieval-augmented generation for grounding, deterministic responses on compliance-sensitive turns, continuous evaluation, source citation, and red-teaming. In regulated industries, the gold standard is to avoid generative output on the highest-stakes turns altogether.

How does retrieval-augmented generation help with hallucinations?

RAG grounds LLM responses in trusted source material retrieved at query time. Instead of answering from its training data alone, the model generates from documents pulled from your knowledge base. Hallucinations still happen when retrieval is poor or the model misreads the context, but the rate drops significantly.
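
As a rough sketch of the grounding pattern, assuming hypothetical retriever.search and llm.complete interfaces rather than any particular SDK:

```python
def grounded_answer(question: str, retriever, llm, k: int = 4) -> str:
    # Retrieve candidate passages; weak retrieval at this step is the main
    # way hallucinations survive a RAG pipeline. Each passage object is
    # assumed to expose a .text attribute.
    passages = retriever.search(question, top_k=k)
    context = "\n\n".join(p.text for p in passages)
    # Constrain the model to the retrieved context and give it an explicit
    # escape hatch when the evidence is insufficient.
    prompt = (
        "Answer the question using ONLY the context below. If the context "
        "does not contain the answer, reply exactly: I don't know.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
    return llm.complete(prompt)
```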

What is the hallucination rate of modern LLMs?

It varies by model, task, and how you measure. On open-ended factual questions without grounding, hallucination rates of 5–20% are typical even for frontier models. With retrieval grounding and well-scoped prompts, rates drop into the low single digits. For regulated industries where zero is the only acceptable number on compliance turns, deterministic responses are the answer.

How do I detect hallucinations in production?

Combine automated groundedness scoring — using an LLM or rule-based checks to verify claims against sources — with sampled human review on high-stakes turns. Track customer-reported errors systematically. Red-team regularly with adversarial prompts. And never rely on a single detection mechanism; production hallucination defence is always layered.
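
One common building block is an LLM-as-judge groundedness check that verifies each claim extracted from a response against the retrieved sources. A simplified sketch follows; judge.complete is a placeholder interface, and the verdict parsing is deliberately naive, which is one reason production defence is layered:

```python
def is_grounded(claim: str, sources: list[str], judge) -> bool:
    # Ask a judge model for a binary verdict on one claim.
    prompt = (
        "Does the evidence below support the claim? Answer SUPPORTED or "
        "UNSUPPORTED.\n\nEvidence:\n" + "\n---\n".join(sources) +
        f"\n\nClaim: {claim}"
    )
    return judge.complete(prompt).strip().upper().startswith("SUPPORTED")

def flag_for_review(claims: list[str], sources: list[str], judge) -> list[str]:
    """Return the claims that should be escalated to human review."""
    return [c for c in claims if not is_grounded(c, sources, judge)]
```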
