Skip to main content
Hallucination Firewall

~2% hallucination rate. Not 18%.

A 3-stage pipeline that extracts claims, verifies them against source turns, and sanitizes any that can't be confirmed — all within the same LLM call.

2.1%
Hallucination rate
vs 18.3% raw LLM baseline
94%
Claim catch rate
in blind evaluation runs
0ms
Added latency
runs inside the LLM call

The 3-stage pipeline

Every LLM response passes through all three stages before the user sees it.

Stage 01
ClaimExtractor
Identifies all factual claims in the LLM output before delivery to the user.
Stage 02
ClaimVerifier
Cross-references each extracted claim against source conversation turns.
Stage 03
OutputSanitizer
Removes unverified claims; replaces with null or a targeted follow-up question.

Real example: medical intake

The same conversation — once with raw LLM output, once with the firewall active.

Raw LLM — invented data

Based on your conversation, your blood pressure appears to be 120/80 and you seem to have a mild fever of 37.5°C.

These values were never mentioned in the conversation
TrueNorth Firewall — sanitized

Based on what you've shared, I'd like to confirm a few things. Could you share your current blood pressure reading? And have you measured your temperature recently?

Unverifiable claims removed, targeted follow-ups generated

Common questions

Things engineers ask before deploying to production.

QDoes this add latency?

No. The 3 stages run inside the same LLM call using structured output constraints. There is zero added round-trip latency.

QHow was the 2.1% rate measured?

Blind automated evaluation across 1,000 real production medical intakes using a separate verification model. The test harness is open-source.

QDoes it work in other languages?

Yes. ClaimExtractor and ClaimVerifier both operate on the semantic content, not raw English strings. All 22 supported languages are covered.

QCan I tune the confidence threshold?

Yes. Set min_confidence: 0.85 in your YAML to accept only claims the verifier is 85%+ confident about. Lower thresholds allow more through.

See it run live

The playground runs the full pipeline including the hallucination firewall on every bot turn.