AI Hallucinations
The Lawyer Who Trusted ChatGPT
In June 2023, a New York lawyer named Steven Schwartz filed a legal brief in federal court. The brief cited six court cases as precedents to support his argument. The citations looked perfect β case names, dates, court decisions, page numbers. Very professional.
There was just one problem: none of the cases existed.
Not a single one. Schwartz had used ChatGPT to research case law, and the AI had confidently generated six completely fabricated court cases, complete with fake judges, fake rulings, and fake legal reasoning. When the opposing lawyers couldn't find the cases, they informed the judge. The judge was, to put it mildly, not amused.
Schwartz was sanctioned (fined) and his case became international news β not for the legal issue, but as a cautionary tale about AI hallucinations.
This wasn't a bug. It wasn't a glitch. It was a fundamental feature of how large language models work. And understanding it is one of the most important things you can learn about AI.
What Exactly is a Hallucination?
An AI hallucination is when an AI model generates information that is false, fabricated, or nonsensical β but presents it with complete confidence, as if it were true.
The key characteristics:
- Confident delivery. The AI doesn't say "I'm not sure" or "I made this up." It states the false information with the same tone and certainty as true information.
- Plausible sounding. The hallucinated content follows the right patterns β proper grammar, correct format, realistic-sounding names and dates. It looks right.
- Unflagged. The AI gives no warning that this particular piece of information might be unreliable. You get no asterisk, no disclaimer, no "I'm guessing here."
Think of it like a student who doesn't know the answer to an exam question but writes a beautifully structured, articulate, completely wrong response. The handwriting is neat, the grammar is perfect, and the argument flows logically β but the facts are fiction.
Why Do Hallucinations Happen?
To understand hallucinations, you need to understand what LLMs actually do. An LLM like ChatGPT is, at its core, a next-token prediction machine. Given a sequence of words, it predicts the most likely next word. Then the next. Then the next.
Here's the critical insight: predicting the most likely next word is not the same as knowing what's true.
Let's break down the specific reasons:
1. Pattern Matching, Not Knowledge
An LLM doesn't have a database of facts that it looks up. It has learned statistical patterns about which words tend to follow other words. When you ask "What is the capital of France?" it doesn't look up the answer β it recognizes that in its training data, the word "Paris" almost always follows that question pattern.
This works great for well-known facts. But when the patterns are ambiguous, rare, or the question is about something the model hasn't seen much training data for, those statistical patterns can lead to plausible-sounding but wrong answers.
2. No Concept of "I Don't Know"
LLMs are trained to always generate a response. They're completion machines β you give them a prompt, they complete it. They're not naturally trained to say "I don't know" or "I'm not sure about this." The model will always produce the most statistically likely continuation, even when the honest answer would be uncertainty.
3. Training Data Isn't Truth-Labeled
LLMs are trained on massive amounts of internet text β which includes correct information, incorrect information, myths, fiction, jokes, and lies. The model has no reliable way to distinguish between "information that appeared in a textbook" and "information that appeared in a Reddit comment."
4. The Blending Problem
LLMs can mix and match real details into false combinations. It might know that there's a "Journal of Cognitive Science" and a researcher named "Sarah Mitchell" β but combine them into a citation that never existed. Each individual piece might be real; the combination is fabricated.
Demonstrating Why Hallucinations Happen
Famous Hallucination Incidents
The lawyer case isn't an isolated incident. Hallucinations have caused real problems across industries:
- The Lawyer Brief (2023) β Steven Schwartz cited 6 fabricated court cases generated by ChatGPT. He was sanctioned by the court.
- Google Bard's Launch (2023) β During Google's live demo of Bard, the AI incorrectly stated that the James Webb Space Telescope took the first pictures of exoplanets. It didn't. Google's stock dropped $100 billion in a single day.
- Medical Misinformation β Studies have found AI chatbots generating false medical advice, including incorrect drug dosages and fabricated drug interactions. Potentially life-threatening.
- Fake Academic Papers β AI has been caught generating entire fake research papers with fabricated data, fake authors, and made-up experimental results that passed initial peer review.
- News Articles β AI-generated news articles have included fabricated quotes attributed to real people, invented events, and wrong dates for real events.
The Hallucination Spectrum
Not all hallucinations are created equal. They range from minor to catastrophic:
- Minor inaccuracies β Getting a date slightly wrong ("founded in 1997" when it was 1998). Annoying but low-risk.
- Fabricated details β Inventing a specific statistic ("studies show 73% of users prefer..." when no such study exists). Misleading.
- Fabricated sources β Generating fake citations, fake research papers, or fake court cases. Dangerous in professional contexts.
- Contradicting reality β Stating something completely opposite to established fact. Dangerous if the reader trusts the AI.
- Fabricated expertise β Providing detailed medical, legal, or financial advice based on fabricated knowledge. Potentially catastrophic.
How to Spot Hallucinations
Developing a "hallucination radar" is one of the most valuable skills in the AI age. Here's how:
Red Flags to Watch For
- Very specific numbers or statistics β If the AI quotes a precise percentage, dollar amount, or measurement, verify it. Hallucinated stats often sound suspiciously specific ("73.2% of respondents" rather than "about three-quarters").
- Specific citations β Author names, journal titles, case names, dates. These are among the most commonly hallucinated content types.
- Uncommon or niche topics β The less popular a topic is, the less training data exists, and the higher the hallucination risk.
- Biographical details β AI frequently invents or mixes up details about real people, especially those who aren't very famous.
- Recent events β LLMs have training cutoff dates. Anything after that date is either unknown to the model or based on limited data, increasing hallucination risk.
Verification Strategies
- Cross-reference. Never rely on a single AI source. Check the AI's claims against reliable sources β official websites, academic databases, established encyclopedias.
- Ask for sources, then verify them. If the AI cites a study or case, search for it independently. If you can't find it, it might not exist.
- Use grounded AI tools. Tools like NotebookLM are "grounded" in specific sources, dramatically reducing hallucination risk.
- Apply the "too good" test. If the AI's answer perfectly supports your argument with a specific stat or quote, be extra skeptical. Hallucinations often tell you exactly what you want to hear.
- Check the basics. If the AI mentions a person, verify they exist. If it mentions an organization, check it's real. If it quotes a law, look it up.
Why Hallucinations Are So Hard to Fix
If hallucinations are such a big problem, why don't AI companies just fix them? Because it's incredibly difficult. Here's why:
- It's baked into the architecture. LLMs are next-token predictors. They're designed to produce fluent, plausible text β not verified truth. You'd need to fundamentally change how these models work.
- Truth is hard to define. What counts as "true"? Scientific consensus changes. Historical interpretations evolve. Even "facts" can be contested. Building a system that verifies truth is arguably harder than building the language model itself.
- Scale of knowledge. Verifying every claim would require checking against the entire body of human knowledge in real time. That's computationally impractical.
- The long tail. AI companies can reduce hallucinations on common topics, but there's a long tail of obscure, niche, and emerging topics where hallucination remains common.
Current approaches to reducing hallucinations include:
- RLHF (Reinforcement Learning from Human Feedback) β Training models to be more cautious and say "I don't know"
- RAG (Retrieval-Augmented Generation) β Having the model look up information before answering, rather than relying on memory alone
- Grounding β Connecting the model to verified knowledge bases (like NotebookLM does with your documents)
- Chain-of-thought reasoning β Making the model show its reasoning step by step, which can reduce errors
- Confidence calibration β Training models to express uncertainty when they're less sure