Bad AI output means an answer that sounds right but is not: a made-up number, a wrong date, a citation that does not exist. The polished name for this is a hallucination (the model stating something false as if it were fact). Here is the one idea that makes it spottable:
Think of it like a very smooth talker at a party. They are charming and they sound like they know everything, so you take their word for it. Sometimes they are right. Sometimes they confidently invent a fact. Your job is not to be impressed by the delivery. It is to check the parts that can be checked.
This is the single most expensive belief people hold about AI. Let us name it as a myth, then correct it.
The myth: a fluent, specific, confident answer is a reliable answer. The more detail it gives (exact figures, dates, quotes), the more we trust it.
The reality: detail and confidence are the easiest things for a language model to produce. It generates the most plausible-sounding next words. "Plausible" and "true" overlap a lot, but they are not the same thing, and the gaps are exactly where mistakes hide.
A property of the writing.
A property of the world.
Below are real-shaped AI answers, each with subtle inventions hidden inside. Click any underlined phrase that looks off. We will tell you if you caught one, and why it is wrong.
๐ This game runs entirely in your browser. Nothing is typed, sent, or stored anywhere. The "answers" are canned examples, not live AI.
๐ก The pattern: the fabrications are the most specific bits (a stat, a date, a named source). That is the tell. Specificity is where a model is most likely to confidently invent.
You do not need to fact-check every word. You need a quick reflex on the parts that matter. Run these whenever the answer feeds a real decision.
This is the hands-on companion to the deeper safety lesson. For the full picture of guardrails and review, see Day 24: AI safety โ
You know that confidence is a writing style, not proof. You can name the myth and bust it. And you have a five-check reflex for the parts that actually matter: names, numbers, dates, citations, and anything legal or financial.
Spotting bad output by hand works. The stronger move is building AI that has less to fabricate: grounded in your real data, with checks and a human in the loop where it counts. That is what we build, end to end.
Next: the deeper AI safety lesson โ
Day 26 of 30 free, working AI lessons and kits for small business.