Confident Nonsense: Garbage In, Polished Garbage Out

TL;DR

  • AI does not correct bad data. It produces confident output built on whatever you feed it.

  • Most “AI failures” are actually upstream data governance failures hiding behind polished outputs.

  • Leaders should treat data readiness as a product, with clear owners, thresholds, and controls that prevent confident nonsense from scaling.

Garbage In, Polished Garbage Out: Why AI Makes Bad Data Look Better Than Ever

If you want to find the fastest way to embarrass an enterprise, don’t deploy a broken AI model. Deploy a perfectly fine model sitting on top of a CRM that hasn’t been trustworthy since your last office holiday party with a budget.

AI doesn’t fix bad data. It beautifies it. It smooths contradictions, fills in gaps with plausible guesses, and presents everything in crisp, well-structured language that looks trustworthy enough to paste into a QBR deck. That’s the problem.

The idea is simple: AI makes bad data look professionally wrong. And when the output looks confident, teams stop questioning the inputs.

Why polished garbage happens

Enterprises overwhelmingly treat data cleanup as a “phase two” activity, which is a polite way of saying “after the funding cycle renews, assuming nothing catches on fire.” Meanwhile, AI projects receive attention, budget, and urgency because they promise visible wins.

The result is predictable: the model appears functional during testing, then fails quietly and consistently in real operations.

Three forces drive the problem.

Organizational incentives
Data quality is rarely owned properly. No single team is accountable for resolving duplicates, normalizing records, or enforcing consent logic at ingestion. AI teams inherit this mess and are expected to produce accuracy on top of it.

Technical smoothing
LLMs and decisioning systems are exceptional at making incomplete or contradictory data look coherent. They are built to produce plausible output, not truth. The more gaps they fill, the more credible the system looks… right up until customers receive contradictory messages.

Human shortcuts
Once stakeholders see a few polished, AI-generated examples, they assume the entire pipeline is fine. Pilots create a false sense of reliability that does not survive activation across regions, products, or messy real-world records.

How to spot the issue before it becomes expensive

You can usually detect “polished garbage” before customers experience it. You just need to look in the right places; the checks are simple enough to script, as the sketch after this list shows.

  • Record-level contradictions: Sales says the account churned; the model shows “high expansion likelihood.”

  • Data that magically becomes complete at inference time: Consent fields, enrichment, or product data appear “fixed” only when the model runs.

  • Metrics that look good in pockets but collapse at scale: A few clean segments perform well, while the broader population produces uneven or erratic outcomes.

  • Steering-committee optimism: Phrases like “we’ll fix the data later” or “the model can learn around gaps” are bright, blinking indicators that you’re scaling confidence rather than correctness.
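
These checks are small enough to script before anything ships. The sketch below is a minimal illustration, not a reference implementation; the field names (churn_status, expansion_score, consent_status) and the 0.7 cutoff are assumptions standing in for whatever your schema actually uses.

```python
# Sketch: pre-activation spot checks for the signals above.
# Field names and thresholds are illustrative placeholders.

def find_contradictions(records):
    """Flag records where the CRM says the account churned but the model still scores expansion high."""
    return [
        r for r in records
        if r.get("churn_status") == "churned" and r.get("expansion_score", 0) > 0.7
    ]

def consent_completeness(records):
    """Share of records whose consent was captured upstream rather than assumed at inference time."""
    known = sum(1 for r in records if r.get("consent_status") in {"opt_in", "opt_out"})
    return known / len(records) if records else 0.0

records = [
    {"id": "a-1", "churn_status": "churned", "expansion_score": 0.84, "consent_status": None},
    {"id": "a-2", "churn_status": "active", "expansion_score": 0.41, "consent_status": "opt_in"},
]

print(find_contradictions(records))   # surfaces the contradictory record a-1
print(consent_completeness(records))  # 0.5
```

Run against the same segment the model will act on, two functions like these surface most record-level contradictions and most “magically complete” fields before a customer ever sees them.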

What to do instead

The solution is less glamorous than a new model and far more effective: treat data readiness like a product.

Start by defining the AI-grade dataset required for the specific use case. Not a master data vision. Not a multi-year program. Just the minimum viable set of fields and rules needed to make automated decisions safe and consistent.

This typically includes the following, with one way to write it down as an explicit contract sketched after the list:

  • Identity keys and account hierarchy

  • Consent and preferences

  • Entitlements or eligibility

  • Contactability and channel data

  • A small, trustworthy set of behavioral signals
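
One way to make that minimum viable set concrete is to write it down as an explicit, reviewable contract rather than a slide. The sketch below is a hedged illustration: the field names, allowed values, and rule structure are assumptions for a single use case, not a prescribed standard.

```python
# Sketch: the AI-grade dataset written down as an explicit, reviewable contract.
# Field names, allowed values, and the rule structure are illustrative assumptions.

from dataclasses import dataclass

@dataclass(frozen=True)
class FieldRule:
    name: str
    required: bool                 # must be present before automated decisions run
    allowed_values: tuple = ()     # empty means free-form or numeric

AI_GRADE_DATASET = (
    FieldRule("account_id", required=True),
    FieldRule("parent_account_id", required=False),
    FieldRule("consent_status", required=True, allowed_values=("opt_in", "opt_out")),
    FieldRule("email_contactable", required=True, allowed_values=(True, False)),
    FieldRule("entitlement_tier", required=True, allowed_values=("basic", "premium")),
    FieldRule("last_activity_days", required=True),
)

def violations(record):
    """Return human-readable rule violations for one record."""
    problems = []
    for rule in AI_GRADE_DATASET:
        value = record.get(rule.name)
        if rule.required and value in (None, ""):
            problems.append(f"missing required field: {rule.name}")
        elif rule.allowed_values and value is not None and value not in rule.allowed_values:
            problems.append(f"unexpected value for {rule.name}: {value!r}")
    return problems
```

The value is less the code than the fact that the definition is versioned, testable, and small enough for someone to actually own.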

Then assign real ownership. A data product owner manages the backlog, SLAs, and improvements. Platform owners handle schema and integration reliability. Governance leads define policy. Model owners evaluate and monitor outcomes. Frontline SMEs provide the operational reality check.

These roles prevent the classic pattern where everyone assumes someone else is responsible.

Next, introduce quality gates before anything gets automated. Examples include:

  • Acceptable duplicate rate in the target segment

  • Identity match confidence thresholds

  • Consent completeness thresholds tied to regulatory requirements

If those gates aren’t met, the system does not act. This is not bureaucracy. It is insurance against customer complaints, compliance issues, and misdirected spend.
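
A minimal sketch of that rule in code, with placeholder thresholds rather than recommendations; what matters is that the gates are explicit, versioned, and checked before activation, not debated after an incident.

```python
# Sketch: quality gates checked before any automated activation.
# Thresholds are illustrative placeholders, not recommended values.

GATES = {
    "duplicate_rate_max": 0.02,            # at most 2% duplicates in the target segment
    "identity_match_confidence_min": 0.90,
    "consent_completeness_min": 0.98,      # set by the actual regulatory requirement
}

def gates_pass(metrics):
    """Return (ok, failures); if any gate fails, the caller must not automate."""
    failures = []
    if metrics["duplicate_rate"] > GATES["duplicate_rate_max"]:
        failures.append("duplicate rate above threshold")
    if metrics["identity_match_confidence"] < GATES["identity_match_confidence_min"]:
        failures.append("identity match confidence below threshold")
    if metrics["consent_completeness"] < GATES["consent_completeness_min"]:
        failures.append("consent completeness below threshold")
    return (not failures, failures)

ok, failures = gates_pass({
    "duplicate_rate": 0.05,
    "identity_match_confidence": 0.93,
    "consent_completeness": 0.97,
})
if not ok:
    print("Automation blocked:", failures)   # the system does not act
```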

Finally, enforce safe defaults. Unknown consent suppresses. Low-confidence identity goes to review. Missing entitlements block eligibility-dependent offers. The goal is not perfection. It is predictable behavior.
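
Safe defaults can be sketched as one small, boring decision function. The action names and the 0.90 cutoff below are illustrative assumptions; the point is that every unknown maps to a conservative, predictable outcome.

```python
# Sketch: safe defaults under uncertainty.
# Action names (SUPPRESS, REVIEW, BLOCK, PROCEED) and the 0.90 cutoff are illustrative.

def decide_action(record, identity_confidence):
    if record.get("consent_status") != "opt_in":
        return "SUPPRESS"        # unknown or opted-out consent: no outreach
    if identity_confidence < 0.90:
        return "REVIEW"          # low-confidence identity: route to human review
    if not record.get("entitlement_tier"):
        return "BLOCK"           # missing entitlements: no eligibility-dependent offers
    return "PROCEED"

print(decide_action({"consent_status": None}, 0.99))                               # SUPPRESS
print(decide_action({"consent_status": "opt_in", "entitlement_tier": ""}, 0.95))   # BLOCK
```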

A quick before-and-after snapshot

Here’s what the shift looks like when done well:

  • Before: AI “cleans” data at inference time, so outcomes shift unpredictably.
    After: Data is normalized upstream, governed, logged, and repeatable.

  • Before: Consent is stored somewhere and “assumed correct.”
    After: Consent is enforced at decision time, versioned, and auditable.

  • Before: Drift means “retrain the model.”
    After: Drift is monitored across both data and outcomes, triggering upstream reviews.

  • Before: Success equals “more automation.”
    After: Success connects data quality to business metrics like complaint rate, conversion quality, and retention.
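
The drift line in that snapshot is the one most teams skip, so here is a hedged sketch of what monitoring the data side can look like. The 0.05 tolerance is an illustrative assumption; the useful part is that a shift in missing-value rates triggers an upstream review instead of a reflexive retrain.

```python
# Sketch: drift monitoring on the data itself, not only model outputs.
# A simple per-field missing-rate comparison; the 0.05 tolerance is illustrative.

def field_null_rates(records, fields):
    """Missing-value rate per field for a batch of records."""
    rates = {}
    for f in fields:
        missing = sum(1 for r in records if r.get(f) in (None, ""))
        rates[f] = missing / len(records) if records else 0.0
    return rates

def data_drift_alerts(baseline_rates, current_rates, tolerance=0.05):
    """Fields whose missing-value rate moved more than `tolerance` versus the baseline,
    which should trigger an upstream review rather than a retrain."""
    return [
        f for f, rate in current_rates.items()
        if abs(rate - baseline_rates.get(f, 0.0)) > tolerance
    ]
```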

What this means for leaders

AI will amplify whatever foundation you provide. If the data is contradictory, incomplete, or poorly governed, the system will still produce fluent outputs that look right until they’re wrong in front of a customer.

Leaders should expect three things from any AI program:

  1. A clear definition of the AI-grade dataset.

  2. Quality gates with thresholds that prevent unsafe automation.

  3. Transparent measurement tying data quality to business outcomes.

If you focus here, the AI becomes stable and predictable. If you skip this and scale quickly, you get polished garbage at enterprise volume. And that is much harder to unwind than fixing the plumbing before the water starts flowing.

Next

CMSWire: The CX Teams Moving Fastest Are Rewriting How AI Research Works