
How to completely prevent AI hallucinations in your retail catalog copy by using schema-based prompts.
If you have ever used an AI tool to write product catalog copy and then had to correct half of it because the model invented specifications, dimensions, or features that do not exist, you already understand the hallucination problem. AI language models do not check your inventory system before they write. They generate text that sounds plausible based on the type of product they think they are describing. For a retailer with hundreds of SKUs, this is not an inconvenience — it is a liability. Published catalog copy with incorrect specifications produces returns, complaints, and lost customer trust. For $1, this article shows you how to eliminate AI catalog hallucinations entirely using a schema-based prompt structure.
The solution is not to avoid AI for catalog copy. It is to give the AI exactly what it needs to write accurately: a structured specification sheet that leaves no room for invention. When the model has a complete, well-structured data source to draw from, hallucination becomes structurally impossible — because the only content available to the model is the content you have provided.
Why Hallucinations Happen in Catalog Copy
AI language models hallucinate when they are asked to write about something they do not have complete data on. If you prompt an AI with 'write a product description for a 2.4kg titanium camping stove,' the model will fill in any gaps — fuel type, BTU output, packed dimensions, compatibility — based on what is plausible for a product in that category. It will not tell you it is guessing. It will write confidently and incorrectly.
The fix is not better prompting in the abstract sense. It is providing a data structure so complete that the model has no gaps to fill. The schema-based approach means that every piece of information the model needs to write accurate copy is explicitly provided — and the model is explicitly instructed to use only the provided data.
Building the Product Schema
Create a standardised data template for your product catalog. Every SKU must complete this template before AI copy is generated. The template includes: product name (exact, as sold), SKU or product code, category and subcategory, weight (precise, with unit), dimensions (precise, with unit), materials (exact composition), colour options (exact names), compatibility notes (what it works with and what it does not), key features (maximum five, each in one sentence), what is not included (critical for managing expectations), and warranty or guarantee terms.
This template becomes your source of truth. No field is left blank. If a specification is not available, the field reads 'not specified' — which instructs the AI not to invent the value.
Build the template in a spreadsheet so that you can complete it quickly for each SKU and copy the completed data block into your AI prompt. This is a one-time setup cost that pays back across every product you write copy for going forward.
The Schema-Based Prompt
Your AI prompt should follow this structure exactly: 'Write a product catalog description for the following item. Use ONLY the information provided in the specification below. Do not add features, specifications, or claims that are not explicitly stated. If a specification is marked as not specified, do not include information about that attribute in the copy. Specification: [paste completed schema].'
The phrase 'Use ONLY the information provided' is the most important instruction in the prompt. Add it explicitly. Then add: 'Do not add features, specifications, or claims that are not in the schema.' Repeating the constraint in two different formulations reduces the hallucination risk substantially.
Test the system with three products where you know every specification precisely. Review the output for any invented data. If you find hallucinations, add a third instruction line: 'If you are unsure whether a detail is in the specification sheet, do not include it.'
Quality Control
Even with a well-structured schema and explicit instructions, run every AI-generated product description through a two-minute check against the original specification sheet. This takes less time than reviewing copy written from a vague prompt, because you are comparing against a structured source rather than checking from memory.
Create a checklist: every numerical claim verified, every material claim verified, every compatibility claim verified, all items listed in 'what is not included' are not implied to be included in the body copy. This checklist takes 90 seconds to complete and eliminates the returns and complaints that come from inaccurate catalog copy.
Scaling the System
Once the schema-based prompting process is established for one product category, it scales to every category in your catalog with minimal additional effort. The schema template adapts — different product types need different specification fields — but the underlying principle is identical: complete inputs produce accurate outputs.
For a catalog of 500 or more SKUs, build the schema completion into your product intake process. Every new product entering your catalog completes the schema before any marketing copy is written. This upstream quality control eliminates the downstream correction cycle that schema-free AI copy generation inevitably creates.
Over time, your completed schema library becomes a valuable company asset — a structured record of every product specification that can be used not just for AI copy generation but for training new staff, answering customer queries, managing returns, and populating third-party sales platforms.
The Schema as a Company Asset
Over time, your completed schema library becomes a valuable company asset — a structured record of every product specification that can be used not just for AI copy generation but for training new staff, answering customer queries, managing returns, and populating third-party sales platforms. The schema work you invest in today continues paying dividends in every future use of the product data.
Export the completed schema library to a shared document system that every team member can access. The customer service team who answers specification questions, the logistics team who manages returns, and the marketing team who creates campaign content all benefit from the same structured data. The schema is not a copywriting tool — it is a company knowledge base.
Final Thought
Garbage in, garbage out is the oldest principle in data. The schema-based approach simply applies it to AI copywriting. Give the model complete, structured data and it produces complete, accurate copy. Give it vague prompts and it invents plausible-sounding specifics. The schema is the difference.
—
