Feed Optimization
Shopify Product Feed Optimizer: Fix What AI Agents Actually Check
Most "product feed optimizer" tools were built for Google Shopping XML feeds. AI shopping agents read your store differently — and they're checking signals those tools never touch.
The two-era problem in feed optimization
Feed optimization tools were built in the era of Google Shopping XML feeds. Their job: make sure your g:id, g:price, g:availability, and g:condition are correct so your products appear in the Shopping tab.
That still matters. But since March 2026, there's a second era: AI shopping agents that browse your store directly via /products.json, crawl your PDPs for structured data, and use LLM-based reasoning to decide whether your catalog is worth recommending. They're not reading your Merchant Center feed. They're reading your store.
The optimization problem is now two-dimensional: traditional feed compliance for paid channels, plus AI-agent readiness for the new organic channel. Most tools only solve dimension one.
What traditional feed optimizer tools cover
Tools like DataFeedWatch, GoDataFeed, Feedonomics, and Shopify's own Google & YouTube app optimize the XML/CSV feed that gets submitted to Merchant Center. Their audit checklist typically covers:
| Field | Google Shopping | AI Agents |
|---|---|---|
| Product title length + keyword | Critical | Medium |
| Price & availability accuracy | Critical | Medium |
| Product images (count + quality) | Critical | Medium |
| GTIN / MPN / brand | Critical | Critical |
| Google product category | Critical | Low |
| Description richness (200+ words) | Low | Critical |
| AggregateRating JSON-LD | Low | Critical |
| ProductGroup / variant JSON-LD | Low | Critical |
| Shopify AI Shopping category | Low | Critical |
| robots.txt / bot access gates | N/A | Critical |
The overlap is thin: only GTINs and product images are genuinely shared priorities. Everything that makes AI agents recommend you — descriptions, structured data, review signals, category taxonomy — is outside the scope of traditional feed optimization.
The 7 things an AI-era feed optimizer needs to check
-
1
robots.txt access gates
If
User-agent: GPTBotorUser-agent: PerplexityBotis blocked inrobots.txt, no amount of feed optimization matters — the agent can't see your store. This is the most common and most silent failure mode. Check it first. -
2
/products.json accessibility and completeness
Your
/products.jsonendpoint is what most AI agents use for bulk catalog discovery. It needs to be publicly accessible (no password protection), paginated correctly (250 products per page max), and include variant data. How to test your feed URL → -
3
GTIN coverage across variants
GTINs (barcodes) are the product identifier that lets AI agents do price comparison, exact variant matching, and cross-retailer consolidation. Missing GTINs on a variant means that variant is effectively invisible to AI-powered price comparison. Coverage needs to be per-variant, not per-product. GTIN guide for Shopify →
-
4
Description richness and specificity
ChatGPT Shopping and Perplexity Commerce use your product description as citation material when answering a user's query. Thin descriptions (under 100 words, or just a size chart) score poorly. Rich descriptions covering use case, materials, fit, and features score well. This is the highest-leverage signal most stores get wrong.
-
5
AggregateRating JSON-LD on PDPs
Perplexity Commerce treats review count and average rating as a direct recommendation confidence signal. ChatGPT Shopping uses it as a secondary quality indicator. You likely have reviews in your app — but if they're not in JSON-LD on the product page, agents can't read them.
-
6
ProductGroup JSON-LD for variant products
Shopify products with multiple variants (size, color, material) need
ProductGroupstructured data that links parent product to individualProductitems with their own GTINs. Without this, AI agents see one product instead of a full variant matrix — and can't answer "do you have this in medium?" -
7
Shopify AI Shopping category taxonomy
Shopify's Agentic Storefronts (enabled by default since March 2026) use its own AI Shopping category taxonomy. If your products are categorized using old Shopify types or custom tags instead of the standardized taxonomy, they don't surface in the Shopify Global Catalog used by AI agents browsing via the Storefront API.
The optimization sequence: what to fix first
If you run an audit and find issues across all 7 areas, prioritize in this order — each fix unlocks the next:
- robots.txt — 30 minutes; immediate unlock for all AI agent crawling
- /products.json access — 15 minutes; confirms the bulk catalog endpoint works
- AggregateRating JSON-LD — 2–4 hours with a review app theme integration; high leverage for Perplexity Commerce
- ProductGroup JSON-LD + GTIN — 1–2 days for stores with complex variant catalogs; required for exact-match variant lookup
- Description rewrites — highest effort, highest long-term payoff; prioritize top-20 best-selling SKUs first
- Shopify AI Shopping categories — Shopify admin bulk-edit or metafield update; affects Shopify Agentic Storefronts specifically
- FAQ schema on PDPs — adds citation surface for Perplexity; good for informational queries tied to your product type
How CatalogScan runs this audit automatically
CatalogScan is an AI-era feed optimizer: it scans your Shopify store across all 18 AI-readiness signals and returns a 0–100 score with the top-5 highest-leverage fixes. It reads your store directly — /products.json, your PDPs, your robots.txt, and your structured data — the same way AI shopping agents do.
The free scan takes 2 minutes and requires no app install. It checks:
- robots.txt access gates for all major AI crawlers (GPTBot, PerplexityBot, OAI-SearchBot, Googlebot)
- /products.json availability, pagination, and variant field coverage
- GTIN coverage across product variants
- Structured data presence and validity (Product, ProductGroup, AggregateRating, FAQPage)
- Description richness metrics (word count, specificity signals)
- AI Shopping category assignment
- Review signal accessibility
Pro tier adds bulk auto-fix: metafield fills, description rewrites via Claude, and GTIN enrichment from GS1 lookups — the same 7 categories covered above, automated.
FAQ
Do I still need to optimize my Google Shopping feed separately?
Yes. Google Shopping (paid) still uses Merchant Center and your XML feed submission. AI search optimization (Google AI Mode, ChatGPT Shopping, Perplexity) is a separate channel that reads your live store directly. You need both — they have limited overlap except for GTINs and basic product data accuracy.
Can I use an existing feed optimizer app and CatalogScan together?
Yes — they're complementary, not competing. A feed optimizer app (DataFeedWatch, etc.) handles your paid shopping channel. CatalogScan handles the AI-discovery channel. Fixing one doesn't fix the other.
What's the biggest mistake stores make when trying to optimize for AI agents?
Blocking crawlers in robots.txt — usually accidentally, from a "Block all bots" theme setting or a security app that flags AI crawlers as scrapers. It's the most silent failure mode: your store looks fine, your feed looks fine, but ChatGPT and Perplexity have never seen a single product page.
How often do I need to re-optimize?
Description quality and structured data are durable — once fixed, they stay fixed. But AI agent ranking signals evolve fast (ChatGPT and Perplexity both updated their shopping algorithms twice in 2025). Running a rescan every 90 days catches regressions before they cost you traffic.
Is Shopify's built-in Google Sales Channel sufficient?
For Google Shopping paid ads: yes, it handles the Merchant Center sync. For AI agents: no — it doesn't audit structured data, description richness, GTIN coverage per variant, or review signal accessibility, which are the signals that determine AI shopping visibility.
Run a free AI-readiness audit on your Shopify store
2-minute scan across 18 signals. No app install required.
Scan your store free →