HomeBlog › Shopify metafields for AI shopping agents

Shopify metafields for AI shopping agents: which 8 actually move the score

Most Shopify stores have hundreds of metafields. AI shopping agents read maybe eight of them. This is the practical guide — which namespaces ChatGPT Shopping, Perplexity Shopping, and Google AI Mode actually parse, the eight metafields that move the CatalogScan score, what we keep finding broken in the wild, and how to fix coverage in an afternoon without a developer.

Published 2026-04-30 · ~10 min read · By the CatalogScan team

TL;DR: Two namespaces matter — Shopify's Standard Product Taxonomy (shopify.*) and the legacy Google Shopping namespace (mm-google-shopping.*). Eight metafields between them carry almost all the weight: product category, brand, GTIN, MPN, color, size, material, and condition. Custom metafields you invented are invisible to AI agents until you wire them into Product JSON-LD yourself.

What a metafield actually is

A Shopify metafield is a typed key-value pair attached to a Shopify object — a product, variant, customer, order, collection, or any other resource the API exposes. Each one is namespaced: a dotted prefix that tells Shopify which app, integration, or owner controls the field.

You'll see four categories of metafield in any working Shopify store:

This article only cares about the first two. If a metafield isn't in the shopify.* or mm-google-shopping.* namespace, it's not moving the AI-shopping needle without a custom integration.

The only two namespaces AI agents read

There are exactly two reasons a metafield ends up in front of ChatGPT Shopping, Perplexity Shopping, or Google AI Mode:

1. Shopify Global Catalog ingestion

Since Agentic Storefronts went on by default on March 24, 2026, Shopify's Global Catalog is the federated product index that ChatGPT Shopping queries first. The Global Catalog is built from specific fields on each product — title, description, price, availability, and a fixed list of taxonomy attributes pulled from the shopify.* namespace. Custom metafields are not ingested. The agent never sees them.

2. Google Merchant Center feed

Stores that have the Google & YouTube channel installed get an auto-generated Merchant Center feed. Google's required and recommended attributes (gtin, mpn, brand, google_product_category, condition, gender, age_group, color, size, material) come from a mix of native Shopify fields and the mm-google-shopping.* metafields the channel writes. That feed flows directly into Google Shopping, into Google AI Mode shopping recommendations, and indirectly into ChatGPT Shopping when ChatGPT cross-references Google's data layer.

If you want a metafield to influence which agent picks your variant, it has to live in one of those two namespaces — or you need to manually emit it into Product JSON-LD on the page (covered in the custom metafields section below).

Heads up: Shopify is deprecating the mm-google-shopping namespace in favor of the standardized shopify.* taxonomy fields. New stores added in 2026 are getting the shopify.* path by default and the Google channel is starting to read from there. If you have both populated, that's fine — the Global Catalog and the Google feed both reconcile cleanly. If you have only one, prefer shopify.* going forward.

The eight metafields that move the score

Out of dozens of standardized metafields across the two AI-relevant namespaces, eight carry almost all the weight in our scoring model. Each one maps to a real failure mode we see in the scan data.

#MetafieldWhat it doesScore impact
1 shopify.product-category
(or mm-google-shopping.google_product_category)
Maps your product to the Google Product Taxonomy (5,000+ categories). Decides which agent queries you're eligible to answer at all. +9 pts
2 Variant barcode field
(GTIN; not technically a metafield, often confused for one)
Cross-retailer identity match for "where to buy X" queries. Covered in detail in the GTIN guide. +10 pts
3 mm-google-shopping.mpn Manufacturer Part Number. Fallback identity for products without a GTIN. Brand + MPN can uniquely identify a SKU when GTIN is missing. +5 pts
4 Brand (Vendor field, ideally exposed as a nested {"@type":"Brand"} in Product JSON-LD) Tells the agent which store is the "official" seller of a branded product vs. a marketplace reseller. See signal 8 in the signals reference. +8 pts
5 shopify.color-pattern Standardized color from Shopify's taxonomy (90+ values). Used by agents for "show me a navy blue X" filter queries. +3 pts
6 shopify.size Standardized size value (e.g. EU 42, Medium, King). Used for agent fit/availability queries. +3 pts
7 shopify.material or shopify.fabric Composition. Drives "show me 100% cotton X" or "leather X" queries. Highest-leverage for apparel, furniture, and bag categories. +4 pts
8 mm-google-shopping.condition One of new, refurbished, used. Defaults to new if blank. Critical for refurbished/secondhand/refilled stores; ignoring this gets refurbished products dropped from "new" feeds. +2 pts

The first four (category, GTIN, MPN, brand) decide whether the agent can identify your product at all. The last four (color, size, material, condition) decide whether you're eligible for filter and refinement queries — the long tail of "show me a black leather messenger bag in size large" intents that AI agents excel at because they can decompose a sentence into structured filters in a way Google Shopping never could.

For a deeper breakdown of how each of these signals is weighted alongside the other ranking-spread fields, see the 15-signal reference page.

What we found scanning 100 stores

Here's the metafield slice from our scan of the top 100 DTC Shopify stores in April 2026. The full leaderboard is at /scans/top-100/.

28%
had a Google Product Category set on every product
63%
emit ProductGroup JSON-LD with the wrong shape
11%
populate shopify.color-pattern on color variants
17 pts
average gap, full taxonomy vs. zero coverage

The 28% with full category coverage were almost all stores running the Google & YouTube channel app — it sets mm-google-shopping.google_product_category as a side effect of the channel onboarding. Stores not on Google Shopping had a 4% category-coverage rate. That's the fastest-to-fix gap in the whole dataset: install the Google channel, accept the auto-categorizations, immediately move from 4% to ~85%, then hand-correct the ~15% the auto-classifier got wrong. About an hour of work for a 9-point lift on the CatalogScan score.

The 11% color-pattern figure was the most surprising. Most stores have color variants, and most stores set the option name to "Color" — but that's the variant option, not the standardized metafield. The Shopify admin asks you to fill in shopify.color-pattern as a separate step under "Product attributes," and almost no one does. Same gap exists for size: option says "Medium," but shopify.size is empty. Filter queries collapse on stores with this gap.

The 17-point score gap between "full taxonomy" and "zero coverage" was second only to GTIN coverage in single-signal score variance. If a store is in the 60-80 score band, fixing taxonomy coverage is the second-highest-leverage move after fixing GTIN coverage.

How to audit your metafields in five minutes

Three increasingly thorough ways to check.

Free path (90 seconds)

Run the free CatalogScan on your domain. Signals 5 (Google Product Category), 6 (GTIN), 8 (Brand entity), and 11 (variant attributes) all surface metafield gaps directly. The report shows percentage coverage per signal and ranks the highest-leverage fix first.

DIY path (Shopify admin)

Open any product in Shopify Admin. Scroll past the title, description, and media to the Product attributes section. You'll see a card with the standard taxonomy metafields for that product's category — color, size, material, age group, gender. Empty fields are unset.

To check coverage at scale: Settings → Custom data → Products. The page lists every metafield definition on your store. The standard ones from the shopify.* namespace are pinned at the top. Each row shows "Used in N products" — coverage in absolute terms, not percentage, but the math is easy.

API path (audit at variant level)

If you have catalog of more than a few hundred products, querying the Storefront API is faster than clicking through admin pages:

{
  products(first: 250) {
    edges {
      node {
        id
        title
        productType
        vendor
        category { id name }
        metafields(identifiers: [
          { namespace: "shopify",            key: "color-pattern" },
          { namespace: "shopify",            key: "size" },
          { namespace: "shopify",            key: "material" },
          { namespace: "mm-google-shopping", key: "google_product_category" },
          { namespace: "mm-google-shopping", key: "mpn" },
          { namespace: "mm-google-shopping", key: "condition" }
        ]) { value namespace key }
      }
    }
  }
}

Page through the catalog, count nulls, and you have a per-metafield coverage report in 30 seconds. This fits a CI job nicely if you want to alert when coverage regresses on a release.

How to fix coverage (in cost order)

FixWhen to useCost
Install the Google & YouTube channel Quickest path to category + condition coverage. The channel auto-classifies every product against the Google taxonomy and writes mm-google-shopping.* metafields as a side effect. Reviewable in admin before the feed goes live. $0
Accept Shopify's category suggestions For stores not on Google Shopping. Shopify admin shows suggested values for every shopify.* metafield based on the product title, description, and option names. Accepting in bulk via the bulk editor takes ~15 minutes per 200 products. $0
CSV bulk import from a spreadsheet Stores with deep variant trees where the bulk editor times out. Export products, fill metafield columns in Excel, reimport. Shopify accepts shopify.color-pattern, shopify.size, etc. as named columns directly. $0 + time
Bulk auto-fill via Shopify Admin API Stores with 1,000+ SKUs where manual review isn't viable. CatalogScan Pro generates and applies suggestions with per-product diff approval, weighted by which metafield gives the largest score lift. See pricing. $49/mo
Don't: bulk-set shopify.color-pattern = "Multi" across the whole catalog to game the coverage number. Agents penalize "Multi" as low-confidence and the standardized color taxonomy already accommodates patterns (Striped, Floral, Geometric). Use a real value or leave it null — partial coverage with high accuracy beats full coverage with junk values.

Six mistakes we keep finding

  1. Variant option name set, standardized metafield empty. Most common failure. The variant has option Color: Navy, but shopify.color-pattern is null. Agents read the metafield, not the option name.
  2. Free-text values instead of taxonomy values. shopify.color-pattern = "Ocean Vibes" instead of Blue. The Standard Product Taxonomy is a closed enum — anything outside the list collapses to "unrecognized" in the Global Catalog.
  3. Old-namespace and new-namespace fighting. Some stores have mm-google-shopping.color = "Navy" and shopify.color-pattern = "Blue" on the same variant. The Global Catalog picks the standardized one; the Google feed picks the legacy one. The two product cards now disagree across surfaces. Pick one canonical source per field and delete the duplicate.
  4. Brand as a string instead of a nested entity. Theme emits "brand": "Allbirds" instead of {"@type": "Brand", "name": "Allbirds"}. Half-credit on the JSON-LD parse. One commit to product.liquid fixes it; same data either way.
  5. Custom metafields the team thinks are "structured data." A merchant writes custom.fit_guide, custom.recyclable, custom.lab_tested and assumes AI agents pick them up. They don't — those are storefront-only unless explicitly emitted into Product JSON-LD via a theme edit.
  6. Setting condition = "new" on refurbished or secondhand products. Often left as the default by accident on a refurb-only store. Google's policy is to disapprove the listing if the actual product condition doesn't match the feed. Disapproval cascades the same way GTIN disapproval does — invisible to agents.

What about custom metafields?

Custom metafields are useful — for storefront features. They're how you build size guides, ingredient lists, certifications, fit charts, materials breakdowns, lab results, and anything else outside the Standard Product Taxonomy. You should still use them.

The catch is that AI agents won't read them unless you do one of two things:

  1. Emit them into Product JSON-LD. Edit your product.liquid (or equivalent on Hydrogen) so the JSON-LD additionalProperty array includes the custom metafield as a PropertyValue object: {"@type": "PropertyValue", "name": "Recyclable", "value": "Yes"}. Agents that parse PDP JSON-LD pick this up.
  2. Map them to a Standard Product Taxonomy attribute. If your custom.fabric field already holds "100% Cotton", just copy that value into shopify.fabric. Same data, now in a namespace agents read.

The first option gives you flexibility — any custom field can be exposed. The second gives you confidence — Standard Taxonomy values are guaranteed-readable by every agent. We recommend the second whenever a standard field exists for what you're describing, and the first only for genuinely store-specific data (your loyalty tier, your warranty terms, your shop policy).

What good coverage looks like

The CatalogScan score gives full credit on the metafield-related signals when:

For a store starting from near-zero coverage, the realistic 30-day target is: install the Google channel (1 hour), accept Shopify's bulk taxonomy suggestions (1 hour per 200 products), audit and patch the auto-classifications (~30 minutes), and write a one-time CSV reimport for color and size on color/size variants. Two focused afternoons, no developer time, no new tooling. We've seen stores move from 41% to 89% taxonomy coverage in two weeks doing exactly this — and lift their CatalogScan score by 12-15 points.

The agentic-storefronts shift means catalog hygiene now compounds the way SEO did fifteen years ago. Stores that fix taxonomy now compound through every new AI shopping query for the rest of their lifetime. Stores that don't, don't. The eight metafields above are most of the lift; the rest is in the agentic storefronts checklist.

Related on CatalogScan

See your store's metafield coverage in 90 seconds

Free scan. No login. Per-signal verdicts and the top-5 fixes ranked by score lift.

Run a free scan See the top-100 leaderboard