The default: AI bots crawl unless you block them
Shopify's default robots.txt does not block AI training bots. GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and similar user-agents are allowed by default the same way bingbot and Googlebot are. The platform's stance is opt-out, not opt-in. To block any AI bot, the merchant must create robots.txt.liquid and add an explicit per-user-agent Disallow rule. This is consistent with Shopify's broader 'platform handles the safe defaults, merchant handles the strategic decisions' philosophy.
The four AI crawler user-agents that matter in 2026
Four user-agents cover ~90% of AI crawler traffic on Shopify stores. (1) GPTBot — OpenAI's training crawler. Distinct from OAI-SearchBot (which powers ChatGPT search) and ChatGPT-User (the on-demand fetcher when a user clicks a citation). Blocking GPTBot does not block ChatGPT's ability to cite your store via OAI-SearchBot. (2) ClaudeBot — Anthropic's web crawler used for training. Honours robots.txt. (3) PerplexityBot — Perplexity's citation crawler. Used to discover and index pages cited in Perplexity's answers. (4) Google-Extended — Google's user-agent string for opting out of AI training (Bard, Vertex AI) without affecting Search indexing. This is the cleanest 'have it both ways' option in the matrix.
The decision matrix — block, allow, or selective
Three positions cover most stores. (1) Allow all — the default. Best for stores chasing AI shopping visibility (most Shopify stores in 2026). Catalog eligibility plus AI bot access maximizes the chance products surface in ChatGPT, Perplexity, Gemini, and Copilot. (2) Block training, allow shopping — block GPTBot, ClaudeBot, Google-Extended (training-focused user-agents) while leaving OAI-SearchBot, Googlebot, and PerplexityBot (citation-focused) allowed. Useful for brands sensitive to AI training but still wanting AI shopping visibility. (3) Block all — block every named AI user-agent. Almost never the right call on an ecommerce Shopify store; sometimes justified on editorial brands with proprietary content.
How to add per-bot rules in robots.txt.liquid
Add per-bot Disallow rules below Shopify's default-groups loop in robots.txt.liquid. Never replace the default loop — that disconnects the store from Shopify's automatic default-rule updates. The pattern: keep the default loop intact, then append User-agent / Disallow blocks for each bot you want to block. Validate the rendered /robots.txt in a private browser window before pushing to live.
The AI shopping cost of blocking
Blocking AI bots has a real, asymmetric cost in 2026. Shopify's Catalog feed reaches AI channels via two paths: (1) direct API integration (ChatGPT shopping, Perplexity shopping, Gemini, Copilot, Shop) and (2) the AI bots' independent crawl of your storefront for context enrichment. Blocking the bots breaks path (2) but not path (1) — your products still appear in AI shopping recommendations if eligible, but the AI engines have less context (no policy data, no editorial copy, no FAQ pages) to reason about them. The result: fewer citations, weaker product summaries, and reduced placement in conversational shopping flows.