Skip to content

Cluster 2E · the crawler layer

Which AI crawlers visit Shopify stores in 2026

Shopify's default robots.txt allows AI crawlers — verbatim from Shopify: 'Your store can be indexed by search engines and large language models (LLMs) without signatures.'1 The risk isn't Shopify's defaults. It's the 2024-era 'block AI bots' templates merchants pasted into robots.txt.liquid that now make their store invisible to ChatGPT, Claude, Perplexity, and Gemini.

Published

Shopify's default robots.txt allows AI

Shopify's Crawling your store help page says it verbatim: 'Your store can be indexed by search engines and large language models (LLMs) without signatures.' That means a fresh Shopify store, with no robots.txt.liquid file added to the theme, is reachable by GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, Claude-User, and Claude-SearchBot. The Web Bot Auth signatures Shopify documents are for first-party audit tools — not gates for the public bots.

The Crawling your store page1 is the canonical Shopify document for this. Shopify Web Bot Auth covers HTTP message signatures for merchant-authorised first-party crawlers (accessibility audits, SEO audits, automated testing) — not the public AI crawlers. Public AI crawlers reach the store the same way any other crawler does: respecting the published robots.txt, following links, fetching pages.

The default state

Allowed

default state for AI crawlers on a fresh Shopify store — per Shopify's Crawling your store doc.

Shopify Help · 2026-05-22
3 months

maximum expiration for Web Bot Auth signatures; non-renewable, for first-party audits only.

Shopify Help · 2026-05-22
0

AI crawlers Shopify blocks by default — the editable file is robots.txt.liquid, and it ships absent.

Shopify Dev · 2026-05-22

The AI crawlers that matter on Shopify in 2026

Eight AI crawlers shape what AI engines see on a Shopify store. OpenAI runs three: GPTBot (training), OAI-SearchBot (citations inside ChatGPT search), and ChatGPT-User (on-demand fetch when a user references the URL). Anthropic runs three: ClaudeBot (training), Claude-User (live retrieval), and Claude-SearchBot (Claude.ai search). Perplexity runs PerplexityBot. Google runs Google-Extended for Gemini training opt-out — distinct from Googlebot and AI Overviews.

The bot-by-bot references live in each platform's own documentation78910. The articles below cover each crawler separately with the robots.txt.liquid rules and the Shopify-specific gotchas. The most important distinction for ranking: training bots (GPTBot, ClaudeBot, Google-Extended) influence the next-version model; retrieval bots (OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, PerplexityBot) influence today's answer.

Editing robots.txt.liquid on Shopify

The editable file is robots.txt.liquid, created in the theme's templates folder via the code editor and served at the store root. Customizations supported: allow/disallow specific URLs, add crawl-delay rules, add sitemap URLs, block specific crawlers. The file is not included in themes by default — merchants add it manually. Changes take effect instantly; crawlers may lag. ThemeKit preserves the file; admin theme uploads do not.

Shopify's edit-robots help page2 carries two warnings worth repeating: "Shopify Support can't help with edits to the robots.txt.liquid file" and "Incorrect use of the feature can result in loss of all traffic." The dev docs3 add the strong recommendation to "use the provided Liquid objects whenever possible" because "the default rules are updated regularly to ensure that SEO best practices are always applied" — pasting a plain-text robots.txt is what disables future updates.

liquid The Shopify-default robots.txt.liquid template body (verbatim from shopify.dev)
{% for group in robots.default_groups %}
  {{- group.user_agent -}}
  {% for rule in group.rules %}
    {{- rule -}}
  {% endfor %}
  {%- if group.sitemap != blank -%}
    {{ group.sitemap }}
  {%- endif -%}
{% endfor %}

The 2024 'block AI bots' trap

In 2023-2024, agency content widely recommended blocking GPTBot, ClaudeBot, and other AI crawlers via robots.txt.liquid to stop 'AI from scraping your store.' Merchants who applied those templates in 2024 are now invisible to ChatGPT, Claude, Perplexity, and Gemini in 2026 — exactly when Shopify Catalog needs those crawlers to see the storefront. Audit your robots.txt.liquid before anything else.

The fix is mechanical: open the theme code editor, find robots.txt.liquid (it lives under templates/robots.txt.liquid), and check for any Disallow rules under GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, Claude-User, or Claude-SearchBot. Remove the rules for the retrieval bots at minimum (OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, PerplexityBot). Training-bot blocks (GPTBot, ClaudeBot, Google-Extended) are a separate, defensible choice — but they should be deliberate, not inherited from a 2024 template.

llms.txt on Shopify

The /llms.txt file is a markdown manifest at the site root that lists canonical pages an AI should read first. Spec by Jeremy Howard, 2024-09-03. Shopify's theme architecture doesn't serve root .txt files natively the way it serves robots.txt.liquid — there's no llms.txt.liquid template type. The install pattern on Shopify is either a metaobject-rendered page plus a URL redirect or a static asset served from the Shopify CDN. The full pattern lives in the llms.txt article.

The spec11 is short — one required H1, optional blockquote, optional H2 file lists. As of 2026-05-22, no major AI engine has publicly committed to reading third-party llms.txt files at answer time, though Anthropic and Perplexity both publish their own llms.txt for developer docs. Shopify's templates documentation12 does not list an llms.txt template type. The llms.txt article walks the install pattern and the honest current-state.

The 6 articles under this cluster

Six articles cover each crawler and the llms.txt question. Four bot-specific articles (GPTBot, ClaudeBot, PerplexityBot, Google-Extended), one llms.txt explainer, and the OAI-SearchBot vs GPTBot vs ChatGPT-User reference. Audit your robots.txt.liquid first, then come back here.

  1. REFERENCE GPTBot Shopify GPTBot on Shopify: what it indexes, how to allow it OpenAI's training crawler. What it sees, where Shopify documents it, and the robots.txt.liquid rules to allow or block it. 8-min read
  2. REFERENCE ClaudeBot Shopify ClaudeBot on Shopify Anthropic's training crawler plus Claude-User and Claude-SearchBot — what each one does and how to control access per bot on Shopify. 8-min read
  3. REFERENCE PerplexityBot Shopify PerplexityBot on Shopify Perplexity's crawler, the secondary Perplexity-User retrieval bot, and the robots.txt.liquid rules that govern both. 7-min read
  4. REFERENCE Google-Extended Shopify Google-Extended on Shopify The Google-Extended user-agent that controls AI-training opt-out for Gemini — distinct from Googlebot, AI Overviews, and AI Mode. 7-min read
  5. HOW-TO Shopify llms.txt Shopify and llms.txt: what's supported, what isn't The /llms.txt spec, Shopify's template system, and the real install pattern when Shopify doesn't natively serve a root .txt file. 10-min read
  6. REFERENCE OAI-SearchBot GPTBot ChatGPT-User OAI-SearchBot vs GPTBot vs ChatGPT-User on Shopify OpenAI runs three bots — training, search, and on-demand fetch. Get the wrong one wrong and you erase yourself from ChatGPT search. 9-min read