Shopify's default robots.txt allows AI crawlers — verbatim from Shopify: 'Your store can be indexed by search engines and large language models (LLMs) without signatures.'1 The risk isn't Shopify's defaults. It's the 2024-era 'block AI bots' templates merchants pasted into robots.txt.liquid that now make their store invisible to ChatGPT, Claude, Perplexity, and Gemini.
Published
§01Defaults
Shopify's default robots.txt allows AI
Shopify's Crawling your store help page says it verbatim: 'Your store can be indexed by search engines and large language models (LLMs) without signatures.' That means a fresh Shopify store, with no robots.txt.liquid file added to the theme, is reachable by GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, Claude-User, and Claude-SearchBot. The Web Bot Auth signatures Shopify documents are for first-party audit tools — not gates for the public bots.
The Crawling your store page1 is the canonical Shopify document for this. Shopify Web Bot Auth covers HTTP message signatures for merchant-authorised first-party crawlers (accessibility audits, SEO audits, automated testing) — not the public AI crawlers. Public AI crawlers reach the store the same way any other crawler does: respecting the published robots.txt, following links, fetching pages.
The default state
Allowed
default state for AI crawlers on a fresh Shopify store — per Shopify's Crawling your store doc.
Eight AI crawlers shape what AI engines see on a Shopify store. OpenAI runs three: GPTBot (training), OAI-SearchBot (citations inside ChatGPT search), and ChatGPT-User (on-demand fetch when a user references the URL). Anthropic runs three: ClaudeBot (training), Claude-User (live retrieval), and Claude-SearchBot (Claude.ai search). Perplexity runs PerplexityBot. Google runs Google-Extended for Gemini training opt-out — distinct from Googlebot and AI Overviews.
The bot-by-bot references live in each platform's own documentation78910. The articles below cover each crawler separately with the robots.txt.liquid rules and the Shopify-specific gotchas. The most important distinction for ranking: training bots (GPTBot, ClaudeBot, Google-Extended) influence the next-version model; retrieval bots (OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, PerplexityBot) influence today's answer.
§03Editing
Editing robots.txt.liquid on Shopify
The editable file is robots.txt.liquid, created in the theme's templates folder via the code editor and served at the store root. Customizations supported: allow/disallow specific URLs, add crawl-delay rules, add sitemap URLs, block specific crawlers. The file is not included in themes by default — merchants add it manually. Changes take effect instantly; crawlers may lag. ThemeKit preserves the file; admin theme uploads do not.
Shopify's edit-robots help page2 carries two warnings worth repeating: "Shopify Support can't help with edits to the robots.txt.liquid file" and "Incorrect use of the feature can result in loss of all traffic." The dev docs3 add the strong recommendation to "use the provided Liquid objects whenever possible" because "the default rules are updated regularly to ensure that SEO best practices are always applied" — pasting a plain-text robots.txt is what disables future updates.
liquidThe Shopify-default robots.txt.liquid template body (verbatim from shopify.dev)
{% for group in robots.default_groups %}
{{- group.user_agent -}}
{% for rule in group.rules %}
{{- rule -}}
{% endfor %}
{%- if group.sitemap != blank -%}
{{ group.sitemap }}
{%- endif -%}
{% endfor %}
§04The trap
The 2024 'block AI bots' trap
In 2023-2024, agency content widely recommended blocking GPTBot, ClaudeBot, and other AI crawlers via robots.txt.liquid to stop 'AI from scraping your store.' Merchants who applied those templates in 2024 are now invisible to ChatGPT, Claude, Perplexity, and Gemini in 2026 — exactly when Shopify Catalog needs those crawlers to see the storefront. Audit your robots.txt.liquid before anything else.
The fix is mechanical: open the theme code editor, find robots.txt.liquid (it lives under templates/robots.txt.liquid), and check for any Disallow rules under GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, Claude-User, or Claude-SearchBot. Remove the rules for the retrieval bots at minimum (OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, PerplexityBot). Training-bot blocks (GPTBot, ClaudeBot, Google-Extended) are a separate, defensible choice — but they should be deliberate, not inherited from a 2024 template.
§05llms.txt
llms.txt on Shopify
The /llms.txt file is a markdown manifest at the site root that lists canonical pages an AI should read first. Spec by Jeremy Howard, 2024-09-03. Shopify's theme architecture doesn't serve root .txt files natively the way it serves robots.txt.liquid — there's no llms.txt.liquid template type. The install pattern on Shopify is either a metaobject-rendered page plus a URL redirect or a static asset served from the Shopify CDN. The full pattern lives in the llms.txt article.
The spec11 is short — one required H1, optional blockquote, optional H2 file lists. As of 2026-05-22, no major AI engine has publicly committed to reading third-party llms.txt files at answer time, though Anthropic and Perplexity both publish their own llms.txt for developer docs. Shopify's templates documentation12 does not list an llms.txt template type. The llms.txt article walks the install pattern and the honest current-state.
§06Leaves
The 6 articles under this cluster
Six articles cover each crawler and the llms.txt question. Four bot-specific articles (GPTBot, ClaudeBot, PerplexityBot, Google-Extended), one llms.txt explainer, and the OAI-SearchBot vs GPTBot vs ChatGPT-User reference. Audit your robots.txt.liquid first, then come back here.