What GPTBot is and what it does on Shopify
GPTBot is OpenAI's training crawler. It collects data that may be used to train future versions of OpenAI's models. It is not the bot that powers ChatGPT search citations today — that's OAI-SearchBot. It is not the bot that fetches a URL when a user references it in ChatGPT — that's ChatGPT-User. Allowing GPTBot affects future model training; blocking it does not stop ChatGPT from quoting your store via the other two bots.
Training vs retrieval — the distinction
Training crawlers and tokens such as GPTBot, ClaudeBot, and Google-Extended influence the next generation of a model: the data they collect feeds the training set. (Google-Extended is a robots.txt control token rather than a separate crawler, but it serves the same opt-in or opt-out purpose.) Retrieval bots such as OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, and PerplexityBot influence today's AI answers: they fetch live pages during a conversation or build a live search index. For a Shopify merchant, retrieval bot access matters for current AI visibility; training bot access matters for visibility in future model versions.
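To make the split concrete, here is a minimal sketch that sorts a request's User-Agent string into the two groups named above. The function name `classify_bot`, the grouping sets, and the sample User-Agent strings are illustrative assumptions, not an official list from any vendor.

```python
# Hypothetical grouping, built only from the bot names mentioned in this section.
TRAINING_BOTS = {"GPTBot", "ClaudeBot", "Google-Extended"}
RETRIEVAL_BOTS = {
    "OAI-SearchBot",
    "ChatGPT-User",
    "Claude-User",
    "Claude-SearchBot",
    "PerplexityBot",
}


def classify_bot(user_agent: str) -> str:
    """Return 'training', 'retrieval', or 'unknown' for a User-Agent string."""
    ua = user_agent.lower()
    # Case-insensitive substring match on the bot token.
    if any(token.lower() in ua for token in RETRIEVAL_BOTS):
        return "retrieval"
    if any(token.lower() in ua for token in TRAINING_BOTS):
        return "training"
    return "unknown"


# Illustrative User-Agent strings (assumed shapes, not copied from server logs).
print(classify_bot("Mozilla/5.0 (compatible; GPTBot/1.1)"))
print(classify_bot("Mozilla/5.0 (compatible; OAI-SearchBot/1.0)"))
print(classify_bot("Mozilla/5.0 (Windows NT 10.0; Win64; x64)"))
```

A helper like this could be used to tally which kind of AI traffic appears in access logs before deciding what to allow. Note that Google-Extended would only ever appear as a robots.txt token, so it is unlikely to match a real request header.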
Allowing or blocking GPTBot on Shopify
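The control is the robots.txt.liquid theme template. To allow GPTBot, the rendered file should not contain a Disallow: / rule in a GPTBot user-agent group; if there is no GPTBot group at all, the crawler follows the wildcard (User-agent: *) rules. To block GPTBot entirely, add a GPTBot group with Disallow: /. To allow GPTBot only on specific paths, add a Disallow rule for each path you want to keep out and leave the rest unlisted. Shopify's editing-robots help page documents the syntax [2].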
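The rules above can be sanity-checked before publishing. The sketch below uses Python's standard `urllib.robotparser` to evaluate two candidate rule sets: a full block and a path-level block. The store domain, the example paths, and the helper name `is_allowed` are assumptions for illustration; on Shopify the rule text would be emitted by robots.txt.liquid. Python's parser is a simplified matcher, so treat the result as an approximation of how a real crawler reads the file.

```python
from urllib.robotparser import RobotFileParser

# Candidate rule set 1: block GPTBot everywhere, leave OAI-SearchBot open.
BLOCK_ALL = """\
User-agent: GPTBot
Disallow: /

User-agent: OAI-SearchBot
Allow: /
"""

# Candidate rule set 2: keep GPTBot out of one path only (hypothetical path).
BLOCK_BLOG_ONLY = """\
User-agent: GPTBot
Disallow: /blogs/
"""

# Hypothetical store domain, used only for the example URLs.
STORE = "https://example-store.myshopify.com"


def is_allowed(robots_txt: str, agent: str, url: str) -> bool:
    """Parse robots.txt text and report whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


print(is_allowed(BLOCK_ALL, "GPTBot", STORE + "/products/widget"))
print(is_allowed(BLOCK_ALL, "OAI-SearchBot", STORE + "/products/widget"))
print(is_allowed(BLOCK_BLOG_ONLY, "GPTBot", STORE + "/products/widget"))
print(is_allowed(BLOCK_BLOG_ONLY, "GPTBot", STORE + "/blogs/news/post"))
```

Checking OAI-SearchBot alongside GPTBot confirms that a training opt-out has not accidentally closed off the retrieval bots as well.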
GPTBot audit checklist
Five checks. The decision is more philosophical than technical: do you want OpenAI's next-generation models to know about your store, or not? Most stores do; some prestige brands intentionally opt out of the training corpus.