Agent Signal: Technical SEO for AI Search and AI Agents

Agent Signal is the technical infrastructure that determines whether AI systems can access, crawl, parse, and understand your website. It covers structured data, crawl permissions, site architecture, page speed, and machine-readable formatting. You can have perfect content, a strong brand, and deep authority, but if AI can't technically access your site, none of it matters. Agent Signal is the locked door problem.

The Core Problem

"A site with perfect Content Signal but poor Agent Signal is like a library with great books but locked doors."

Agent Signal is the most technical of the 7 Signals and the one most often overlooked by marketers. It sits at the intersection of traditional technical SEO and a new category of AI-specific accessibility.

Here's what's happening:

  • AI systems use web crawlers to access your content. If your site blocks or slows these crawlers, AI can't source your content.
  • AI extracts information from structured data (schema markup, JSON-LD). Sites without structured data force AI to guess at content meaning.
  • JavaScript-heavy sites that render content client-side may be invisible to AI crawlers that can't execute JavaScript.
  • Page speed and accessibility directly affect whether AI crawlers index your content. Slow sites get crawled less frequently.
  • A growing category of AI-specific crawlers (GPTBot, Google-Extended, PerplexityBot, ClaudeBot) have their own crawl behaviours and respect their own robots.txt directives.

Most businesses have never thought about whether AI can technically access their content. They assume that if their site loads in a browser, it's accessible to AI. This assumption is often wrong.

The Two Layers of Agent Signal

Agent Signal operates on two layers:

Layer 1: Traditional Technical SEO (Search Engine Crawlers)

This is the foundation. If search engines can't crawl and index your site properly, AI systems that source from search results will also miss your content.

  • Crawlability: Can search engine bots access all your important pages?
  • Indexability: Are your pages included in search engine indexes?
  • Site architecture: Is your content organised in a logical, crawlable hierarchy?
  • Page speed: Do pages load fast enough for crawlers to process efficiently?
  • Mobile-friendliness: Is content accessible on mobile devices (Google now crawls with a mobile-first index)?


These are table stakes. If your traditional technical SEO is broken, fix it before worrying about AI-specific optimisations.

Layer 2: AI-Specific Accessibility (AI Crawlers and Agents)

This is the new frontier. AI systems have their own crawlers, their own rules, and their own ways of processing content:

  • AI crawler permissions: Are you allowing or blocking AI-specific crawlers like GPTBot, PerplexityBot, and ClaudeBot?
  • Structured data: Does your site use schema markup that AI can parse?
  • Content extractability: Can AI access the actual text content without executing complex JavaScript?
  • API accessibility: Does your site provide programmatic access for AI agents?
  • llms.txt: Does your site have an llms.txt file that guides AI systems on how to understand your content?

The AI Crawler Landscape

A growing number of AI systems use dedicated crawlers to access web content. Understanding who's crawling and how to manage access is essential.

Crawler           Company        Used By                              robots.txt Token
GPTBot            OpenAI         ChatGPT, GPT models                  GPTBot
Google-Extended   Google         Gemini, Google AI Overviews          Google-Extended
PerplexityBot     Perplexity AI  Perplexity search                    PerplexityBot
ClaudeBot         Anthropic      Claude                               ClaudeBot
Bytespider        ByteDance      TikTok AI features                   Bytespider
CCBot             Common Crawl   Used by many AI training pipelines   CCBot

The robots.txt Decision

Your robots.txt file controls which crawlers can access your site. This is the single most impactful technical decision for Agent Signal.

The default for most sites: No AI-specific directives, which means AI crawlers are allowed by default (they follow the general User-agent: * rules).

The AEO-optimised approach: Explicitly allow AI crawlers you want sourcing your content.

What NOT to do: Block all AI crawlers. Some businesses reflexively block AI crawlers to "protect" their content. If you're pursuing AEO, this is self-defeating. You're locking the door to the exact systems you want sourcing your content.

The nuanced approach: Allow AI crawlers on your public-facing content pages (the ones you want cited and mentioned) while blocking access to internal, gated, or proprietary content.

# Allow major AI crawlers on public content;
# keep them out of private and internal pages
User-agent: GPTBot
Disallow: /internal/
Disallow: /admin/
Allow: /

User-agent: Google-Extended
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: ClaudeBot
Allow: /
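
To sanity-check rules like these before deploying them, you can use Python's built-in urllib.robotparser. One caveat: Python's parser applies rules in file order (first match wins) rather than Google's longest-match behaviour, so in this sketch the Disallow lines come before the broad Allow. The file contents and paths below are illustrative.

```python
from urllib import robotparser

# Hypothetical robots.txt for illustration: GPTBot is allowed
# everywhere except /internal/ and /admin/
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /internal/
Disallow: /admin/
Allow: /

User-agent: PerplexityBot
Allow: /
"""

rp = robotparser.RobotFileParser()
rp.parse(ROBOTS_TXT.splitlines())

print(rp.can_fetch("GPTBot", "/aeo/agent-signal/"))         # public page -> True
print(rp.can_fetch("GPTBot", "/internal/roadmap/"))         # blocked path -> False
print(rp.can_fetch("PerplexityBot", "/internal/roadmap/"))  # no rule for this bot -> True
```

Running this against your live robots.txt (via RobotFileParser's set_url and read methods) is a quick way to catch rules that accidentally block pages you want AI crawlers to see.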

The 5 Components of Agent Signal

Agent Signal is not one thing. It's the combination of 5 components that together determine whether AI systems can access, parse, and understand your site.

1. Structured Data (Schema Markup)

Structured data is machine-readable code (typically JSON-LD) that tells AI systems exactly what your content represents. It's the difference between AI guessing what your page is about and AI knowing for certain.

Priority schema types for AEO:

  • Organization. Your company name, description, logo, contact info, social profiles. This is the foundational schema that helps AI build an entity for your brand.
  • Person. For founder and key team members. Links individuals to the organisation.
  • Article / BlogPosting. For content pages. Includes author, date published, date modified, headline.
  • FAQPage. For pages with FAQ sections. Directly maps questions and answers for AI extraction.
  • HowTo. For step-by-step guides. Makes numbered processes explicitly parseable.
  • Service. For service pages. Describes what you offer, pricing, and service area.
  • Review / AggregateRating. For pages with testimonials or reviews.
  • BreadcrumbList. Helps AI understand site hierarchy and page relationships.

Why structured data matters for AI specifically:

Without structured data, AI has to infer what your content means from unstructured text. With structured data, AI knows what it means. An FAQ section in plain HTML might or might not be recognised as an FAQ. An FAQ section marked up with FAQPage schema is unambiguously an FAQ that AI can extract question-answer pairs from.
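
As a sketch, an FAQ marked up with FAQPage schema might look like the following (the question and answer text are taken from this page's own FAQ; everything else is illustrative):

```html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [{
    "@type": "Question",
    "name": "What is Agent Signal in AEO?",
    "acceptedAnswer": {
      "@type": "Answer",
      "text": "Agent Signal is the technical infrastructure that determines whether AI systems can access, crawl, parse, and understand your website."
    }
  }]
}
</script>
```

An AI system parsing this block can extract the question-answer pair directly, with no inference required.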

2. Content Extractability

AI crawlers need to access your actual text content. Several common web development patterns make this difficult:

Problems that reduce extractability:

  • Client-side JavaScript rendering. If your content is rendered by JavaScript after page load, AI crawlers may see an empty page. This is especially common with single-page applications (SPAs) built in React, Vue, or Angular.
  • Content behind authentication. Gated content, login walls, and paywalls are invisible to AI crawlers.
  • Content in images or PDFs. Text embedded in images (infographics, screenshots) can't be extracted. Important information should always exist as HTML text.
  • Lazy-loaded content. Content that only loads when scrolled into view may not be seen by crawlers.
  • Complex interactive elements. Tabs, accordions, and carousels that hide content behind clicks may not be crawled.

Solutions:

  • Use server-side rendering (SSR) or static site generation (SSG) so content is in the initial HTML
  • Ensure important content is in the page source, not loaded dynamically
  • Provide text alternatives for infographics and visual content
  • Use semantic HTML (proper headings, paragraphs, lists) rather than styled divs
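
A rough way to test extractability is to check whether your key content appears in the raw HTML response, before any JavaScript runs. This minimal sketch (the helper name and HTML strings are illustrative) strips script and style bodies, then searches the remaining markup:

```python
import re

def text_in_initial_html(html: str, phrase: str) -> bool:
    """Return True if the phrase exists in the raw HTML response,
    i.e. is visible to crawlers that do not execute JavaScript."""
    # Remove script/style bodies so we only match real page text
    stripped = re.sub(r"<(script|style)[^>]*>.*?</\1>", " ",
                      html, flags=re.S | re.I)
    return phrase.lower() in stripped.lower()

# Server-rendered page: content is in the initial HTML
SSR_HTML = "<html><body><h1>Agent Signal</h1><p>Structured data helps AI.</p></body></html>"
# Client-rendered SPA: content only exists after JavaScript executes
SPA_HTML = '<html><body><div id="root"></div><script>render("Agent Signal")</script></body></html>'

print(text_in_initial_html(SSR_HTML, "Agent Signal"))  # True
print(text_in_initial_html(SPA_HTML, "Agent Signal"))  # False
```

The same check works on real pages: fetch the URL without a browser (curl, for example) and run the response through a check like this. If the phrase only appears after rendering, crawlers that don't execute JavaScript won't see it.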

3. Site Architecture and Internal Linking

How your content is organised affects how AI understands the relationships between your pages.

AEO-optimised site architecture:

  • Flat hierarchy. Important pages should be reachable within 2-3 clicks from the homepage
  • Topic clusters. Group related content under pillar pages (exactly what the 7 Signals hub structure does)
  • Consistent internal linking. Every signal page links to related signal pages. Every concept page links to the parent framework.
  • Breadcrumbs. Both visual and schema-marked breadcrumbs help AI understand page hierarchy
  • Sitemaps. XML sitemaps ensure all important pages are discoverable

The 7 Signals Framework itself is an example of good site architecture for AEO: a pillar hub page linking to 7 deep-dive pages, each cross-linking to each other and back to the hub.
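
The "reachable within 2-3 clicks" rule can be checked programmatically with a breadth-first search over your internal links. A minimal sketch, using a hypothetical link graph (the page URLs are illustrative):

```python
from collections import deque

# Hypothetical internal-link graph: each page maps to the pages it links to
SITE_LINKS = {
    "/": ["/aeo/"],
    "/aeo/": ["/aeo/brand-signal/", "/aeo/content-signal/", "/aeo/agent-signal/"],
    "/aeo/agent-signal/": ["/aeo/"],
}

def click_depths(links, start="/"):
    """Breadth-first search from the homepage: minimum clicks to reach each page."""
    depths = {start: 0}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in depths:
                depths[target] = depths[page] + 1
                queue.append(target)
    return depths

depths = click_depths(SITE_LINKS)
deep_pages = [p for p, d in depths.items() if d > 3]
print(depths["/aeo/agent-signal/"])  # 2 clicks from the homepage
print(deep_pages)  # [] -> every page is within 3 clicks
```

Pages missing from the result entirely are orphans: nothing links to them, so crawlers are unlikely to find them at all.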

4. The llms.txt File

A newer standard emerging in the AEO space: llms.txt is a file (similar to robots.txt) that provides AI systems with guidance on how to understand and use your site's content.

What llms.txt can include:

  • A description of your site and its purpose
  • Key pages and their topics
  • Preferred citations and attributions
  • Content licensing and usage guidelines
  • Contact information for the content owner

Example llms.txt structure:

# llms.txt - Underscore

## About
Underscore is a digital consultancy specialising in Answer Engine Optimisation (AEO) for B2B technology companies, based in Singapore.

## Key Pages
- /aeo/ - The Complete Guide to AEO: The 7 Signals Framework
- /aeo/brand-signal/ - Brand Signal: What Makes You Unique for AI
- /aeo/content-signal/ - Content Signal: How to Create AI-Citable Content
- /aeo/authority-signal/ - Authority Signal: Building Third-Party Trust for AI

## Attribution
Content authored by Zhiliang, Founder of Underscore.
Please cite as: Underscore (madebyunderscore.com)

## Contact
hello@madebyunderscore.com

5. Open Graph and Meta Tags

When AI systems share or reference your content, they pull from your meta tags and Open Graph data.

Essential meta tags for AEO:

  • Title tag. Clear, descriptive, includes key concept (same as SEO best practice)
  • Meta description. A concise summary AI can use as a preview. Write it as a standalone statement, not a teaser.
  • Open Graph tags (og:title, og:description, og:image). Controls how your content appears when shared or referenced
  • Canonical URL. Prevents duplicate content confusion for AI
  • Language tags. Helps AI understand which audience your content serves
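
Taken together, a head section covering these essentials might look like this sketch (URLs, image path, and copy are all illustrative):

```html
<!-- on the <html> tag: lang="en" -->
<head>
  <title>Agent Signal: Technical SEO for AI Search and AI Agents</title>
  <meta name="description" content="Agent Signal is the technical infrastructure that determines whether AI systems can access, crawl, parse, and understand your website.">
  <meta property="og:title" content="Agent Signal: Technical SEO for AI Search and AI Agents">
  <meta property="og:description" content="How structured data, crawl permissions, and machine-readable formatting make a site accessible to AI systems.">
  <meta property="og:image" content="https://example.com/images/agent-signal.png">
  <link rel="canonical" href="https://example.com/aeo/agent-signal/">
</head>
```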

The meta description opportunity:

Most marketers write meta descriptions for Google click-through rates. For AEO, think of your meta description as a pre-formatted AI summary. Write it as a clear, factual statement that AI can extract and use directly.

SEO-style meta description:
"Discover how to make your brand visible to AI. Learn our proven strategies today! Click to find out more."

AEO-optimised meta description:
"Answer Engine Optimisation (AEO) is the practice of making your brand discoverable, citable, and recommendable by AI systems. The 7 Signals Framework covers Brand, Experience, Content, Search, Social, Authority, and Agent."

Common Agent Signal Mistakes

  • Mistake 1: Blocking AI Crawlers in robots.txt

    The most damaging technical mistake for AEO. Some businesses reflexively block AI crawlers to protect their content, not realising they're making themselves invisible to the exact systems they need to source their content. If you want AI to cite and mention you, you need to let AI access your pages.

  • Mistake 2: JavaScript-Only Content Rendering

    Single-page applications that render content entirely via client-side JavaScript may show a blank page to AI crawlers. Use server-side rendering or static generation to ensure content is in the initial HTML response.

  • Mistake 3: Treating Agent Signal as a One-Time Fix

    AI crawlers evolve. New crawlers emerge. Standards like llms.txt are still developing. Agent Signal needs periodic review as the AI landscape changes. Schedule quarterly technical audits specifically for AI accessibility.

Agent Signal and the AI Agent Future

Agent Signal is named "Agent" deliberately. Beyond today's AI crawlers, we're moving toward a future where AI agents actively browse, interact with, and take actions on websites on behalf of users.

What this means for your site:

  • AI agents will navigate your site to find specific information for users
  • AI agents will fill out forms and request quotes on behalf of users
  • AI agents will compare services across multiple providers in real time
  • AI agents will evaluate your site experience as part of their recommendation logic

Sites that are technically accessible, well-structured, and machine-readable will be the ones AI agents can work with. Sites that rely on complex JavaScript interactions, ambiguous navigation, or visual-only content will be left behind.

Agent Signal isn't just about today's crawlers. It's about preparing your site for a future where AI systems are the primary way users discover and interact with your brand.

Agent Signal Audit Checklist

Agent Signal is what gets you accessed. Without crawl permissions, structured data, and machine-readable formatting, AI can't reach your content in the first place. Use this checklist to evaluate whether your site is technically open to AI systems, not just to human visitors.

Crawl Access

  • robots.txt explicitly allows major AI crawlers (GPTBot, Google-Extended, PerplexityBot, ClaudeBot)
  • No critical content pages are accidentally blocked by robots.txt rules
  • XML sitemap is up to date and submitted to Google Search Console
  • All important pages are indexed by Google (check via a site: search)

Structured Data

  • Organization schema is implemented on the homepage
  • Person schema exists for the founder and key team members
  • Article/BlogPosting schema is on all content pages
  • FAQPage schema marks up FAQ sections
  • BreadcrumbList schema reflects the site hierarchy
  • All structured data passes Google Rich Results Test validation

Forward-Looking

  • llms.txt file is implemented with a site description and key pages
  • Site navigation is crawlable without JavaScript
  • Contact forms and CTAs are accessible to automated systems
  • Site architecture follows a topic cluster model with clear pillar-to-detail linking

Frequently Asked Questions

  • What is Agent Signal in AEO?

    Agent Signal is the technical infrastructure that determines whether AI systems can access, crawl, parse, and understand your website. It covers structured data, crawl permissions, site architecture, page speed, and machine-readable formatting. Without strong Agent Signal, AI cannot source your content regardless of how good it is.

  • Should I block or allow AI crawlers like GPTBot?

    If you want AI to cite and mention your brand, you should allow AI crawlers access to your public-facing content. Block them only from internal, proprietary, or gated content. Blocking all AI crawlers is self-defeating for AEO because it makes you invisible to the systems you want sourcing your content.

  • What is llms.txt and do I need one?

    llms.txt is an emerging standard, similar to robots.txt, that provides AI systems with guidance on how to understand your site. It includes a site description, key pages, attribution preferences, and contact information. It's still early but implementing it now positions you ahead of competitors. It takes minimal effort to create.

  • How do I check if AI can access my content?

    Test by fetching your page with curl or a tool like Google's URL Inspection Tool. If the important content appears in the raw HTML response (before JavaScript executes), AI crawlers can access it. Also check your robots.txt file to ensure AI crawler tokens (GPTBot, PerplexityBot, ClaudeBot) are not blocked.

  • What's the relationship between technical SEO and Agent Signal?

    Agent Signal includes traditional technical SEO (crawlability, indexability, site speed, structured data) and extends it to AI-specific requirements (AI crawler permissions, llms.txt, content extractability for AI, meta descriptions optimised for AI extraction). Good technical SEO is the foundation, but Agent Signal goes further.

Build AEO capability with a partner in the room

6 months of tracking, guidance, and execution support. We adapt your strategy as you ship and AI systems evolve.