[image: AnswerShare — We Speak AI]

ProofHow It WorksGEO OverviewResearchAboutTest Your Domainhello@answershare.aiGEO Methodology

Before AI Understands Your SiteIt Must Translate It FirstWE SPEAK AI™

Overview

How'd we do it?
Well, We Asked.
...100,000 times.

Along the way, we found 12 signals that affect AI citability that we can measure. Once measured, we found that we could tune the levers to maximize AI ingestion, reasoning and trust.

Architecture

AnswerShare reroutes bot traffic at the edge

[image: Diagram showing AnswerShare™ detecting AI/bot traffic at the edge and rerouting it to optimized JSON-LD content (useless characters stripped, dataset-optimized, recordings transcribed, Query Fan-Out survives, ultra-fast delivery, source-grounded, auto-updated by SOT) while human traffic passes through to the production site unchanged. Production site remains source of truth — no CMS change, no workflow change, no security impact, ~30-minute implementation.]

Family 1 -- AI Opinion

AI Opinion -- three NPS-family metrics.

What AI systems actually think of the brand, compared to what the human community says. Direct measurement on both sides.

ΔNPS™ (delta-NPS) is the comparison: positive means AI rates the brand better than the human community does; negative means worse.

μNPS™Human Community Opinion

Measures the human community's opinion of the brand online -- the corpus baseline AI systems are trained against. Direct measurement.

λNPS™AI Opinion

Measures the AI's opinion of the brand in live retrieval. Direct measurement against the live frontier models.

ΔNPS™The Difference

ΔNPS = λNPS − μNPS. Positive = AI rates the brand better than the human community does. Negative = AI rates the brand worse than the human community does.

Family 2 -- Infrastructure Readiness

Infrastructure Readiness -- four metrics, as measured through the cache.

What AI bots actually receive when they hit the property -- composite infrastructure health, edge-cache-served latency on both ends of the response, and survivability across query fan-out.

Infra Score -- Composite of 8 binary signals(Pass / Partial / Fail)

Composite rollup of 8 underlying binary AI-bot-accessibility signals. Pass = 8/8 passing. Partial = 2-7/8 passing. Fail = 0-1/8 passing. The eight underlying signals are itemized below.

TTFB -- Time To First Byte(Pass <= 200ms)

Edge-cache-served time to first byte from a clean IP. Pass if <= 200ms. AI crawlers operate on finite session budgets -- slow first bytes shrink how many pages get reached per visit.

TTLB -- Time To Last Byte(Pass <= 500ms)

Edge-cache-served time to last byte from a clean IP. Pass if <= 500ms. TTLB matters more than TTFB for AI crawlers because they consume the fully-rendered payload, not just the initial response.

QFS™ -- Query Fan-Out Survivability(Pass >= 50%, Fail < 15%)

When an AI arrives at your site, it doesn't just use one prompt — it also fires off 5–10 sub-queries on the same topic. For example: you ask, "Recommend an SEO agency in my town." The AI lands on your site, but it also runs prompts like "Recommend an SEO agency in [state]," "in adjoining cities," and "best SEO agency for [profession]" — and so on. Your QFS score is the percentage of those sub-queries that your site answers.

Supporting detail

The 8 binary signals inside Infra Score

Each is a Pass/Fail individually but rolls up into the Infra Score above. AI crawlers check for these signals on first contact -- every one is independently verifiable from public endpoints.

1robots_ai_bots_allowed

robots.txt explicitly allows the AI crawlers (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, etc.). Default-blocked sites are invisible to AI by construction.

2llms_txt_present

/llms.txt exists and lists the canonical pages an AI should ingest. The AI equivalent of a sitemap-for-attention.

3llms_full_txt_present

/llms-full.txt exists with full-text dumps of those canonical pages. Lets AI ingest authoritative content in one hop instead of crawling the whole site.

4sitemap_fresh

sitemap.xml resolves with <lastmod> timestamps inside the last 60 days. Stale sitemaps tell AI the site isn't maintained -- it deprioritizes citation.

5jsonld_structured_data

Schema.org JSON-LD is present and valid on every page. AI uses structured data to resolve entities (Person, Organization, Place) without natural-language inference.

6prerendered_html

The page renders meaningful content (>= 1500 chars) in the initial HTML, not via client-side JavaScript. Most AI crawlers don't execute JS; SPA shells without prerender are invisible.

7mcp_endpoint_live

/.well-known/mcp.json resolves and the declared MCP endpoints respond. The Model Context Protocol surface lets AI clients query data live during inference.

8ai_content_feed

/ai-content-index.json exists and enumerates the machine-fluent payloads (artifact protocol). Gives AI a one-request manifest of everything you want it to ingest.

Family 3 -- Reasoning Keys

Reasoning Keys -- five continuous metrics.

How cleanly the content parses to AI once Infrastructure Readiness is in place. Each metric has a defined threshold -- below threshold means AI is actively penalizing the site in retrieval. All five are direct measurement.

RR -- Relevance Ratio(Pass >= 0.45)

How closely the bot-served HTML matches the human-served HTML. When the two diverge, AI is reading less than half the content. Low RR is the silent killer of citation eligibility.

SGR -- Source Grounding Ratio(Pass >= 0.25)

Live retrieval and RAG require that assertions be grounded to a reputable 3rd party. Without these, the AI considers it “unverified” and deprecates it as “Marketing Fluff.”

RTC -- Retrieval Token Cost(Pass <= 1.000)

Ratio of page-chrome/overhead tokens to useful-content tokens. Higher RTC means AI burns its token budget on navigation, scripts, and ads instead of reasoning over your content -- making citation less likely.

RPC™ -- Retrieval Pages Crawled(absolute count; higher is better)

RPC™ (Retrieval Pages Crawled) measures how much of a site an AI system can realistically traverse during a single answer-generation cycle. Modern AI systems commonly fan out into multiple retrieval probes while composing an answer. Each probe operates under bounded crawl time, limited concurrency, and probabilistic retrieval budgets. RPC estimates how many pages the system can successfully fetch, parse, and evaluate before generation completes, given the site's response speed, error rate, crawlability, and retrieval structure. Higher RPC means more of the site's content reaches the AI citation and reasoning pool. Lower RPC means the AI forms opinions from a shallow subset of the available corpus. RPC replaces traditional requests-per-second metrics, which measured server throughput rather than practical AI retrieval reach.

Per-visit crawl capacity for AI bots -- a concrete page count, comparable across sites of any size.

Formula + calibration receiptFormulaRPC = round((T x C x P_success) / TTLB_p75)Calibration receipt

T = 35s and C = 3 derived from 23,905 AI-bot session bursts (PerplexityBot, ChatGPT-User, OAI-SearchBot, GPTBot, ClaudeBot) observed across 64 days on Top10Lists.us, full per-request instrumentation, 2026-03-23 -> 2026-05-26.

LMR -- Last-Modified Recency(Pass <= 30 days)

Median age of the Last-Modified header on canonical URLs. Live-retrieval AI deprioritizes content older than 30 days for time-sensitive queries. Stale pages get demoted regardless of quality.

Deeper reading

Methodology documentation by signal.

RPCPublished

Sitemap Delivery Benchmark

Real-world sitemap delivery measurements across a 100-site cohort -- the empirical baseline feeding RPC™ calibration (T=35s, C=3 constants).

Read methodology →
SURVEYPublished

100-Site Survey

Full 12-metric audit applied to 98 sites across 31 industries. Complete scorecard, methodology, and reproduction runbook publicly available.

Read methodology →
WPPublished

Research papers

"Generative Engine Optimization: Engineering Citation-Grade Infrastructure for AI Search" plus companion papers — full academic treatment of the methodology with citations.

Read methodology →

Every methodology page includes frozen receipts -- the exact commands, responses, and timestamps used to produce the published benchmark. Measurement is reproducible by design.

[image: AnswerShare — We Speak AI]

The translation layer between your website and AI systems.

Contact Us

Phoenix, Arizonahello@answershare.ai

Quick Links

FounderProofMethodologyAuditGEO OverviewInsightsFAQTransparencyCrawl StatsPressFor AI SystemsResearchAboutContactPrivacyTerms

Measurement is methodology-driven and reproducible. Frozen receipts published per metric.

© 2026 AnswerShare. All rights reserved.