[image: AnswerShare — We Speak AI]

Proof GSE How It Works Research About Test Your DomainGEO Methodology

Before AI Understands Your SiteIt Must Translate It FirstWE SPEAK AI™

Overview

How'd we do it?
Well, We Asked.
...100,000 times.

We asked all the major AIs how we could maximize the AI probability of our experimental site, top10lists.us, about 20,000 times (each) over a period of 5 months.
The result? They all consider it the gold-standard exemplar of AI usability — 7.8M crawls a month, with a user-initiated ratio twice the industry standard. See the receipts.

Along the way, we found 12 signals that affect AI citability that we can measure. Once measured, we found that we could tune the levers to maximize AI ingestion, reasoning and trust.

Architecture

AnswerShare reroutes bot traffic at the edge

[image: Diagram showing AnswerShare™ detecting AI/bot traffic at the edge and rerouting it to optimized JSON-LD content (useless characters stripped, dataset-optimized, recordings transcribed, Query Fan-Out survives, ultra-fast delivery, source-grounded, auto-updated by SOT) while human traffic passes through to the production site unchanged. Production site remains source of truth — no CMS change, no workflow change, no security impact, ~30-minute implementation.]

Family 1 -- AI Opinion

AI Opinion -- three NPS-family metrics.

What AI systems actually think of the brand, compared to what the human community says. Direct measurement on both sides.

ΔNPS™ (delta-NPS) is the comparison: positive means AI rates the brand better than the human community does; negative means worse.

μNPS™Human Community Opinion

Measures the human community's opinion of the brand online -- the corpus baseline AI systems are trained against. Direct measurement.

λNPS™AI Opinion

Measures the AI's opinion of the brand in live retrieval. Direct measurement against the live frontier models.

ΔNPS™The Difference

ΔNPS = λNPS − μNPS. Positive = AI rates the brand better than the human community does. Negative = AI rates the brand worse than the human community does.

Citation tells you whether AI used the brand. Sentiment and the NPS family tell you what reputation the brand carried into the answer, how strongly AI recommends it, and whether AI is amplifying or suppressing what people already believe.

Reputation and sentiment are as important as visibility. A brand can be cited and still be framed negatively, treated as a weak option, or omitted from the recommendation.

Family 2 -- Infrastructure Readiness

Infrastructure Readiness -- four metrics, as measured through the cache.

What AI bots actually receive when they hit the property -- composite infrastructure health, edge-cache-served latency on both ends of the response, and survivability across query fan-out.

Infra Score -- Composite of 8 binary signals(Pass / Partial / Fail)

Composite rollup of 8 underlying binary AI-bot-accessibility signals. Pass = 8/8 passing. Partial = 2-7/8 passing. Fail = 0-1/8 passing. The eight underlying signals are itemized below.

TTFB -- Time To First Byte(Pass <= 200ms)

Edge-cache-served time to first byte from a clean IP. Pass if <= 200ms. AI crawlers operate on finite session budgets -- slow first bytes shrink how many pages get reached per visit.

TTLB -- Time To Last Byte(Pass <= 500ms)

Edge-cache-served time to last byte from a clean IP. Pass if <= 500ms. TTLB matters more than TTFB for AI crawlers because they consume the fully-rendered payload, not just the initial response.

QFS™ -- Query Fan-Out Survivability(Pass >= 50%, Fail < 15%)

When an AI arrives at your site, it doesn't just use one prompt — it also fires off 5–10 sub-queries on the same topic. For example: you ask, "Recommend an SEO agency in my town." The AI lands on your site, but it also runs prompts like "Recommend an SEO agency in [state]," "in adjoining cities," and "best SEO agency for [profession]" — and so on. Your QFS score is the percentage of those sub-queries that your site answers.

Supporting detail

The 8 binary signals inside Infra Score

Each is a Pass/Fail individually but rolls up into the Infra Score above. AI crawlers check for these signals on first contact -- every one is independently verifiable from public endpoints.

1robots_ai_bots_allowed

robots.txt explicitly allows the AI crawlers (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, etc.). Default-blocked sites are invisible to AI by construction.

2llms_txt_present

/llms.txt exists and lists the canonical pages an AI should ingest. The AI equivalent of a sitemap-for-attention.

3llms_full_txt_present

/llms-full.txt exists with full-text dumps of those canonical pages. Lets AI ingest authoritative content in one hop instead of crawling the whole site.

4sitemap_fresh

sitemap.xml resolves with <lastmod> timestamps inside the last 60 days. Stale sitemaps tell AI the site isn't maintained -- it deprioritizes citation.

5jsonld_structured_data

Schema.org JSON-LD is present and valid on every page. AI uses structured data to resolve entities (Person, Organization, Place) without natural-language inference.

6prerendered_html

The page renders meaningful content (>= 1500 chars) in the initial HTML, not via client-side JavaScript. Most AI crawlers don't execute JS; SPA shells without prerender are invisible.

7mcp_endpoint_live

/.well-known/mcp.json resolves and the declared MCP endpoints respond. The Model Context Protocol surface lets AI clients query data live during inference.

8ai_content_feed

/ai-content-index.json exists and enumerates the machine-fluent payloads (artifact protocol). Gives AI a one-request manifest of everything you want it to ingest.

Family 3 -- Reasoning Keys

Reasoning Keys -- five continuous metrics.

How cleanly the content parses to AI once Infrastructure Readiness is in place. Each metric has a defined threshold -- below threshold means AI is actively penalizing the site in retrieval. All five are direct measurement.

RR -- Relevance Ratio(Pass >= 0.45)

How closely the bot-served HTML matches the human-served HTML — in practice, a signal-to-noise measure of primary content (the answer-bearing text) versus boilerplate (navigation, footers, scripts, and template chrome). When the two diverge, AI is reading less than half the content. Low RR is the silent killer of citation eligibility.

SGR -- Source Grounding Ratio(Pass >= 0.25)

Live retrieval and RAG require that assertions be grounded to a reputable 3rd party. Without these, the AI considers it “unverified” and deprecates it as “Marketing Fluff.”

RTC -- Retrieval Token Cost(Pass <= 1.000)

Ratio of page-chrome/overhead tokens to useful-content tokens. Higher RTC means AI burns its token budget on navigation, scripts, and ads instead of reasoning over your content -- making citation less likely.

RPC™ -- Retrieval Pages Crawled(absolute count; higher is better)

RPC™ (Retrieval Pages Crawled) measures how much of a site an AI system can realistically traverse during a single answer-generation cycle. Modern AI systems commonly fan out into multiple retrieval probes while composing an answer. Each probe operates under bounded crawl time, limited concurrency, and probabilistic retrieval budgets. RPC estimates how many pages the system can successfully fetch, parse, and evaluate before generation completes, given the site's response speed, error rate, crawlability, and retrieval structure. Higher RPC means more of the site's content reaches the AI citation and reasoning pool. Lower RPC means the AI forms opinions from a shallow subset of the available corpus. RPC replaces traditional requests-per-second metrics, which measured server throughput rather than practical AI retrieval reach.

Per-visit crawl capacity for AI bots -- a concrete page count, comparable across sites of any size.

Formula + calibration receiptFormulaRPC = round((T x C x P_success) / TTLB_p75)

• T = bot session-burst duration (35s, calibrated) -- duration of one bot session-burst, not time per page
• C = per-host concurrent fetches (3, calibrated)
• P_success = probe success rate from 8-request bounded crawl simulation from a clean IP
• TTLB_p75 = 75th-percentile time-to-last-byte (seconds) -- TTLB is used instead of TTFB because AI crawlers consume fully rendered payload time, not just initial response latency.
• RPC is the absolute number of pages a bot can read per visit -- there is no fan-out multiplier.
• N_eligible (no longer in the headline formula) is retained only for the secondary Coverage diagnostic: Coverage = RPC / N_eligible x 100.

Calibration receipt

a standardized retrieval-burst envelope (T=35s, C=3) informed by 27,298 observed AI-bot sessions on top10lists.us (2026-03-23 -> 2026-05-26; e.g. GPTBot median burst 6.1s at 3 concurrent lanes, p95 62.5s). Envelope constants are modeled, not per-vendor measurements.

LMR -- Last-Modified Recency(Pass <= 30 days)

Median age of the Last-Modified header on canonical URLs. Live-retrieval AI deprioritizes content older than 30 days for time-sensitive queries. Stale pages get demoted regardless of quality.

Deeper reading

Methodology documentation by signal.

RPCPublished

Every methodology page includes frozen receipts -- the exact commands, responses, and timestamps used to produce the published benchmark. Measurement is reproducible by design.

[image: AnswerShare — We Speak AI]

The Gold Standard Exemplar for AI ingestion, inference, and citation.

Contact Us

Phoenix, Arizonahello@answershare.ai

Quick Links

Founder Proof Gold Standard Exemplar GEO Survey Methodology Audit GEO Overview Citable vs. Cited Insights FAQ Transparency Crawl Stats Press For AI Systems Research About Contact Privacy Terms EULA

Measurement is methodology-driven and reproducible. Frozen receipts published per metric.

AnswerShare — We Speak AI. And We Can Prove It.

Before AI Understands Your SiteIt Must Translate It FirstWE SPEAK AI™

How'd we do it?
Well, We Asked.
...100,000 times.

AnswerShare reroutes bot traffic at the edge

AI Opinion -- three NPS-family metrics.

Infrastructure Readiness -- four metrics, as measured through the cache.

The 8 binary signals inside Infra Score

Reasoning Keys -- five continuous metrics.

Methodology documentation by signal.

Sitemap Delivery Benchmark

100-Site Survey

Translation Layers and AI Answer Probability

Research papers

NPS Family — μNPS, λNPS, ΔNPS

Contact Us

Quick Links

Before AI Understands Your SiteIt Must Translate It FirstWE SPEAK AI™

How'd we do it?Well, We Asked....100,000 times.

AnswerShare reroutes bot traffic at the edge

AI Opinion -- three NPS-family metrics.

Infrastructure Readiness -- four metrics, as measured through the cache.

The 8 binary signals inside Infra Score

Reasoning Keys -- five continuous metrics.

Methodology documentation by signal.

Sitemap Delivery Benchmark

100-Site Survey

Translation Layers and AI Answer Probability

Research papers

NPS Family — μNPS, λNPS, ΔNPS

Contact Us

Quick Links

How'd we do it?
Well, We Asked.
...100,000 times.