Checklist43 checks30 min audit

    AEO Citation Readiness Checklist

    43 checks across 5 dimensions to ensure your B2B SaaS content gets cited by ChatGPT, Perplexity, Claude, and Google AI Overviews.

    Published Feb 19, 2026

    AEO (Answer Engine Optimization) citation readiness is the structural, schema, and content quality standard a page must meet to be cited by AI search platforms — ChatGPT, Perplexity, Claude, and Google AI Overviews. This checklist operationalizes XEO's AEO optimization methodology into 43 verifiable checks.

    How to use this checklist: Audit one page at a time. Score each check as PASS or FAIL. Pages that fail more than 10 checks are functionally invisible to AI search. Start with your highest-traffic page.

    94%

    B2B buyers use AI in purchasing decisions

    Forrester 2025

    38%

    Software buyers start with AI chatbots

    Gartner 2026

    $750B

    US revenue through AI search by 2028

    McKinsey 2025


    Dimension 1: Entity Clarity (8 Checks)

    AI systems need to understand who you are before they can cite you. Entity clarity means your brand, your people, and your services are unambiguously defined in both content and structured data. This maps to the first two layers of the Entity Authority Stack: Schema Foundation and Content Architecture.

    Entity Clarity

    Does the AI know who you are?

    Entity statement in first 300 words

    A clear 'X is Y' declaration that defines your company/product. Example: 'Stripe is a financial infrastructure platform for the internet.'

    Organization schema with @id, name, url, sameAs

    JSON-LD Organization or ProfessionalService schema with a stable @id URI, official name, canonical URL, and sameAs links to official profiles.

    Author schema with Person type

    Article/blog pages include an author with @type Person, name, jobTitle, and sameAs linking to LinkedIn/professional profiles.

    Brand-to-topic association in schema

    Schema knowsAbout or serviceType fields match the topics the page actually covers. If you claim expertise in 'fintech SEO,' your schema should say so.

    Consistent entity naming throughout

    Same company/product name used in schema, headings, meta, and body copy. No switching between abbreviations or alternate names mid-page.

    sameAs links to all official profiles

    Organization schema includes sameAs array with LinkedIn company page, Twitter/X, GitHub, Crunchbase, and any other official profiles.

    Service/Product schema maps to actual offerings

    Service schema serviceType and description match what you actually do — not a generic 'digital marketing' when you specialize in AEO.

    BreadcrumbList schema present

    Every page has BreadcrumbList JSON-LD showing site hierarchy (Home > Section > Page). Helps AI systems understand how content relates to the broader site.


    Dimension 2: Content Structure for Extraction (10 Checks)

    AI systems extract individual passages, not entire pages. Content structured for extraction — with direct answers, numbered frameworks, and comparison tables — gets cited. Unstructured prose gets skipped.

    Content Structure for Extraction

    Can the AI pull a clean answer from your content?

    Direct-answer first sentence in each H2 section

    Every major section opens with a 40-60 word self-contained answer. If the AI only reads this sentence, does it get a complete answer?

    Definition blocks use 'X is Y' format

    When defining a concept, lead with a direct declaration — not a context-setting paragraph. 'Schema markup is structured data vocabulary...' not 'When it comes to structured data...'

    Numbered frameworks with step labels

    Multi-step processes use numbered lists with bold step names and one-sentence definitions. AI systems cite numbered frameworks as complete blocks.

    Comparison tables for X vs Y content

    Any comparison content uses HTML tables with clear column headers. Tabular data gets extracted whole by AI — prose comparisons don't.

    FAQ answers are self-contained

    Each FAQ answer starts with a direct response in the first sentence. No 'It depends' or 'Great question!' — lead with the answer.

    No vague opening paragraphs

    'In today's landscape...' 'It's important to note...' 'As businesses increasingly...' — these get skipped entirely. Start with substance.

    Section headings describe their content

    H2/H3 headings are descriptive, not clever. 'How Schema Markup Improves AI Citations' works. 'The Schema Secret' doesn't.

    Clean heading hierarchy (H1 > H2 > H3)

    No skipping levels. One H1 per page. H3s are children of H2s. AI systems use heading structure to understand content relationships.

    Lists are HTML lists, not pseudo-lists

    Use actual <ul>/<ol> elements, not dashes or asterisks in prose. Structured lists get extracted cleanly; paragraph-embedded lists don't.

    Key data points in structured format

    Important statistics live in tables, callout cards, or list items — not buried in the middle of a paragraph. AI extracts structured data more reliably.


    Dimension 3: Schema Implementation (8 Checks)

    Schema markup is the machine-readable layer that tells AI systems what your content is. Incorrect schema is worse than no schema — it confuses the AI about your content's purpose.

    Schema Implementation

    Is your structured data correct and complete?

    JSON-LD format (not Microdata or RDFa)

    All structured data uses JSON-LD in script tags. Google and AI systems prefer JSON-LD. Microdata and RDFa are harder to maintain and less reliable.

    Article schema with required fields

    Blog/content pages include Article schema with headline, datePublished, dateModified, author (Person with @id), and publisher (Organization with @id).

    FAQ schema answers match visible content

    FAQ schema answers must be word-for-word identical to visible page content. Google and AI systems penalize mismatches between schema and displayed text.

    All schema URLs are absolute

    Every url, @id, mainEntityOfPage, and item URL in schema uses the full absolute URL (https://example.com/page), not relative paths (/page).

    No schema type misuse

    Service schema on service pages, Article on blog posts, FAQPage on FAQ sections. Using the wrong type confuses AI systems about content purpose.

    dateModified updated on content refresh

    When content is updated, dateModified reflects the actual update date. AI systems use this as a freshness signal for citation priority.

    mainEntityOfPage points to canonical URL

    Article and Service schema include mainEntityOfPage with the canonical URL. Confirms to AI systems which URL is authoritative for this content.

    Schema validates without errors

    Test all schema in Google Rich Results Test or Schema.org validator. Zero errors. Warnings are acceptable but errors indicate broken structured data.


    Dimension 4: Platform-Specific Readiness (10 Checks)

    Each AI search platform crawls and indexes differently. These checks ensure your content is technically accessible to ChatGPT (via GPTBot), Perplexity (via PerplexityBot), Claude (via ClaudeBot), and Google AI Overviews.

    Platform-Specific Readiness

    Can AI crawlers actually access and process your content?

    Content renders in initial HTML (SSG)

    Core content exists in the initial HTML response — not injected via client-side JavaScript. AI crawlers don't execute JS as reliably as Googlebot's WRS.

    robots.txt allows GPTBot, ClaudeBot, PerplexityBot

    Check robots.txt explicitly. No Disallow rules targeting AI crawlers. Blocking GPTBot means ChatGPT can't see your content.

    No noai or noimageai meta tags

    Meta tags like 'noai' or 'noimageai' signal to AI systems that content should not be used. Only add these if you intentionally want to block AI indexing.

    Structured data types match page function

    The primary schema type aligns with what the page does (Service for offerings, Article for content, HowTo for guides). Mixed signals reduce citation confidence.

    Page loads in under 3 seconds

    AI crawlers have timeout limits. Pages that take too long to load may be partially crawled or skipped entirely. Measure with Lighthouse or WebPageTest.

    Content not behind auth, paywall, or cookie wall

    AI crawlers can't log in, accept cookies, or pay subscriptions. If your content requires any interaction before displaying, AI systems can't index it.

    Images have descriptive alt text

    AI systems use alt text to understand image content and page context. Descriptive alt text ('AEO framework diagram showing 5 steps') adds context that improves citation relevance.

    Canonical URL is self-referencing and correct

    The canonical URL points to itself (not a different page). AI systems use canonical to determine which URL version is authoritative.

    Open Graph metadata present

    ChatGPT and Claude use OG tags (og:title, og:description, og:url) for previews and context. Missing OG data means AI systems have less information to work with.

    Content updated within last 12 months

    AI systems weight recency. Content with a dateModified older than 12 months may be deprioritized in favor of fresher sources covering the same topic.


    Dimension 5: Citation-Worthy Content Quality (7 Checks)

    Technical readiness gets you indexed. Content quality gets you cited. AI systems choose sources based on originality, specificity, and authority — the same qualities that earn trust from human readers.

    Citation-Worthy Content Quality

    Would an AI choose to cite you over the alternative?

    Original analysis, data, or frameworks

    Content includes something that can't be found by combining the top-10 Google results. Original research, proprietary frameworks, or first-hand data create citation anchors.

    Proprietary terminology or named frameworks

    Named concepts (like 'Entity Authority Stack' or 'Pipeline Gap') create unique citation targets. AI systems attribute named frameworks to their source.

    Specific data points with attributed sources

    Concrete numbers from authoritative sources — not 'studies show' or 'research indicates.' Attribution gives AI systems confidence to cite the data.

    Content demonstrates first-hand experience

    Practitioner signals — 'when we audited 50 B2B SaaS sites,' 'in our experience with fintech content' — distinguish first-hand knowledge from aggregated information.

    Clear author with verifiable expertise

    Named author with a professional profile, relevant job title, and published history in the topic area. Anonymous content is harder for AI to evaluate for authority.

    Content answers the query better than competitors

    Search the target query. Read the top 3 results. Does your content provide a more complete, more specific, or more current answer?

    Unique perspective or contrarian position

    If 10 pages say the same thing, none gets cited specifically. A differentiated take — backed by evidence — gives AI systems a reason to cite you over alternatives.


    Scoring Guide

    Count PASS checks across all 5 dimensions. Use this table to interpret your score:

    ScoreVerdictAction
    38-43Citation-readyMonitor citations across platforms. Maintain and iterate.
    30-37Near-readyFix critical failures first. Most gains come from Dimensions 1-2.
    20-29Significant gapsStructural work needed. Prioritize entity clarity and content structure.
    Below 20Not citation-readyStart with schema foundation and content restructuring before anything else.

    Priority order for fixes: Critical checks in Dimension 1 (Entity Clarity) and Dimension 2 (Content Structure) have the highest impact. A page with perfect schema but no extractable content won't get cited. A page with great content but no entity clarity gets cited — but attributed to someone else.


    What to Do Next

    Run this checklist on your highest-traffic page. If more than 10 checks fail, your content is functionally invisible to AI search — which means you're missing the channel where 38% of software buyers now start their research.

    For the full AEO methodology behind this checklist, read How to Rank in AI Search: The Complete AEO Guide. For a deeper look at the 5-Step AEO Framework and Entity Authority Stack, see the AEO Optimization service page.

    If you want help running this audit across your site, get in touch. The first thing we do is search your company name in ChatGPT — the results are always surprising.