xeo.works
xeo.works/resources/aeo-citation-readiness-checklist
AEO Citation Readiness Checklist
43 checks across 5 dimensions to ensure your B2B SaaS content gets cited by ChatGPT, Perplexity, Claude, and Google AI Overviews.
AEO (Answer Engine Optimization) citation readiness is the structural, schema, and content quality standard a page must meet to be cited by AI search platforms — ChatGPT, Perplexity, Claude, and Google AI Overviews. This checklist operationalizes XEO's AEO optimization methodology into 43 verifiable checks.
How to use this checklist: Audit one page at a time. Score each check as PASS or FAIL. Pages that fail more than 10 checks are functionally invisible to AI search. Start with your highest-traffic page.
94%
B2B buyers use AI in purchasing decisions
Forrester 2025
38%
Software buyers start with AI chatbots
Gartner 2026
$750B
US revenue through AI search by 2028
McKinsey 2025
Dimension 1: Entity Clarity (8 Checks)
AI systems need to understand who you are before they can cite you. Entity clarity means your brand, your people, and your services are unambiguously defined in both content and structured data. This maps to the first two layers of the Entity Authority Stack: Schema Foundation and Content Architecture.
Entity Clarity
Does the AI know who you are?
Entity statement in first 300 words
A clear 'X is Y' declaration that defines your company/product. Example: 'Stripe is a financial infrastructure platform for the internet.'
Organization schema with @id, name, url, sameAs
JSON-LD Organization or ProfessionalService schema with a stable @id URI, official name, canonical URL, and sameAs links to official profiles.
Author schema with Person type
Article/blog pages include an author with @type Person, name, jobTitle, and sameAs linking to LinkedIn/professional profiles.
Brand-to-topic association in schema
Schema knowsAbout or serviceType fields match the topics the page actually covers. If you claim expertise in 'fintech SEO,' your schema should say so.
Consistent entity naming throughout
Same company/product name used in schema, headings, meta, and body copy. No switching between abbreviations or alternate names mid-page.
sameAs links to all official profiles
Organization schema includes sameAs array with LinkedIn company page, Twitter/X, GitHub, Crunchbase, and any other official profiles.
Service/Product schema maps to actual offerings
Service schema serviceType and description match what you actually do — not a generic 'digital marketing' when you specialize in AEO.
BreadcrumbList schema present
Every page has BreadcrumbList JSON-LD showing site hierarchy (Home > Section > Page). Helps AI systems understand how content relates to the broader site.
Dimension 2: Content Structure for Extraction (10 Checks)
AI systems extract individual passages, not entire pages. Content structured for extraction — with direct answers, numbered frameworks, and comparison tables — gets cited. Unstructured prose gets skipped.
“In today's rapidly evolving digital landscape, it's important to note that many B2B SaaS companies are beginning to explore the possibilities of AI search optimization, which can potentially help them reach new audiences...”
“AEO (Answer Engine Optimization) is the practice of structuring content so AI search platforms — ChatGPT, Perplexity, Claude, Google AI Overviews — cite it in their responses. Unlike SEO, AEO optimizes for extraction, not ranking.”
Content Structure for Extraction
Can the AI pull a clean answer from your content?
Direct-answer first sentence in each H2 section
Every major section opens with a 40-60 word self-contained answer. If the AI only reads this sentence, does it get a complete answer?
Definition blocks use 'X is Y' format
When defining a concept, lead with a direct declaration — not a context-setting paragraph. 'Schema markup is structured data vocabulary...' not 'When it comes to structured data...'
Numbered frameworks with step labels
Multi-step processes use numbered lists with bold step names and one-sentence definitions. AI systems cite numbered frameworks as complete blocks.
Comparison tables for X vs Y content
Any comparison content uses HTML tables with clear column headers. Tabular data gets extracted whole by AI — prose comparisons don't.
FAQ answers are self-contained
Each FAQ answer starts with a direct response in the first sentence. No 'It depends' or 'Great question!' — lead with the answer.
No vague opening paragraphs
'In today's landscape...' 'It's important to note...' 'As businesses increasingly...' — these get skipped entirely. Start with substance.
Section headings describe their content
H2/H3 headings are descriptive, not clever. 'How Schema Markup Improves AI Citations' works. 'The Schema Secret' doesn't.
Clean heading hierarchy (H1 > H2 > H3)
No skipping levels. One H1 per page. H3s are children of H2s. AI systems use heading structure to understand content relationships.
Lists are HTML lists, not pseudo-lists
Use actual <ul>/<ol> elements, not dashes or asterisks in prose. Structured lists get extracted cleanly; paragraph-embedded lists don't.
Key data points in structured format
Important statistics live in tables, callout cards, or list items — not buried in the middle of a paragraph. AI extracts structured data more reliably.
Dimension 3: Schema Implementation (8 Checks)
Schema markup is the machine-readable layer that tells AI systems what your content is. Incorrect schema is worse than no schema — it confuses the AI about your content's purpose.
Entity Audit
Verify how AI systems currently see your brand
Content Structure
Restructure content for passage extraction
Schema Implementation
Add JSON-LD structured data
Citation-Worthy Content
Create content worth citing
Cross-Platform Monitoring
Track citations across AI platforms
Schema Implementation
Is your structured data correct and complete?
JSON-LD format (not Microdata or RDFa)
All structured data uses JSON-LD in script tags. Google and AI systems prefer JSON-LD. Microdata and RDFa are harder to maintain and less reliable.
Article schema with required fields
Blog/content pages include Article schema with headline, datePublished, dateModified, author (Person with @id), and publisher (Organization with @id).
FAQ schema answers match visible content
FAQ schema answers must be word-for-word identical to visible page content. Google and AI systems penalize mismatches between schema and displayed text.
All schema URLs are absolute
Every url, @id, mainEntityOfPage, and item URL in schema uses the full absolute URL (https://example.com/page), not relative paths (/page).
No schema type misuse
Service schema on service pages, Article on blog posts, FAQPage on FAQ sections. Using the wrong type confuses AI systems about content purpose.
dateModified updated on content refresh
When content is updated, dateModified reflects the actual update date. AI systems use this as a freshness signal for citation priority.
mainEntityOfPage points to canonical URL
Article and Service schema include mainEntityOfPage with the canonical URL. Confirms to AI systems which URL is authoritative for this content.
Schema validates without errors
Test all schema in Google Rich Results Test or Schema.org validator. Zero errors. Warnings are acceptable but errors indicate broken structured data.
Dimension 4: Platform-Specific Readiness (10 Checks)
Each AI search platform crawls and indexes differently. These checks ensure your content is technically accessible to ChatGPT (via GPTBot), Perplexity (via PerplexityBot), Claude (via ClaudeBot), and Google AI Overviews.
Platform-Specific Readiness
Can AI crawlers actually access and process your content?
Content renders in initial HTML (SSG)
Core content exists in the initial HTML response — not injected via client-side JavaScript. AI crawlers don't execute JS as reliably as Googlebot's WRS.
robots.txt allows GPTBot, ClaudeBot, PerplexityBot
Check robots.txt explicitly. No Disallow rules targeting AI crawlers. Blocking GPTBot means ChatGPT can't see your content.
No noai or noimageai meta tags
Meta tags like 'noai' or 'noimageai' signal to AI systems that content should not be used. Only add these if you intentionally want to block AI indexing.
Structured data types match page function
The primary schema type aligns with what the page does (Service for offerings, Article for content, HowTo for guides). Mixed signals reduce citation confidence.
Page loads in under 3 seconds
AI crawlers have timeout limits. Pages that take too long to load may be partially crawled or skipped entirely. Measure with Lighthouse or WebPageTest.
Content not behind auth, paywall, or cookie wall
AI crawlers can't log in, accept cookies, or pay subscriptions. If your content requires any interaction before displaying, AI systems can't index it.
Images have descriptive alt text
AI systems use alt text to understand image content and page context. Descriptive alt text ('AEO framework diagram showing 5 steps') adds context that improves citation relevance.
Canonical URL is self-referencing and correct
The canonical URL points to itself (not a different page). AI systems use canonical to determine which URL version is authoritative.
Open Graph metadata present
ChatGPT and Claude use OG tags (og:title, og:description, og:url) for previews and context. Missing OG data means AI systems have less information to work with.
Content updated within last 12 months
AI systems weight recency. Content with a dateModified older than 12 months may be deprioritized in favor of fresher sources covering the same topic.
Dimension 5: Citation-Worthy Content Quality (7 Checks)
Technical readiness gets you indexed. Content quality gets you cited. AI systems choose sources based on originality, specificity, and authority — the same qualities that earn trust from human readers.
Entity Authority Stack
Cross-Platform Citation
Consistent presence across AI search platforms, social profiles, and industry mentions
Topical Authority
Depth and breadth of content covering your core expertise areas
Content Architecture
Structured, extractable content with clear entity relationships
Schema Foundation
JSON-LD structured data defining your entity, services, and people
Citation-Worthy Content Quality
Would an AI choose to cite you over the alternative?
Original analysis, data, or frameworks
Content includes something that can't be found by combining the top-10 Google results. Original research, proprietary frameworks, or first-hand data create citation anchors.
Proprietary terminology or named frameworks
Named concepts (like 'Entity Authority Stack' or 'Pipeline Gap') create unique citation targets. AI systems attribute named frameworks to their source.
Specific data points with attributed sources
Concrete numbers from authoritative sources — not 'studies show' or 'research indicates.' Attribution gives AI systems confidence to cite the data.
Content demonstrates first-hand experience
Practitioner signals — 'when we audited 50 B2B SaaS sites,' 'in our experience with fintech content' — distinguish first-hand knowledge from aggregated information.
Clear author with verifiable expertise
Named author with a professional profile, relevant job title, and published history in the topic area. Anonymous content is harder for AI to evaluate for authority.
Content answers the query better than competitors
Search the target query. Read the top 3 results. Does your content provide a more complete, more specific, or more current answer?
Unique perspective or contrarian position
If 10 pages say the same thing, none gets cited specifically. A differentiated take — backed by evidence — gives AI systems a reason to cite you over alternatives.
Scoring Guide
Count PASS checks across all 5 dimensions. Use this table to interpret your score:
| Score | Verdict | Action |
|---|---|---|
| 38-43 | Citation-ready | Monitor citations across platforms. Maintain and iterate. |
| 30-37 | Near-ready | Fix critical failures first. Most gains come from Dimensions 1-2. |
| 20-29 | Significant gaps | Structural work needed. Prioritize entity clarity and content structure. |
| Below 20 | Not citation-ready | Start with schema foundation and content restructuring before anything else. |
Priority order for fixes: Critical checks in Dimension 1 (Entity Clarity) and Dimension 2 (Content Structure) have the highest impact. A page with perfect schema but no extractable content won't get cited. A page with great content but no entity clarity gets cited — but attributed to someone else.
What to Do Next
Run this checklist on your highest-traffic page. If more than 10 checks fail, your content is functionally invisible to AI search — which means you're missing the channel where 38% of software buyers now start their research.
For the full AEO methodology behind this checklist, read How to Rank in AI Search: The Complete AEO Guide. For a deeper look at the 5-Step AEO Framework and Entity Authority Stack, see the AEO Optimization service page.
If you want help running this audit across your site, get in touch. The first thing we do is search your company name in ChatGPT — the results are always surprising.
Free resource from xeo.works — Cross-Engine Optimization for B2B SaaS. Download at xeo.works/resources/aeo-citation-readiness-checklist