xeo.works
xeo.works/resources/google-seo-compliance-checklist
Google SEO Compliance Checklist
43 checks across 6 dimensions based on Google Search Central's official documentation. Verify spam policy compliance, crawl health, and search appearance.
Google Search Central publishes hundreds of pages of documentation on how Google crawls, indexes, and ranks content. This checklist distills that documentation into 43 verifiable checks across 6 dimensions — from spam policy compliance to search appearance optimization. Every check maps directly to Google's published guidance, not SEO folklore.
How to use this checklist: Audit one page at a time. Score each check as PASS, PARTIAL, or FAIL (2, 1, or 0 points). A single spam policy violation in Dimension 1 can cause site-wide penalties, so start there. For the full SEO methodology for B2B SaaS, read the service page.
40%
Reduction in low-quality content from Google March 2024 Core Update
Search Engine Journal
16,000+
Human quality raters evaluating Google search results globally
Google Blog
~50%
Google searches already showing AI summaries
McKinsey 2025
Dimension 1: Search Essentials Compliance (8 Checks)
A single violation here can trigger a manual action — a penalty that removes pages or your entire site from Google's index. These checks verify your site does not violate any of Google's spam policies. Treat every failure in this dimension as critical.
“Creating 12 vertical pages by swapping the industry name in an identical template. Same structure, same claims, same CTA — just 'fintech' replaced with 'healthcare.' Google classifies these as doorway pages.”
“Each vertical page has unique insights, industry-specific data points, distinct buyer personas, and methodology sections that explain how the approach adapts to that vertical's constraints.”
Search Essentials Compliance
Does your site comply with Google's spam policies?
No cloaking
Content served to users matches content served to crawlers. No conditional rendering based on user-agent detection. SSG pages naturally pass this — verify no server-side user-agent checks exist.
No doorway pages
Each page targets a distinct search intent with genuinely differentiated content. If you have multiple vertical or location pages, each must offer unique value — not identical templates with names swapped.
No hidden text or links
No CSS hiding text from visual users (white-on-white, font-size: 0, display: none on crawlable content, off-screen positioning). Exception: skip-to-content links and screen-reader-only accessibility helpers.
No keyword stuffing
Primary keyword appears naturally in H1, meta description, and first 100 words — but is not forced into every heading, alt attribute, and paragraph. Read the keyword density aloud. If it sounds robotic, it fails.
No scaled content abuse
Content demonstrates original analysis, unique perspective, or first-hand experience. Not mass-generated low-value pages. Each page must pass the anti-template test: could you swap the topic name and have the content still make sense? If yes, it fails.
No site reputation abuse
No third-party content published to exploit your domain authority. All content should be authored by or editorially controlled by your organization. Sponsored content without editorial oversight violates this policy.
No sneaky redirects
All redirects send users to expected destinations. No redirect chains exceeding 5 hops. Permanent content moves use 301 or 308 status codes. Temporary redirects (302) only for genuinely temporary moves.
No thin affiliation
If any page lists or recommends third-party tools, services, or products, it includes original evaluation criteria, editorial perspective, and genuine analysis — not just copied vendor descriptions.
Dimension 2: Helpful Content & People-First Signals (7 Checks)
Google's helpful content system evaluates whether your page was created to serve users or to attract search traffic. The framework is simple: Who created this content, How was it created, and Why does it exist? Pages that fail the "why" test — existing primarily to rank, not to help — get demoted site-wide.
Helpful Content & People-First Signals
Was this page created for people, or for search engines?
Clear authorship (Who)
Author identified with a visible byline that leads to background information. Blog posts show the author name. Service pages have organizational attribution. Schema includes author with @type Person or Organization.
Content creation transparency (How)
Content demonstrates human editorial oversight regardless of tools used. Practitioner voice, original analysis, and first-hand experience are evident. The content should not read as unedited AI output.
People-first purpose (Why)
Page exists to serve the target audience, not primarily to attract search traffic. The test: would this page exist if search engines didn't? If the only reason for the page is keyword volume, it fails.
No search-engine-first red flags
Page does not exhibit: content across unrelated topics, mainly summarizing without adding value, writing trending topics without existing expertise, or artificially dating pages to appear fresh.
Original value contribution
Content provides original information, analysis, or research beyond what is already available. Not a repackaging of the top 10 Google results. Original frameworks, proprietary data, or unique methodology create value.
Content matches stated expertise
All content falls within your organization's claimed area of expertise. A B2B SaaS SEO agency publishing medical advice fails this check. Stay in your lane.
Satisfying user experience on completion
After reading, the user has learned something useful or can take a clear next action. The page delivers on the promise made by its title and meta description. If a reader would return to search unsatisfied, the page fails.
Dimension 3: Crawling & Indexing Health (8 Checks)
If Google cannot efficiently discover, crawl, and index your pages, nothing else matters. These checks go beyond "is the page crawlable" to verify URL hygiene, link crawlability, sitemap completeness, and canonicalization. A technically clean site gets indexed faster and more completely.
Discover
Google finds your URL via sitemaps, internal links, or external links
Crawl
Googlebot requests and downloads the page HTML
Render
Web Rendering Service executes JavaScript to see final content
Index
Content is processed and stored in the search index
Serve
Page appears in search results when it matches a query
Crawling & Indexing Health
Can Google efficiently discover, crawl, and index your pages?
URL structure: descriptive, hyphenated, lowercase
All URLs use hyphens (not underscores), lowercase letters, and descriptive words (not IDs or parameters). No URL fragments used for content changes. Example: /seo-for-b2b-saas, not /page?id=123.
URL parameter minimization
No unnecessary query parameters that don't change page content. No tracking parameters (?utm_source, ?ref) in canonical URLs or internal links. Each URL should resolve to exactly one version of the content.
All links are crawlable anchor elements
Navigation, internal links, and CTAs use standard HTML anchor elements with href attributes. No JavaScript-only navigation, no click handlers on span or div elements as the sole path to a page.
Descriptive anchor text
Internal links use descriptive text explaining the destination — not 'click here' or 'read more' without context. The reader should know where the link leads from the text alone. Image links need descriptive alt text.
Sitemap completeness and validity
Your XML sitemap includes every indexable page. No missing pages, no non-indexable pages included. Referenced in robots.txt. Under 50MB and 50,000 URLs per sitemap. Blog posts and dynamic routes dynamically generated.
Canonical URL correctness
Every page has a self-referencing canonical tag with an absolute URL that matches the served URL exactly. No www vs. non-www mismatch. No trailing slash inconsistency. Canonical tells Google which URL is authoritative.
Robots.txt correctness
robots.txt does not block Googlebot from indexable content. No accidental Disallow rules on important pages. Sitemap directive present. Verify by testing in Google Search Console's robots.txt tester.
JavaScript rendering compatibility
Core page content exists in the initial HTML response — not injected entirely via client-side JavaScript. Googlebot's Web Rendering Service handles JS, but SSG/SSR ensures content is available immediately. Interactive elements can be client-rendered.
Dimension 4: Ranking System Alignment (6 Checks)
Google uses named ranking systems — passage ranking, freshness systems, original content systems, reviews systems, and link analysis — to determine which pages deserve to rank. These checks verify your content aligns with the specific signals each system evaluates.
Ranking System Alignment
Does your content align with Google's named ranking systems?
Passage-rankable content structure
Individual H2 sections can answer queries independently. Each section opens with a clear, self-contained answer that could satisfy a search query on its own. Google's passage ranking system identifies and ranks individual passages within a page.
Freshness signals for time-sensitive content
Blog posts and data-driven pages include accurate date signals. datePublished and dateModified in Article schema reflect reality. No artificially inflated dates. Year references match the current year — not '2024 trends' published in 2026.
Original content prominence
Original frameworks, proprietary data, or unique analysis appear in the first half of the page — not buried below generic overview material. Google's original content system rewards pages where the unique contribution is the centerpiece.
Review and evaluation quality
Listicle and comparison pages include insightful analysis, evidence of actual evaluation, clear criteria, and both benefits AND drawbacks. Google's reviews system specifically rewards evaluation content with editorial rigor. Auto-PASS for non-evaluation pages.
Internal link structure supports PageRank flow
Important pages (hub pages, high-conversion pages) receive proportionally more internal links. Link architecture follows a logical hierarchy: homepage links to hubs, hubs link to spokes, spokes link back. No orphan pages.
Content matches query intent
Content satisfies the dominant search intent for its target keyword. Informational queries need comprehensive answers. Commercial queries need comparison and evaluation. Transactional queries need clear CTAs. Mismatched intent means Google won't rank the page.
Dimension 5: Search Appearance Optimization (8 Checks)
How your page appears in Google search results directly affects click-through rate. Title links, meta description snippets, featured snippet eligibility, and structured data all determine whether a searcher clicks your result or your competitor's. These checks cover SEO and AEO signals that influence search visibility.
Search Result Anatomy
Title Link
60 chars max, unique, descriptive, includes primary keyword naturally
Meta Description Snippet
155 chars, unique per page, gives searchers a reason to click
Featured Snippet Eligibility
Direct answers in extractable format — paragraph, list, or table
Structured Data Enhancements
Rich results, FAQ dropdowns, review stars, breadcrumbs from schema markup
Search Appearance Optimization
How does your page look in Google search results?
Title link best practices
Title tag is unique to the page, concise (60 characters or fewer), descriptive, and not keyword-stuffed. Includes primary keyword naturally. Uses consistent site branding with a delimiter (e.g., 'Page Title | Brand').
Meta description snippet optimization
Meta description is unique per page, 155 characters or fewer, reads naturally, and describes the page's value proposition. Gives searchers enough information to decide whether to click. Not a keyword list.
Featured snippet eligibility
For question or definition queries: content includes a direct, concise answer (40-60 words) in a distinct format — paragraph, numbered list, or HTML table — that Google could extract. Answer appears in the first 300 words.
Structured data follows Google requirements
FAQ schema answers match visible page content word-for-word. Article schema includes all required fields (headline, datePublished, author). BreadcrumbList uses absolute URLs. All schema validates without errors in Google Rich Results Test.
Google Discover eligibility
Blog posts include high-quality images at least 1200 pixels wide. max-image-preview:large not restricted in meta robots. Title is compelling without being clickbait. Auto-PASS for non-content pages.
WebSite structured data for site name
Homepage includes WebSite schema with name (required) and url (required). Site name is consistent across schema, og:site_name, and on-page references. This controls how Google displays your site name in search results.
No important content in data-nosnippet
The data-nosnippet attribute excludes content from Google's search snippets. Verify that no important content is accidentally wrapped in data-nosnippet — this prevents Google from using it in the search result preview.
Title and meta accurately represent content
Title and meta description precisely describe what the page actually covers. Google rewrites inaccurate titles. If your title promises something the page doesn't deliver, Google will either rewrite it or rank the page lower.
Dimension 6: Mobile, Security & Technical Foundations (6 Checks)
Google uses mobile-first indexing — meaning Googlebot primarily crawls and indexes the mobile version of your site. HTTPS, viewport configuration, and semantic HTML landmarks are baseline requirements that most modern frameworks handle well. These checks verify nothing is misconfigured.
Mobile, Security & Technical Foundations
Are the technical foundations solid for Google's mobile-first indexing?
Mobile-first content parity
Same content, headings, structured data, and metadata on mobile as desktop. No content hidden behind mobile-only interactions that Google won't trigger. Responsive layouts naturally ensure parity — verify that comparison tables and complex elements don't degrade.
HTTPS with valid certificate
Site served over HTTPS with a valid certificate. No mixed content (HTTP resources loaded on HTTPS pages). HTTPS is a confirmed ranking signal and a baseline security requirement.
No intrusive interstitials
No popups, modals, or overlays blocking page content on load — especially on mobile. Inline CTAs and sticky bars are fine. Full-screen interstitials that prevent content access trigger Google's page experience penalty. Exception: legally required notices (cookie consent, age verification).
Proper HTTP status codes
404 pages return actual 404 status codes (not 200 with error text — a 'soft 404'). Permanent redirects use 301 or 308. Error pages return appropriate codes. Verify in Google Search Console's URL Inspection tool.
Viewport meta tag present
Document head includes a viewport meta tag: meta name='viewport' content='width=device-width, initial-scale=1'. This signals mobile-friendly design to Google. Avoid user-scalable=no unless accessibility requirements demand it.
Page experience supporting signals
HTTPS configured, no intrusive interstitials, mobile-friendly layout, and clear content hierarchy with semantic HTML landmarks: main, nav, header, footer. These collectively form Google's page experience signals.
"Google Says..." — Common Misconceptions
These callouts reference what Google Search Central actually states. Many widely-held SEO beliefs contradict Google's own documentation. Understanding these prevents you from optimizing for things Google explicitly says it doesn't use.
Google Says... "E-E-A-T is NOT a direct ranking factor." Google's systems use E-E-A-T (Experience, Expertise, Authoritativeness, Trustworthiness) as a quality assessment framework, not a direct ranking signal. Build E-E-A-T signals because they improve content quality and user trust — not because Google counts them as a ranking checkbox.
Google Says... "Content length doesn't determine rankings." There is no word count that guarantees higher rankings. A 500-word page that perfectly answers the query outranks a 5,000-word page that buries the answer. Write as much as the topic requires — no more, no less.
Google Says... "Duplicate content is not penalized; it's just inefficient." Google deduplicates identical content — it chooses one version to show. This is not a penalty. However, copying content from other sites is a separate issue (scraped content spam). Use canonicals to signal which version you prefer.
Google Says... "Perfect Core Web Vitals scores don't guarantee top rankings." Page experience is one of many ranking factors. A page with mediocre CWV and exceptional content will outrank a page with perfect CWV and thin content. Optimize CWV for user experience, not for guaranteed ranking improvement.
Scoring Guide
Score each check as PASS (2 points), PARTIAL (1 point), or FAIL (0 points). Sum across all 43 checks for a raw score out of 86. Normalize to 100: (raw score / 86) x 100.
| Normalized Score | Verdict | Action |
|---|---|---|
| 90-100 | Google-compliant | No Google-specific issues. Ship it. Monitor in Search Console. |
| 75-89 | Minor compliance gaps | Fix specific checks before next crawl. Low risk of penalties. |
| 55-74 | Significant compliance issues | Multiple guideline violations. Fix before promoting the page. |
| Below 55 | Critical compliance failure | Spam policy violation risk or fundamental crawling/indexing problems. Fix immediately. |
Priority order for fixes: Dimension 1 (Search Essentials) failures are the most dangerous — a single spam policy violation can cause site-wide deindexing. Dimension 3 (Crawling & Indexing) failures mean Google cannot access your content at all. Fix these two dimensions first before optimizing Dimensions 4-6.
What to Do Next
Run this checklist on your homepage first, then your highest-traffic pages. Spam policy violations in Dimension 1 are site-wide risks — a single doorway page or cloaking instance affects your entire domain.
For the full SEO methodology behind this checklist, see SEO for B2B SaaS. To understand how Google SEO compliance works alongside AI search optimization, read the AEO Optimization service page — the two are complementary, and pages that pass both checklists perform across both traditional and AI search.
If you want help running this audit across your site, get in touch. We start with the checks most B2B SaaS companies fail first: doorway page risk on vertical pages and structured data mismatches on blog posts.
Free resource from xeo.works — Cross-Engine Optimization for B2B SaaS. Download at xeo.works/resources/google-seo-compliance-checklist