Domain 1: Technical Accessibility
The most sophisticated content strategy is worthless if AI crawlers can't access, render, and parse your pages. This domain checks every technical layer between your content and an AI crawler's index entry.
Verify GPTBot, PerplexityBot, Google-Extended, and anthropic-ai are not blocked. Check for wildcard rules that might catch them.
AI crawlers operate on strict timeout budgets. Use GTmetrix or WebPageTest to measure time-to-first-byte (TTFB) from multiple locations.
Use Google's URL Inspection tool "Inspect URL" → "View Tested Page" → "HTML" to verify your key content is in the raw HTML, not dependent on JavaScript execution.
LCP under 2.5s, CLS under 0.1, FID/INP under 200ms. Check Google Search Console's Core Web Vitals report for field data.
AI crawlers refuse HTTP connections. Verify HTTPS is enforced with a 301 redirect from HTTP.
ChatGPT uses Bing's index. Bing Webmaster Tools is as important as Google Search Console for AI search visibility.
Domain 2: Schema Implementation Audit
Validate schema on your top 10 pages by traffic, your top 5 pillar articles, and at least one example of each content type you publish.
Must include: headline (matching H1), author (linked Person entity), publisher (linked Organization entity), datePublished, dateModified, description.
Every page with question-and-answer content should have FAQPage schema. Answers must be self-contained (no references to "as mentioned above").
Include: name, jobTitle, sameAs (LinkedIn, professional profiles), knowsAbout, affiliation (linked Organization entity).
Include: name, url, logo, foundingDate, sameAs (social profiles, Crunchbase, LinkedIn Company page).
Zero errors required. Warnings are acceptable but should be resolved. Run every schema block through search.google.com/test/rich-results.
Mismatches trigger trust flags in AI parsing systems. Copy-paste from the visible H1 — don't rewrite.
Domain 3: Content Quality Assessment
Evaluate your top 20 pages by organic traffic against each of these criteria. Pages failing more than 3 criteria are high-priority for revision.
Confirmed comprehensive topic coverage, not padding. Every section should add genuine value.
Read each section opening. If the actual answer appears after more than 80 words, it needs restructuring.
Replace "many websites" with "67% of websites in our study." Replace "improves performance" with "reduces LCP by an average of 0.8 seconds."
A 2–4 sentence summary before the introduction that answers the core question the article addresses.
Content that cites credible sources is treated as more authoritative. Minimum 3 external citations per pillar article.
Distinct from FAQPage schema — this is the visible content. Both are required: the visible FAQ section and the schema markup.
Domain 5: Indexation Audit
Use Bing's URL Inspection tool to verify specific pages are indexed. Critical for ChatGPT visibility.
Real-time ping to Bing (and supporting engines) when new content is published. Dramatically reduces indexation lag.
Confirm GPTBot, PerplexityBot, and Google-Extended are crawling your site. Absent bot activity means absent citations.
AI crawlers use RSS feeds for rapid content discovery. Truncated feeds force a full crawl cycle, slowing indexation.
Priority Scoring: What to Fix First
After running this audit, prioritize fixes in this order:
- Priority 1 (fix immediately): Any robots.txt rule blocking AI citation crawlers. Any JavaScript-dependent content on key pages. Bing Webmaster Tools not set up.
- Priority 2 (fix within 2 weeks): Missing FAQPage schema on Q&A content. Article schema missing dateModified or author entity links. No named authors on pillar content.
- Priority 3 (fix within 30 days): Pillar content under 2,000 words. No TL;DR blocks. Answers buried after 80+ words in section openings.
- Priority 4 (ongoing improvement): Increasing data density. Adding external citations. Building additional cluster content to deepen topical authority.
