www.apollo.ioscanned May 26, 2026 · 13:520.81s
Public AI visibility report

www.apollo.ioAI visibilityNeeds Work

This site has a useful foundation, but important gaps still limit AI readability.

Key strengths include AI guidance file and sitemap, while plain-text page access and content signals need attention.

Recommended next step

add content negotiation for Accept: text/markdown on the homepage and return a markdown representation with Content-Type: text/markdown.

Keep the HTML response for regular browser requests.

Why monitor after one scan?

AI visibility changes when teams ship new pages, edit pricing or docs, update sitemaps, or change crawler rules. Weekly monitoring catches those silent regressions before answer engines and agents start reading stale or broken signals.

Monitor weekly

Overall score

58/100
Needs Work
Go to fixesOverall position170 out of 491Leaderboard

// score breakdown

Points by check

8 checks

Crawlability10/20
Robots.txt7.5/15
llms.txt15/15
Sitemap10/10
Markdown support0/15
Semantic HTML5.7/10
Structured data10/10
Content signals0/5
3pass3warn2fail

Public link

llmscan.dev/scan/EGGhu7G9QD86nOIC38xlA

Signals checked

8 AI visibility signals

Fix bundle

4 copy-ready files

Share badge

Needs Work · 58/100

Add a polished proof badge

A compact badge for footer, press, or trust sections that links visitors to this public report.

Embed codellmscan.dev/scan/EGGhu7G9QD86nOIC38xlA
<a href="https://www.llmscan.dev/scan/EGGhu7G9QD86nOIC38xlA"
  target="_blank"
  rel="noopener"
>
  <img
    src="https://www.llmscan.dev/scan/EGGhu7G9QD86nOIC38xlA/badge.png"
    alt="LLM Scan AI visibility score badge"
    width="460"
    height="120"
    style="width: 260px; max-width: 100%; height: auto;"
  />
</a>
Open badge
L
LLM Scan
Needs Work
Score
58/100

Share your score

Post the public report with: “We scored 58/100 for AI-readability.”

Download fixes

Grab generated files and implementation notes for the highest-impact gaps.

Rescan weekly

Save this domain to catch regressions after content, sitemap, or robots changes.

Monitor weekly

// signal breakdown

8 signals AI systems depend on

The homepage is reachable, but the requested URL redirects before reaching the homepage and robots.txt contains AI crawler restrictions for Claude-Web.

Signal weight

10/20
Warn

Evidence

url
https://apollo.io/
finalUrl
https://www.apollo.io/
status
200

Recommendation

Next step: Review AI crawler Disallow rules and keep only the paths that should be excluded from AI crawler access; serve a non-empty HTML homepage with a canonical link tag.

robots.txt contains AI-crawler path blocks for Claude-Web.

Signal weight

8/15
Warn

Evidence

robotsTxtUrl
https://www.apollo.io/robots.txt
exists
true
rawRobotsTxt
User-agent: * Disallow: /*utm_ Disallow: /*source= Disallow: /*ref= Disallow: /*fbclid= Disallow: /*page= Disallow: /*sort= Disallow: /*__hstc= Disallow: /*__hssc= Disallow: /*__hsfp= Disallow: /directory/* Disallow: *netlify.apollo.io/* Allow: / User-agent: anthropic-ai Allow: / User-agent: ChatGPT-User Allow: / User-agent: ClaudeBot Allow: / User-agent: cohere-ai Allow: / User-agent: GPTBot Allow: / User-agent: OAI-SearchBot Allow: / User-agent: PerplexityBot Allow: / User-agent: CCbot Allow: / User-agent: Googlebot Allow: / User-agent: Googlebot-Image Allow: / User-agent: Googlebot-Video Allow: / User-agent: Googlebot-News Allow: / User-agent: AdsBot-Google Allow: / User-agent: AdsBot-Google-Mobile Allow: / User-agent: Google-Extended Allow: / User-agent: Bingbot Allow: / User-agent: DuckDuckGo Allow: / User-agent: Applebot Allow: / User-agent: facebot Allow: / User-agent: Yandex Allow: / User-agent: Baiduspider Allow: / User-agent: Sogou Allow: / User-agent: Naverbot Allow: / User-agent: MojeekBot Allow: / User-agent: PetalBot Allow: / User-agent: AhrefsBot Allow: / User-agent: SemrushBot Allow: / User-agent: MJ12bot Allow: / Sitemap: https://www.apollo.io/sitemap.xml Sitemap: https://www.apollo.io/academy/sitemap.xml Sitemap: https://www.apollo.io/magazine/sitemap.xml Sitemap: https://www.apollo.io/cms-landing-pages/sitemap.xml Sitemap: https://www.apollo.io/leads/sitemap.xml Sitemap: https://www.apollo.io/roles/sitemap.xml Sitemap: https://www.apollo.io/what-is/sitemap.xml Sitemap: https://www.apollo.io/insights/sitemap.xml Sitemap: https://www.apollo.io/sitemap-410.xml

Recommendation

Next step: Review AI crawler Disallow rules and keep only the paths that should be excluded from AI crawler access.

The llms.txt file was found and includes the expected text, length, heading, and URL signals.

Signal weight

15/15
Pass

Evidence

llmsTxtUrl
https://www.apollo.io/llms.txt
present
true
accessible
true

The sitemap.xml file is valid and contains URL entries.

Signal weight

10/10
Pass

Evidence

sitemapUrl
https://www.apollo.io/sitemap.xml
sitemapUrls
[https://www.apollo.io/sitemap.xml]
robotsSitemapUrls
[]

The homepage returned HTML when requested with Accept: text/markdown, so the server appears to ignore markdown content negotiation.

Signal weight

0/15
Fail

Evidence

url
https://www.apollo.io/
acceptHeader
text/markdown
status
200

Recommendation

Next step: Add content negotiation for Accept: text/markdown on the homepage and return a markdown representation with Content-Type: text/markdown. Keep the HTML response for regular browser requests.

The homepage has some semantic HTML signals, but one or more title, metadata, heading, landmark, content, or link text checks need improvement.

Signal weight

6/10
Warn

Evidence

url
https://www.apollo.io/
quality
partial
score
57

Recommendation

Next step: Use exactly one h1 element and move secondary section titles to h2-h6. Avoid skipped heading levels so sections progress from h1 to h2 to h3 without gaps. Add missing semantic elements: article.

Valid JSON-LD structured data was found with core Organization or WebSite schema.org types.

Signal weight

10/10
Pass

Evidence

url
https://www.apollo.io/
quality
good
hasStructuredData
true

Content-Signal directive not detected in headers, HTML metadata, or robots.txt.

Signal weight

0/5
Fail

Evidence

url
https://www.apollo.io/
hasContentSignals
false
hasContentSignalHeader
false

Recommendation

Next step: Add the standard directive 'Content-Signal: ai-train=no, search=yes, ai-input=yes' to robots.txt, HTML metadata, or HTTP headers so AI systems can discover content usage preferences.

// generated fixes

Downloadable fix files

Preview the generated files below. Enter your email to reveal the full fixes, download the bundle, or copy the agent-ready implementation prompt.

Done-for-you

Agency package

Not sure how to ship the technical fixes? Book a call and we can help turn this report into implemented updates.

Fix planning from your scan

Implementation guidance

AI visibility monitoring

llms.txtMarkdown
# AI Sales Platform | Apollo.io - Outbound, Inbound & Automation > Accelerate B2B sales with Apollo.io—an AI sales platform for prospecting, lead gen, and deal automation. Close more deals, faster, with smart data. This llms.txt file summarizes the public, canonical resources that AI assistants and crawlers should use to understand this site. ## Site Overview - Canonical URL: https://www.apollo.io/- Site type: organization
robots.txtTXT
# robots.txt additions# Copy these blocks into the existing robots.txt file. Keep current rules unless a note calls out a conflicting Disallow. # AI crawler access# Add explicit Allow rules for blocked AI crawlers; remove or narrow conflicting Disallow rules if your crawler target requires precedence.
schema.jsonJSON
{  "@context": "https://schema.org",  "@graph": [    {      "@type": "Organization",      "@id": "https://www.apollo.io/#organization",      "name": "AI Sales Platform | Apollo.io - Outbound, Inbound & Automation",      "description": "Accelerate B2B sales with Apollo.io—an AI sales platform for prospecting, lead gen, and deal automation. Close more deals, faster, with smart data.",      "url": "https://www.apollo.io/",      "logo": "https://www.apollo.io/_next/static/media/google-logo.cee709b6.svg",
head metaHTML
# Content-Signal recommendations Use these directives to make AI-use preferences explicit for compliant crawlers and AI systems. They are advisory signals, so keep them aligned with robots.txt, terms, and access controls. ## Recommended values - ai-train=no: AI model training, fine-tuning, and dataset creation.- search=yes: AI search indexing, snippets, and discovery.- ai-input=yes: AI answer grounding, retrieval, and generated-response context.