openai.comscanned May 28, 2026 · 12:101.35s
Public AI visibility report

openai.comAI visibilityPoor

This site is difficult for AI tools to read right now.

Key strengths include sitemap, while AI guidance file and plain-text page access need attention.

Recommended next step

publish an AI guidance file as text or markdown with more than 200 characters, markdown headings, and at least one absolute URL.

Why monitor after one scan?

AI visibility changes when teams ship new pages, edit pricing or docs, update sitemaps, or change crawler rules. Weekly monitoring catches those silent regressions before answer engines and agents start reading stale or broken signals.

Monitor weekly

Overall score

35/100
Poor
Go to fixesOverall position389 out of 492Leaderboard

// score breakdown

Points by check

8 checks

Crawlability10/20
Robots.txt7.5/15
llms.txt0/15
Sitemap10/10
Markdown support0/15
Semantic HTML7.1/10
Structured data0/10
Content signals0/5
1pass3warn4fail

Public link

llmscan.dev/scan/w0Dg9cnVhKx89sf2yhc7t

Signals checked

8 AI visibility signals

Fix bundle

4 copy-ready files

Share badge

Poor · 35/100

Add a polished proof badge

A compact badge for footer, press, or trust sections that links visitors to this public report.

Embed codellmscan.dev/scan/w0Dg9cnVhKx89sf2yhc7t
<a href="https://www.llmscan.dev/scan/w0Dg9cnVhKx89sf2yhc7t"
  target="_blank"
  rel="noopener"
>
  <img
    src="https://www.llmscan.dev/scan/w0Dg9cnVhKx89sf2yhc7t/badge.png"
    alt="LLM Scan AI visibility score badge"
    width="460"
    height="120"
    style="width: 260px; max-width: 100%; height: auto;"
  />
</a>
Open badge
L
LLM Scan
Poor
Score
35/100

Share your score

Post the public report with: “We scored 35/100 for AI-readability.”

Download fixes

Grab generated files and implementation notes for the highest-impact gaps.

Rescan weekly

Save this domain to catch regressions after content, sitemap, or robots changes.

Monitor weekly

// signal breakdown

8 signals AI systems depend on

The homepage is reachable, but robots.txt contains AI crawler restrictions for GPTBot, ChatGPT-User, ClaudeBot, Claude-Web, PerplexityBot, and Google-Extended.

Signal weight

10/20
Warn

Evidence

url
https://openai.com/
finalUrl
https://openai.com/
status
200

Recommendation

Next step: Review AI crawler Disallow rules and keep only the paths that should be excluded from AI crawler access; serve a non-empty HTML homepage with a canonical link tag.

robots.txt contains AI-crawler path blocks for GPTBot, ChatGPT-User, ClaudeBot, Claude-Web, PerplexityBot, and Google-Extended.

Signal weight

8/15
Warn

Evidence

robotsTxtUrl
https://openai.com/robots.txt
exists
true
rawRobotsTxt
User-agent: * Allow: / Disallow: /microsoft-for-startups/ Sitemap: https://openai.com/sitemap.xml

Recommendation

Next step: Review AI crawler Disallow rules and keep only the paths that should be excluded from AI crawler access.

No llms.txt file was found for this site.

Signal weight

0/15
Fail

Evidence

llmsTxtUrl
https://openai.com/llms.txt
present
false
accessible
false

Recommendation

Next step: Publish /llms.txt as text or markdown with more than 200 characters, markdown headings, and at least one absolute URL.

The sitemap index is valid and the checked child sitemap contains URL entries.

Signal weight

10/10
Pass

Evidence

sitemapUrl
https://openai.com/sitemap.xml
sitemapUrls
[https://openai.com/sitemap.xml]
robotsSitemapUrls
[]

The homepage could not be fetched with a markdown Accept header.

Signal weight

0/15
Fail

Evidence

url
https://openai.com/
acceptHeader
text/markdown
status
403

Recommendation

Next step: Make the homepage respond successfully to requests with Accept: text/markdown, either by serving markdown directly or by adding a markdown variant URL with correct content negotiation.

The homepage has some semantic HTML signals, but one or more title, metadata, heading, landmark, content, or link text checks need improvement.

Signal weight

7/10
Warn

Evidence

url
https://openai.com/
quality
partial
score
71

Recommendation

Next step: Shorten the meta description to 160 characters or fewer. Add exactly one h1 element that describes the page topic.

No JSON-LD, Microdata, or RDFa structured data was found on the page.

Signal weight

0/10
Fail

Evidence

url
https://openai.com/
quality
none
hasStructuredData
false

Recommendation

Next step: Add JSON-LD structured data with Organization or WebSite schema so AI systems can identify the site owner or website entity.

Content-Signal directive not detected in headers, HTML metadata, or robots.txt.

Signal weight

0/5
Fail

Evidence

url
https://openai.com/
hasContentSignals
false
hasContentSignalHeader
false

Recommendation

Next step: Add the standard directive 'Content-Signal: ai-train=no, search=yes, ai-input=yes' to robots.txt, HTML metadata, or HTTP headers so AI systems can discover content usage preferences.

// generated fixes

Downloadable fix files

Preview the generated files below. Enter your email to reveal the full fixes, download the bundle, or copy the agent-ready implementation prompt.

Done-for-you

Agency package

Not sure how to ship the technical fixes? Book a call and we can help turn this report into implemented updates.

Fix planning from your scan

Implementation guidance

AI visibility monitoring

llms.txtMarkdown
# OpenAI | Research & Deployment > We believe our research will eventually lead to artificial general intelligence, a system that can solve human-level problems. Building safe and beneficial AGI is our mission. This llms.txt file summarizes the public, canonical resources that AI assistants and crawlers should use to understand this site. ## Site Overview - Canonical URL: https://openai.com/- Site type: website
robots.txtTXT
# robots.txt additions# Copy these blocks into the existing robots.txt file. Keep current rules unless a note calls out a conflicting Disallow. # AI crawler access# Add explicit Allow rules for blocked AI crawlers; remove or narrow conflicting Disallow rules if your crawler target requires precedence.User-agent: GPTBotAllow: / User-agent: ChatGPT-UserAllow: /
schema.jsonJSON
{  "@context": "https://schema.org",  "@graph": [    {      "@type": "Organization",      "@id": "https://openai.com/#organization",      "name": "OpenAI | Research & Deployment",      "description": "We believe our research will eventually lead to artificial general intelligence, a system that can solve human-level problems. Building safe and beneficial AGI is our mission.",      "url": "https://openai.com/",      "logo": "https://openai.com/apple-icon.png?apple-icon.02be~wu.bus9e.png",
head metaHTML
# Content-Signal recommendations Use these directives to make AI-use preferences explicit for compliant crawlers and AI systems. They are advisory signals, so keep them aligned with robots.txt, terms, and access controls. ## Recommended values - ai-train=no: AI model training, fine-tuning, and dataset creation.- search=yes: AI search indexing, snippets, and discovery.- ai-input=yes: AI answer grounding, retrieval, and generated-response context.