openai.comscanned May 28, 2026 · 12:101.35s

Public AI visibility report

openai.comAI visibilityPoor

This site is difficult for AI tools to read right now.

Key strengths include sitemap, while AI guidance file and plain-text page access need attention.

Recommended next step

publish an AI guidance file as text or markdown with more than 200 characters, markdown headings, and at least one absolute URL.

Turn this scan into weekly monitoring.

Create a free workspace first, then unlock weekly monitoring for AI visibility changes after site, pricing, docs, sitemap, or crawler-rule updates.

Unlock monitoring View AI Optimization docs

Overall score

35/100

Poor

Download PDF

Go to fixes Overall position911 out of 1,131Leaderboard

// score breakdown

Points by check

8 checks

Crawlability10/20WARN

Robots.txt7.5/15WARN

llms.txt0/15FAIL

Sitemap10/10PASS

Markdown support0/15FAIL

Semantic HTML7.1/10WARN

Structured data0/10FAIL

Content signals0/5FAIL

1pass3warn4fail

Public link

llmscan.dev/scan/w0Dg9cnVhKx89sf2yhc7t

Signals checked

8 AI visibility signals

Fix bundle

4 copy-ready files

Share badge

Poor · 35/100

Add a polished proof badge

A compact badge for footer, press, or trust sections that links visitors to this public report.

Embed codellmscan.dev/scan/w0Dg9cnVhKx89sf2yhc7t

<a href="https://www.llmscan.dev/scan/w0Dg9cnVhKx89sf2yhc7t"
  target="_blank"
  rel="noopener"
>
  <img
    src="https://www.llmscan.dev/scan/w0Dg9cnVhKx89sf2yhc7t/badge.png"
    alt="LLM Scan AI visibility score badge"
    width="460"
    height="120"
    style="width: 260px; max-width: 100%; height: auto;"
  />
</a>

Open badge

LLM Scan

Poor

Score

35/100

Share your score

Post the public report with: “We scored 35/100 for AI-readability.”

Download fixes

Grab generated files and implementation notes for the highest-impact gaps.

Rescan weekly

Save this domain to catch regressions after content, sitemap, or robots changes.

Monitor weekly

// signal breakdown

8 signals AI systems depend on

The homepage is reachable, but robots.txt contains AI crawler restrictions for GPTBot, ChatGPT-User, ClaudeBot, Claude-Web, PerplexityBot, and Google-Extended.

Signal weight

10/20

Warn

Evidence

url: https://openai.com/
finalUrl: https://openai.com/
status: 200

Recommendation

Next step: Review AI crawler Disallow rules and keep only the paths that should be excluded from AI crawler access; serve a non-empty HTML homepage with a canonical link tag.

robots.txt contains AI-crawler path blocks for GPTBot, ChatGPT-User, ClaudeBot, Claude-Web, PerplexityBot, and Google-Extended.

Signal weight

8/15

Warn

Evidence

robotsTxtUrl: https://openai.com/robots.txt
exists: true
rawRobotsTxt: User-agent: * Allow: / Disallow: /microsoft-for-startups/ Sitemap: https://openai.com/sitemap.xml

Recommendation

Next step: Review AI crawler Disallow rules and keep only the paths that should be excluded from AI crawler access.

No llms.txt file was found for this site.

Signal weight

0/15

Fail

Evidence

llmsTxtUrl: https://openai.com/llms.txt
present: false
accessible: false

Recommendation

Next step: Publish /llms.txt as text or markdown with more than 200 characters, markdown headings, and at least one absolute URL.

The sitemap index is valid and the checked child sitemap contains URL entries.

Signal weight

10/10

Pass

Evidence

sitemapUrl: https://openai.com/sitemap.xml
sitemapUrls: [https://openai.com/sitemap.xml]
robotsSitemapUrls: []

The homepage could not be fetched with a markdown Accept header.

Signal weight

0/15

Fail

Evidence

url: https://openai.com/
acceptHeader: text/markdown
status: 403

Recommendation

Next step: Make the homepage respond successfully to requests with Accept: text/markdown, either by serving markdown directly or by adding a markdown variant URL with correct content negotiation.

The homepage has some semantic HTML signals, but one or more title, metadata, heading, landmark, content, or link text checks need improvement.

Signal weight

7/10

Warn

Evidence

url: https://openai.com/
quality: partial
score: 71

Recommendation

Next step: Shorten the meta description to 160 characters or fewer. Add exactly one h1 element that describes the page topic.

No JSON-LD, Microdata, or RDFa structured data was found on the page.

Signal weight

0/10

Fail

Evidence

url: https://openai.com/
quality: none
hasStructuredData: false

Recommendation

Next step: Add JSON-LD structured data with Organization or WebSite schema so AI systems can identify the site owner or website entity.

Content-Signal directive not detected in headers, HTML metadata, or robots.txt.

Signal weight

0/5

Fail

Evidence

url: https://openai.com/
hasContentSignals: false
hasContentSignalHeader: false

Recommendation

Next step: Add the standard directive 'Content-Signal: ai-train=no, search=yes, ai-input=yes' to robots.txt, HTML metadata, or HTTP headers so AI systems can discover content usage preferences.

// generated fixes

Downloadable fix files

Preview the generated files below. Enter your email to reveal the full fixes, download the bundle, or copy the agent-ready implementation prompt.

Done-for-you

Agency package

Not sure how to ship the technical fixes? Book a call and we can help turn this report into implemented updates.

Fix planning from your scan

Implementation guidance

AI visibility monitoring

llms.txtMarkdown

# OpenAI | Research & Deployment > We believe our research will eventually lead to artificial general intelligence, a system that can solve human-level problems. Building safe and beneficial AGI is our mission. This llms.txt file summarizes the public, canonical resources that AI assistants and crawlers should use to understand this site. ## Site Overview - Canonical URL: https://openai.com/- Site type: website

robots.txtTXT

# robots.txt additions# Copy these blocks into the existing robots.txt file. Keep current rules unless a note calls out a conflicting Disallow. # AI crawler access# Add explicit Allow rules for blocked AI crawlers; remove or narrow conflicting Disallow rules if your crawler target requires precedence.User-agent: GPTBotAllow: / User-agent: ChatGPT-UserAllow: /

schema.jsonJSON

{  "@context": "https://schema.org",  "@graph": [    {      "@type": "Organization",      "@id": "https://openai.com/#organization",      "name": "OpenAI | Research & Deployment",      "description": "We believe our research will eventually lead to artificial general intelligence, a system that can solve human-level problems. Building safe and beneficial AGI is our mission.",      "url": "https://openai.com/",      "logo": "https://openai.com/apple-icon.png?apple-icon.02be~wu.bus9e.png",

head metaHTML

# Content-Signal recommendations Use these directives to make AI-use preferences explicit for compliant crawlers and AI systems. They are advisory signals, so keep them aligned with robots.txt, terms, and access controls. ## Recommended values - ai-train=no: AI model training, fine-tuning, and dataset creation.- search=yes: AI search indexing, snippets, and discovery.- ai-input=yes: AI answer grounding, retrieval, and generated-response context.

openai.comAI visibilityPoor

Points by check

Add a polished proof badge

8 signals AI systems depend on

Downloadable fix files

Similar AI visibility reports