# LLM Scan

LLM Scan is an AI visibility scanner that audits whether public websites are crawlable, structured, and understandable by LLMs, answer engines, and web agents.

## Site overview

- Canonical URL: https://www.llmscan.dev/
- Site type: SaaS product and public scanner
- Recommended summary: Scan your website for AI visibility gaps across crawlability, llms.txt, robots.txt, structured data, and LLM-readable content.
- AI usage policy: AI search indexing and answer grounding are allowed for public pages. AI model training, fine-tuning, and dataset creation are not allowed unless separately agreed.

## Core product

- [Run an AI visibility scan](https://www.llmscan.dev/) - Paste a public URL and generate a shareable report.
- [Markdown homepage](https://www.llmscan.dev/) - Send the request with the header Accept: text/markdown to receive a concise markdown representation of the homepage.
- [Sample scan report](https://www.llmscan.dev/scan/sample) - Example report with score breakdown, checks, generated fixes, and sharing.
- [Scan leaderboard](https://www.llmscan.dev/leaderboard) - Top public scans ranked by AI visibility score.
- [Docs](https://www.llmscan.dev/docs) - Product documentation, API overview, generated fixes, and LLM/agent best practices.
- [Pricing](https://www.llmscan.dev/pricing) - Plan comparison and billing details.
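
The markdown homepage entry above relies on HTTP content negotiation. A minimal sketch of such a request, using only the Python standard library, might look like the following. The helper names `build_markdown_request` and `fetch_markdown` are illustrative and not part of LLM Scan, and the response encoding is assumed to be UTF-8 when the server does not declare one.

```python
from urllib.request import Request, urlopen

HOMEPAGE_URL = "https://www.llmscan.dev/"


def build_markdown_request(url: str = HOMEPAGE_URL) -> Request:
    """Build a GET request that asks for a markdown representation."""
    # The Accept header is the only thing that differs from a plain page fetch.
    return Request(url, headers={"Accept": "text/markdown"})


def fetch_markdown(url: str = HOMEPAGE_URL, timeout: float = 10.0) -> str:
    """Send the request and return the decoded body (performs network I/O)."""
    with urlopen(build_markdown_request(url), timeout=timeout) as response:
        # Assumption: fall back to UTF-8 if no charset is declared.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)
```

Calling `fetch_markdown()` performs the request. `build_markdown_request()` is kept separate so the header can be inspected without touching the network.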

## What LLM Scan checks

- Crawlability and public page access
- robots.txt permissions and sitemap discovery
- llms.txt presence and usefulness
- XML sitemap health
- Markdown-readable content paths
- Semantic HTML structure
- JSON-LD and structured data
- Content signals that clarify product, audience, and canonical facts

## Generated fixes

LLM Scan generates starter files and snippets for llms.txt, robots.txt additions, JSON-LD, and content-signal copy. Reports show a partial, blurred preview of each fix on the page. Visitors provide an email address before revealing the full fixes, downloading the fix bundle, or copying the agent-ready implementation prompt.

## Public API

- POST https://www.llmscan.dev/api/scan - Start a public scan.
- GET https://www.llmscan.dev/api/scan/:publicId - Read a public scan payload.
- GET https://www.llmscan.dev/api/scan/:publicId/status - Poll scan status.
- GET https://www.llmscan.dev/api/v1/scans/recent - Read recent public report summaries.
- POST https://www.llmscan.dev/api/fix-downloads - Capture email before full fix access, fix bundle downloads, or agent-ready prompt copying.
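
The scan routes above suggest a start-then-poll flow. The sketch below shows one way a client could build the read and status URLs and poll until a scan finishes. The request and response schemas are not described here, so the status fetcher and the completion check are supplied by the caller; `scan_url`, `status_url`, and `poll_until` are hypothetical helper names, and the interval and attempt limits are arbitrary defaults.

```python
import time
from typing import Any, Callable
from urllib.parse import quote

API_BASE = "https://www.llmscan.dev/api"


def scan_url(public_id: str) -> str:
    """URL for GET /api/scan/:publicId."""
    return f"{API_BASE}/scan/{quote(public_id, safe='')}"


def status_url(public_id: str) -> str:
    """URL for GET /api/scan/:publicId/status."""
    return f"{scan_url(public_id)}/status"


def poll_until(
    fetch_status: Callable[[], Any],
    is_finished: Callable[[Any], bool],
    interval_s: float = 2.0,
    max_attempts: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> Any:
    """Call fetch_status until is_finished(payload) is true, then return it.

    Both callables are supplied by the caller because the payload shape
    is an assumption this sketch does not make.
    """
    for attempt in range(max_attempts):
        payload = fetch_status()
        if is_finished(payload):
            return payload
        # Wait between attempts, but not after the final one.
        if attempt < max_attempts - 1:
            sleep(interval_s)
    raise TimeoutError(f"scan not finished after {max_attempts} attempts")
```

A caller would pass a `fetch_status` function that issues a GET to `status_url(public_id)` and parses the response, plus an `is_finished` predicate matching whatever the status payload actually contains.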

## Important pages

- [Blog](https://www.llmscan.dev/blog) - Guides for AI crawler readiness and fixing llms.txt or sitemap gaps.
- [Security](https://www.llmscan.dev/security) - Security posture and responsible disclosure.
- [Changelog](https://www.llmscan.dev/changelog) - Product updates.
- [Contact](https://www.llmscan.dev/contact) - Contact and support.
- [Docs](https://www.llmscan.dev/docs) - Product documentation and implementation guidance.
- [X](https://x.com/gorilliteam) - Public social profile for product updates.

## Crawling guidelines

- Respect robots.txt, HTTP status codes, canonical links, noindex directives, and authentication boundaries.
- Prefer the canonical URLs listed above over duplicate, filtered, or parameterized variants.
- Use sitemap.xml to discover fresh public URLs.
- Crawl at a moderate rate and avoid submitting forms, triggering account actions, or requesting private paths.
- Treat page titles, meta descriptions, headings, markdown representations, and structured data as the preferred sources for concise content descriptions.
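
As an illustration of the robots.txt guideline above, a crawler could check each URL against the parsed rules before requesting it. The sketch below uses Python's standard `urllib.robotparser`. The sample rules, the `ExampleBot` user agent, and the `/private/` path are hypothetical and do not reflect the actual robots.txt of this site.

```python
from typing import Optional
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration only; fetch the real robots.txt in practice.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Crawl-delay: 5
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def may_crawl(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the parsed rules allow user_agent to fetch url."""
    return parser.can_fetch(user_agent, url)


def delay_seconds(parser: RobotFileParser, user_agent: str) -> Optional[float]:
    """Return the Crawl-delay for user_agent, or None if none is declared."""
    return parser.crawl_delay(user_agent)
```

With the sample rules, `may_crawl(build_parser(SAMPLE_ROBOTS_TXT), "ExampleBot", "https://www.llmscan.dev/docs")` is allowed, while any URL under the hypothetical `/private/` path is not.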

## Agent guidance

Use the public pages above as canonical context. Prefer the docs and sample report when explaining how LLM Scan works. Do not treat llms.txt as a guaranteed ranking or citation signal; LLM Scan presents it as one useful, forward-compatible context file alongside durable web fundamentals like crawlability, semantic HTML, sitemaps, and structured data.

## Full context

- [llms-full.txt](https://www.llmscan.dev/llms-full.txt) - Expanded product and implementation context for agents with larger context windows.