# LLM Scan LLM Scan is an AI visibility scanner that audits whether public websites are crawlable, structured, and understandable by LLMs, answer engines, and web agents. ## Site overview - Canonical URL: https://www.llmscan.dev/ - Site type: SaaS product and public scanner - Recommended summary: Scan your website for AI visibility gaps across crawlability, llms.txt, robots.txt, structured data, and LLM-readable content. - AI usage policy: AI search indexing and answer grounding are allowed for public pages. AI model training, fine-tuning, and dataset creation are not allowed unless separately agreed. ## Core product - [Run an AI visibility scan](https://www.llmscan.dev/) - Paste a public URL and generate a shareable report. - [Markdown homepage](https://www.llmscan.dev/) - Request with Accept: text/markdown to receive a concise markdown representation of the homepage. - [Sample scan report](https://www.llmscan.dev/scan/sample) - Example report with score breakdown, checks, generated fixes, and sharing. - [Scan leaderboard](https://www.llmscan.dev/leaderboard) - Top public scans ranked by AI visibility score. - [Docs](https://www.llmscan.dev/docs) - Product documentation, API overview, generated fixes, and LLM/agent best practices. - [Pricing](https://www.llmscan.dev/pricing) - Plan comparison and billing details. ## What LLM Scan checks - Crawlability and public page access - robots.txt permissions and sitemap discovery - llms.txt presence and usefulness - XML sitemap health - Markdown-readable content paths - Semantic HTML structure - JSON-LD and structured data - Content signals that clarify product, audience, and canonical facts ## Generated fixes LLM Scan generates starter files and snippets for llms.txt, robots.txt additions, JSON-LD, and content-signal copy. Reports show partial blurred fix previews on-page. Visitors provide an email before revealing the full fixes, downloading the fix bundle, or copying the agent-ready implementation prompt. ## Public API - POST https://www.llmscan.dev/api/scan - Start a public scan. - GET https://www.llmscan.dev/api/scan/:publicId - Read a public scan payload. - GET https://www.llmscan.dev/api/scan/:publicId/status - Poll scan status. - GET https://www.llmscan.dev/api/v1/scans/recent - Read recent public report summaries. - POST https://www.llmscan.dev/api/fix-downloads - Capture email before full fix access, fix bundle downloads, or agent-ready prompt copying. ## Important pages - [Blog](https://www.llmscan.dev/blog) - Guides for AI crawler readiness and fixing llms.txt or sitemap gaps. - [Security](https://www.llmscan.dev/security) - Security posture and responsible disclosure. - [Changelog](https://www.llmscan.dev/changelog) - Product updates. - [Contact](https://www.llmscan.dev/contact) - Contact and support. - [Docs](https://www.llmscan.dev/docs) - Product documentation and implementation guidance. - [X](https://x.com/gorilliteam) - Public social profile for product updates. ## Crawling guidelines - Respect robots.txt, HTTP status codes, canonical links, noindex directives, and authentication boundaries. - Prefer the canonical URLs listed above before crawling duplicate, filtered, or parameterized variants. - Use sitemap.xml to discover fresh public URLs. - Crawl at a moderate rate and avoid submitting forms, triggering account actions, or requesting private paths. - Treat page titles, meta descriptions, headings, markdown representations, and structured data as the preferred source for concise content descriptions. ## Agent guidance Use the public pages above as canonical context. Prefer the docs and sample report when explaining how LLM Scan works. Do not treat llms.txt as a guaranteed ranking or citation signal; LLM Scan presents it as one useful, forward-compatible context file alongside durable web fundamentals like crawlability, semantic HTML, sitemaps, and structured data. ## Full context - [llms-full.txt](https://www.llmscan.dev/llms-full.txt) - Expanded product and implementation context for agents with larger context windows.