heijka.nl · scanned May 30, 2026 · 07:00 · 2.33s
Public AI visibility report

heijka.nl · AI visibility: Poor

This site is difficult for AI tools to read right now.

Key strengths are the sitemap and structured data, while crawlability and crawler policy need attention.

Recommended next step

Remove AI crawler Disallow: / rules or replace them with narrower path-level restrictions for private content only.

Why monitor after one scan?

AI visibility changes when teams ship new pages, edit pricing or docs, update sitemaps, or change crawler rules. Weekly monitoring catches those silent regressions before answer engines and agents start reading stale or broken signals.


Overall score

29/100
Poor
Overall position: 454 out of 492

// score breakdown

Points by check

8 checks

Crawlability: 0/20
Robots.txt: 0/15
llms.txt: 0/15
Sitemap: 10/10
Markdown support: 0/15
Semantic HTML: 8.6/10
Structured data: 10/10
Content signals: 0/5

2 pass · 1 warn · 5 fail
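
As a quick arithmetic check on the breakdown above, the per-check points can be summed and compared with the reported overall score. The report does not say how the overall figure is derived; treating it as the rounded sum of the check points is an assumption made here for illustration.

```python
# Points earned and points available per check, copied from the breakdown.
POINTS = {
    "Crawlability": (0, 20),
    "Robots.txt": (0, 15),
    "llms.txt": (0, 15),
    "Sitemap": (10, 10),
    "Markdown support": (0, 15),
    "Semantic HTML": (8.6, 10),
    "Structured data": (10, 10),
    "Content signals": (0, 5),
}

earned = sum(got for got, _ in POINTS.values())
available = sum(maximum for _, maximum in POINTS.values())

# Assumption: the overall score is the sum rounded to the nearest integer.
overall = round(earned)
print(f"{overall}/{available}")
```

Under that assumption the sum of 28.6 rounds to the reported 29 out of 100.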

Public link

llmscan.dev/scan/n4YrJO6S73uItEWqa7RPO

Signals checked

8 AI visibility signals

Fix bundle

4 copy-ready files

Share badge

Poor · 29/100

Add a polished proof badge

A compact badge for footer, press, or trust sections that links visitors to this public report.

Embed code
<a href="https://www.llmscan.dev/scan/n4YrJO6S73uItEWqa7RPO"
  target="_blank"
  rel="noopener"
>
  <img
    src="https://www.llmscan.dev/scan/n4YrJO6S73uItEWqa7RPO/badge.png"
    alt="LLM Scan AI visibility score badge"
    width="460"
    height="120"
    style="width: 260px; max-width: 100%; height: auto;"
  />
</a>


// signal breakdown

8 signals AI systems depend on

The homepage is reachable, but robots.txt blocks GPTBot from crawling the site.

Signal weight: 0/20 (Fail)

Evidence

url: https://heijka.nl/
finalUrl: https://heijka.nl/
status: 200

Recommendation

Next step: Remove AI crawler Disallow: / rules or replace them with narrower path-level restrictions for private content only.

robots.txt explicitly blocks GPTBot from the whole site.

Signal weight: 0/15 (Fail)

Evidence

robotsTxtUrl: https://heijka.nl/robots.txt
exists: true
rawRobotsTxt:

User-agent: *
Disallow: /?s=*
Disallow: /*?add-to-cart=
Disallow: /*?add_to_wishlist=
Disallow: /*?orderby=
Disallow: /winkelwagen/
Disallow: /afrekenen/
Disallow: /mijn-account/
# Filter Everything PRO / filter URLs
Disallow: /*-or-*
Disallow: /wp-content/uploads/wc-logs/
Disallow: /wp-content/uploads/woocommerce_transient_files/
Disallow: /wp-content/uploads/woocommerce_uploads/
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php
Sitemap: https://heijka.nl/sitemap_index.xml

# Block SEO crawlers
User-agent: DotBot
Disallow: /

User-agent: AhrefsBot
Disallow: /

User-agent: SemrushBot
Disallow: /

User-agent: SemrushBot-SA
Disallow: /

User-agent: MJ12bot
Disallow: /

User-agent: Rogerbot
Disallow: /

User-agent: BLEXBot
Disallow: /

User-agent: Barkrowler
Disallow: /

User-agent: Gigabot
Disallow: /

User-agent: Exabot
Disallow: /

User-agent: CCBot
Disallow: /

# Block AI crawlers
User-agent: GPTBot
Disallow: /

User-agent: meta-externalagent
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: cohere-ai
Disallow: /

# Limit Applebot to main category pages only
User-agent: Applebot
Disallow: /product-categorie/*/*

Recommendation

Next step: Remove the GPTBot Disallow: / rule or add narrower Allow/Disallow rules if GPTBot should be able to discover public content.
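
The block reported here can be reproduced locally before and after editing robots.txt. The following is a minimal sketch using Python's standard `urllib.robotparser`; the `ROBOTS_EXCERPT` string is a shortened excerpt of the rules in the evidence, and the `allowed_agents` helper is a hypothetical name introduced for illustration. Note that this parser does not implement wildcard path patterns, so it is only a rough check for rules such as `Disallow: /*?orderby=`.

```python
from urllib.robotparser import RobotFileParser

# Shortened excerpt of the scanned robots.txt (assumption: the omitted
# rules do not affect the agents tested below).
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

User-agent: GPTBot
Disallow: /

User-agent: anthropic-ai
Disallow: /
"""


def allowed_agents(robots_txt: str, agents: list[str], url: str) -> dict[str, bool]:
    """Return, per user agent, whether robots_txt permits fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


result = allowed_agents(
    ROBOTS_EXCERPT,
    ["GPTBot", "anthropic-ai", "Googlebot"],
    "https://heijka.nl/",
)
for agent, allowed in result.items():
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Running the same helper against the edited file should show GPTBot as allowed once the site-wide Disallow rule is removed or narrowed.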

No llms.txt file was found for this site.

Signal weight: 0/15 (Fail)

Evidence

llmsTxtUrl: https://heijka.nl/llms.txt
present: false
accessible: false

Recommendation

Next step: Publish /llms.txt as text or markdown with more than 200 characters, markdown headings, and at least one absolute URL.
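
The three criteria named in this recommendation can be checked before publishing. This is a rough sketch only: the `check_llms_txt` function and its regular expressions are assumptions for illustration, and the scanner's actual rules may be stricter or differ in detail.

```python
import re


def check_llms_txt(text: str) -> dict[str, bool]:
    """Check a candidate llms.txt body against the three stated criteria."""
    return {
        # More than 200 characters of content.
        "long_enough": len(text) > 200,
        # At least one markdown heading such as "# Title" or "## Section".
        "has_heading": re.search(r"^#{1,6}\s+\S", text, re.MULTILINE) is not None,
        # At least one absolute http(s) URL.
        "has_absolute_url": re.search(r"https?://\S+", text) is not None,
    }


draft = "# Example site\n\nShort summary without links.\n"
print(check_llms_txt(draft))
```

A draft passes this sketch only when all three values are true.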

The sitemap index is valid and the checked child sitemap contains URL entries.

Signal weight: 10/10 (Pass)

Evidence

sitemapUrl: https://heijka.nl/sitemap_index.xml
sitemapUrls: [https://heijka.nl/sitemap.xml]
robotsSitemapUrls: []
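
A check of this kind can be approximated by parsing the sitemap index and confirming that a child sitemap lists at least one URL. The sketch below works on inline sample XML rather than fetching anything; `SAMPLE_INDEX`, `SAMPLE_CHILD`, and the two helper functions are hypothetical and only mirror the URLs shown in the evidence.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

SAMPLE_INDEX = """\
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap><loc>https://heijka.nl/sitemap.xml</loc></sitemap>
</sitemapindex>
"""

SAMPLE_CHILD = """\
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://heijka.nl/</loc></url>
</urlset>
"""


def child_sitemaps(index_xml: str) -> list[str]:
    """Return the child sitemap URLs listed in a sitemap index."""
    root = ET.fromstring(index_xml)
    return [loc.text.strip() for loc in root.findall("sm:sitemap/sm:loc", NS) if loc.text]


def page_urls(sitemap_xml: str) -> list[str]:
    """Return the page URLs listed in a single sitemap."""
    root = ET.fromstring(sitemap_xml)
    return [loc.text.strip() for loc in root.findall("sm:url/sm:loc", NS) if loc.text]


print(child_sitemaps(SAMPLE_INDEX))
print(len(page_urls(SAMPLE_CHILD)) > 0)
```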

The homepage returned HTML when requested with Accept: text/markdown, so the server appears to ignore markdown content negotiation.

Signal weight: 0/15 (Fail)

Evidence

url: https://heijka.nl/
acceptHeader: text/markdown
status: 200

Recommendation

Next step: Add content negotiation for Accept: text/markdown on the homepage and return a markdown representation with Content-Type: text/markdown. Keep the HTML response for regular browser requests.
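
To illustrate the content negotiation described in this recommendation, here is a minimal WSGI sketch in Python. It is not how the site's stack (which appears to be WordPress) would implement it; `HTML_BODY`, `MARKDOWN_BODY`, `wants_markdown`, and `app` are placeholder names, and the Accept parsing is deliberately simplified: it does not rank media types by q-value.

```python
HTML_BODY = b"<h1>Example</h1>"
MARKDOWN_BODY = b"# Example\n"


def wants_markdown(accept: str) -> bool:
    """True if the Accept header lists text/markdown with a non-zero q-value."""
    for part in accept.split(","):
        fields = [field.strip() for field in part.split(";")]
        if fields[0].lower() != "text/markdown":
            continue
        q = 1.0
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0
        return q > 0
    return False


def app(environ, start_response):
    """Serve markdown when requested, HTML otherwise."""
    if wants_markdown(environ.get("HTTP_ACCEPT", "")):
        body, content_type = MARKDOWN_BODY, "text/markdown; charset=utf-8"
    else:
        body, content_type = HTML_BODY, "text/html; charset=utf-8"
    headers = [
        ("Content-Type", content_type),
        ("Content-Length", str(len(body))),
        # Tell caches that the response depends on the Accept header.
        ("Vary", "Accept"),
    ]
    start_response("200 OK", headers)
    return [body]
```

The `Vary: Accept` header matters in this design because the same URL now has two representations, and a shared cache should not serve the markdown version to a browser.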

The homepage has some semantic HTML signals, but one or more title, metadata, heading, landmark, content, or link text checks need improvement.

Signal weight: 9/10 (Warn)

Evidence

url: https://heijka.nl/
quality: partial
score: 86

Recommendation

Next step: Avoid skipped heading levels so sections progress from h1 to h2 to h3 without gaps.
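
Skipped heading levels can be found with a short script. The sketch below uses Python's standard `html.parser`; `HeadingCollector` and `skipped_heading_levels` are hypothetical names, and the rule applied (a heading may be at most one level deeper than the heading before it) is an assumption about what the scanner checks.

```python
from html.parser import HTMLParser


class HeadingCollector(HTMLParser):
    """Record the level of each h1 to h6 start tag in document order."""

    def __init__(self):
        super().__init__()
        self.levels = []

    def handle_starttag(self, tag, attrs):
        if len(tag) == 2 and tag[0] == "h" and tag[1] in "123456":
            self.levels.append(int(tag[1]))


def skipped_heading_levels(html: str) -> list[tuple[int, int]]:
    """Return (previous_level, level) pairs where a level was skipped."""
    parser = HeadingCollector()
    parser.feed(html)
    gaps = []
    previous = 0  # 0 means "no heading seen yet", so a page should open with h1
    for level in parser.levels:
        if level > previous + 1:
            gaps.append((previous, level))
        previous = level
    return gaps


sample = "<h1>Shop</h1><h3>Chairs</h3><h2>Desks</h2>"
print(skipped_heading_levels(sample))
```

For the sample markup this reports the jump from h1 to h3; moving back up from h3 to h2 is not treated as a gap.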

Valid JSON-LD structured data was found with core Organization or WebSite schema.org types.

Signal weight: 10/10 (Pass)

Evidence

url: https://heijka.nl/
quality: good
hasStructuredData: true
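
A comparable check can be sketched by extracting JSON-LD blocks from the page and collecting their schema.org types. The code below is an illustration under assumptions: `JsonLdCollector`, `schema_types`, and the sample markup are hypothetical, and blocks that fail to parse as JSON are skipped.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect parsed JSON-LD documents from script tags."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.documents = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.documents.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip invalid blocks


def schema_types(html: str) -> set[str]:
    """Return every @type found at the top level or inside @graph."""
    parser = JsonLdCollector()
    parser.feed(html)
    types = set()
    for doc in parser.documents:
        nodes = doc.get("@graph", [doc]) if isinstance(doc, dict) else doc
        for node in nodes:
            value = node.get("@type") if isinstance(node, dict) else None
            if isinstance(value, str):
                types.add(value)
            elif isinstance(value, list):
                types.update(value)
    return types


sample = (
    '<script type="application/ld+json">'
    '{"@context": "https://schema.org", "@graph": ['
    '{"@type": "Organization", "url": "https://heijka.nl/"},'
    '{"@type": "WebSite", "url": "https://heijka.nl/"}]}'
    "</script>"
)
print(schema_types(sample) & {"Organization", "WebSite"})
```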

Content-Signal directive not detected in headers, HTML metadata, or robots.txt.

Signal weight: 0/5 (Fail)

Evidence

url: https://heijka.nl/
hasContentSignals: false
hasContentSignalHeader: false

Recommendation

Next step: Add the standard directive 'Content-Signal: ai-train=no, search=yes, ai-input=yes' to robots.txt, HTML metadata, or HTTP headers so AI systems can discover content usage preferences.
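
To show how a consumer might read the directive quoted in this recommendation, here is a small parsing sketch. The `parse_content_signal` function is hypothetical, and mapping `yes` and `no` to booleans while ignoring any other value is an assumption for illustration, not a statement of how any particular crawler behaves.

```python
def parse_content_signal(line: str) -> dict[str, bool]:
    """Parse a 'Content-Signal: key=yes, key=no' line into a dict of booleans."""
    name, _, value = line.partition(":")
    if name.strip().lower() != "content-signal":
        raise ValueError("not a Content-Signal directive")
    preferences = {}
    for item in value.split(","):
        key, separator, setting = item.partition("=")
        setting = setting.strip().lower()
        if separator and setting in ("yes", "no"):
            preferences[key.strip().lower()] = setting == "yes"
    return preferences


directive = "Content-Signal: ai-train=no, search=yes, ai-input=yes"
print(parse_content_signal(directive))
```

With the recommended values, training use is declined while search indexing and answer grounding are permitted.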

// generated fixes

Downloadable fix files

Preview the generated files below. Enter your email to reveal the full fixes, download the bundle, or copy the agent-ready implementation prompt.


llms.txt (Markdown)

# Heijka Kantoormeubelen – Dat zit goed

> Ergonomische kantoormeubelen voor comfort, gezondheid en productiviteit. Persoonlijk advies, snelle levering en gratis proefzitten sinds 2011. Dat zit goed.

This llms.txt file summarizes the public, canonical resources that AI assistants and crawlers should use to understand this site.

## Site Overview

- Canonical URL: https://heijka.nl/
- Site type: place
robots.txt (TXT)

# robots.txt additions
# Copy these blocks into the existing robots.txt file. Keep current rules unless a note calls out a conflicting Disallow.

# AI crawler access
# Add explicit Allow rules for blocked AI crawlers; remove or narrow conflicting Disallow rules if your crawler target requires precedence.
User-agent: GPTBot
Allow: /

User-agent: ChatGPT-User
Allow: /
schema.json (JSON)

{
  "@context": "https://schema.org",
  "@graph": [
    {
      "@type": "Organization",
      "@id": "https://heijka.nl/#organization",
      "name": "Heijka Kantoormeubelen – Dat zit goed",
      "description": "Ergonomische kantoormeubelen voor comfort, gezondheid en productiviteit. Persoonlijk advies, snelle levering en gratis proefzitten sinds 2011. Dat zit goed.",
      "url": "https://heijka.nl/",
      "logo": "https://heijka.nl/wp-content/uploads/heijka-logo-lichte-achtergrond.svg"
head meta (HTML)

# Content-Signal recommendations

Use these directives to make AI-use preferences explicit for compliant crawlers and AI systems. They are advisory signals, so keep them aligned with robots.txt, terms, and access controls.

## Recommended values

- ai-train=no: AI model training, fine-tuning, and dataset creation.
- search=yes: AI search indexing, snippets, and discovery.
- ai-input=yes: AI answer grounding, retrieval, and generated-response context.