Robots.txt Tester

Inspect how weaviate.io controls crawler access, blocked paths, sitemap references and AI crawler rules.

Preview

Score: 92

  • Important content paths appear blocked in robots.txt.

Get your full report + exact fixes

See what’s hurting your SEO and how to fix it step by step.

  • Full breakdown
  • Actionable fixes
  • Prioritized next steps

No spam. One email with your report and next steps.

Robots.txt Status

Robots.txt Status Present Score 92 /100 · Strong
Domain weaviate.io
Last analyzed June 18, 2026

View Full Robots.txt

Robots.txt Content Preview

Sitemap: https://weaviate.io/sitemap-index.xml
LLMS: https://weaviate.io/llms.txt

User-agent: *
Allow: /
Allow: /llms.txt
Disallow: /*?*
Disallow: /expert-sessions
Disallow: /blog/rss.xml
Disallow: /blog/atom.xml
Disallow: /feed
Disallow: /feed.xml
Disallow: /rss
Disallow: /rss.xml
Disallow: /atom
Disallow: /atom.xml

# AI Search Engine Bots
User-agent: GPTBot
Allow: /


User-agent: ChatGPT-User
Allow: /


User-agent: PerplexityBot
Allow: /


User-agent: ClaudeBot
Allow: /


User-agent: anthropic-ai
Allow: /


User-agent: Applebot-Extended
Allow: /

User-agent Rules

User-agent(s) Allowed paths Disallowed paths
*
  • /
  • /llms.txt
  • /*?*
  • /expert-sessions
  • /blog/rss.xml
  • /blog/atom.xml
  • /feed
  • /feed.xml
  • /rss
  • /rss.xml
  • /atom
  • /atom.xml
gptbot
  • /
No explicit Disallow rules.
chatgpt-user
  • /
No explicit Disallow rules.
perplexitybot
  • /
No explicit Disallow rules.
claudebot
  • /
No explicit Disallow rules.
anthropic-ai
  • /
No explicit Disallow rules.
applebot-extended
  • /
No explicit Disallow rules.

Blocked and Allowed Paths

Blocked paths
  • /*?*
  • /expert-sessions
  • /blog/rss.xml
  • /blog/atom.xml
  • /feed
  • /feed.xml
  • /rss
  • /rss.xml
  • /atom
  • /atom.xml
Allowed paths
  • /
  • /llms.txt
Crawl-delay No Crawl-delay directive detected.

Sitemaps Detected

AI Crawler Policy

No explicit blocks were detected for common AI crawlers (GPTBot, ChatGPT-User, ClaudeBot, PerplexityBot, Google-Extended).

Issues Found

  • Important content paths appear blocked in robots.txt.

Recommendations

  • Document your AI crawler policy explicitly in robots.txt so future bots know how to treat your content.
  • Ensure important pages, CSS and JavaScript assets are crawlable so search engines can fully render your site.

Analyze this site with other tools

Want a website that actually generates leads?

Start a conversion-focused website project with a team that builds fast, SEO-optimized sites for real businesses.