Robots.txt Tester
Inspect how weaviate.io controls crawler access, blocked paths, sitemap references and AI crawler rules.
Preview
Score: 92
- Important content paths appear blocked in robots.txt.
Get your full report + exact fixes
See what’s hurting your SEO and how to fix it step by step.
- Full breakdown
- Actionable fixes
- Prioritized next steps
Robots.txt Status
Robots.txt Status
Present
Score
92
/100
· Strong
View Full Robots.txt
Robots.txt Content Preview
Sitemap: https://weaviate.io/sitemap-index.xml LLMS: https://weaviate.io/llms.txt User-agent: * Allow: / Allow: /llms.txt Disallow: /*?* Disallow: /expert-sessions Disallow: /blog/rss.xml Disallow: /blog/atom.xml Disallow: /feed Disallow: /feed.xml Disallow: /rss Disallow: /rss.xml Disallow: /atom Disallow: /atom.xml # AI Search Engine Bots User-agent: GPTBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: PerplexityBot Allow: / User-agent: ClaudeBot Allow: / User-agent: anthropic-ai Allow: / User-agent: Applebot-Extended Allow: /
User-agent Rules
| User-agent(s) | Allowed paths | Disallowed paths |
|---|---|---|
| * |
|
|
| gptbot |
|
No explicit Disallow rules. |
| chatgpt-user |
|
No explicit Disallow rules. |
| perplexitybot |
|
No explicit Disallow rules. |
| claudebot |
|
No explicit Disallow rules. |
| anthropic-ai |
|
No explicit Disallow rules. |
| applebot-extended |
|
No explicit Disallow rules. |
Blocked and Allowed Paths
| Blocked paths |
|
|---|---|
| Allowed paths |
|
| Crawl-delay | No Crawl-delay directive detected. |
Sitemaps Detected
AI Crawler Policy
No explicit blocks were detected for common AI crawlers (GPTBot, ChatGPT-User, ClaudeBot, PerplexityBot, Google-Extended).
Issues Found
- Important content paths appear blocked in robots.txt.
Recommendations
- Document your AI crawler policy explicitly in robots.txt so future bots know how to treat your content.
- Ensure important pages, CSS and JavaScript assets are crawlable so search engines can fully render your site.