Robots.txt Tester
Inspect how qck.sh controls crawler access, blocked paths, sitemap references and AI crawler rules.
Robots.txt Status
Present
Score: 100/100 · Strong
Robots.txt Content Preview
    # QCK — robots.txt
    # Canonical: https://qck.sh/robots.txt

    # Default crawlers
    User-agent: *
    Allow: /
    Disallow: /app/
    Disallow: /admin/
    Disallow: /api/

    # Search engines — explicit allow
    User-agent: Googlebot
    Allow: /
    Disallow: /app/
    Disallow: /admin/

    User-agent: Googlebot-Image
    Allow: /

    User-agent: Bingbot
    Allow: /
    Disallow: /app/
    Disallow: /admin/

    User-agent: DuckDuckBot
    Allow: /

    User-agent: YandexBot
    Allow: /

    # AI assistants and LLM crawlers — explicit allow for visibility in AI answers
    User-agent: GPTBot
    Allow: /

    User-agent: OAI-SearchBot
    Allow: /

    User-agent: ChatGPT-User
    Allow: /

    User-agent: ClaudeBot
    Allow: /

    User-agent: Claude-Web
    Allow: /

    User-agent: anthropic-ai
    Allow: /

    User-agent: PerplexityBot
    Allow: /

    User-agent: Perplexity-User
    Allow: /

    User-agent: Google-Extended
    Allow: /

    User-agent: GoogleOther
    Allow: /

    User-agent: Applebot
    Allow: /

    User-agent: Applebot-Extended
    Allow: /

    User-agent: Meta-ExternalAgent
    Allow: /

    User-agent: meta-externalagent
    Allow: /

    User-agent: Bytespider
    Allow: /

    User-agent: cohere-ai
    Allow: /

    User-agent: Cohere-Bot
    Allow: /

    User-agent: Diffbot
    Allow: /

    User-agent: DuckAssistBot
    Allow: /

    User-agent: MistralAI-User
    Allow: /

    User-agent: YouBot
    Allow: /

    User-agent: AI2Bot
    Allow: /

    User-agent: FriendlyCrawler
    Allow: /

    # Common Crawl — used as training data by many AI providers
    User-agent: CCBot
    Allow: /

    # Sitemap and LLM index
    Sitemap: https://qck.sh/sitemap-index.xml

    # LLM-friendly index (llmstxt.org spec)
    # https://qck.sh/llms.txt
    # https://qck.sh/llms-full.txt

    Host: qck.sh
User-agent Rules
| User-agent(s) | Allowed paths | Disallowed paths |
|---|---|---|
| * | / | /app/, /admin/, /api/ |
| googlebot | / | /app/, /admin/ |
| googlebot-image | / | No explicit Disallow rules. |
| bingbot | / | /app/, /admin/ |
| duckduckbot | / | No explicit Disallow rules. |
| yandexbot | / | No explicit Disallow rules. |
| gptbot | / | No explicit Disallow rules. |
| oai-searchbot | / | No explicit Disallow rules. |
| chatgpt-user | / | No explicit Disallow rules. |
| claudebot | / | No explicit Disallow rules. |
| claude-web | / | No explicit Disallow rules. |
| anthropic-ai | / | No explicit Disallow rules. |
| perplexitybot | / | No explicit Disallow rules. |
| perplexity-user | / | No explicit Disallow rules. |
| google-extended | / | No explicit Disallow rules. |
| googleother | / | No explicit Disallow rules. |
| applebot | / | No explicit Disallow rules. |
| applebot-extended | / | No explicit Disallow rules. |
| meta-externalagent | / | No explicit Disallow rules. |
| meta-externalagent | / | No explicit Disallow rules. |
| bytespider | / | No explicit Disallow rules. |
| cohere-ai | / | No explicit Disallow rules. |
| cohere-bot | / | No explicit Disallow rules. |
| diffbot | / | No explicit Disallow rules. |
| duckassistbot | / | No explicit Disallow rules. |
| mistralai-user | / | No explicit Disallow rules. |
| youbot | / | No explicit Disallow rules. |
| ai2bot | / | No explicit Disallow rules. |
| friendlycrawler | / | No explicit Disallow rules. |
| ccbot | / | No explicit Disallow rules. |
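A per-agent breakdown like the one above can be reproduced by grouping Allow and Disallow lines under the User-agent lines that precede them. The following is a minimal sketch, not the tester's actual implementation: `parse_groups` and `ROBOTS_EXCERPT` are hypothetical names, and the excerpt is a shortened copy of the qck.sh file.

```python
# Shortened excerpt of the robots.txt shown in the content preview.
ROBOTS_EXCERPT = """\
# Default crawlers
User-agent: *
Allow: /
Disallow: /app/
Disallow: /admin/
Disallow: /api/

User-agent: Googlebot
Allow: /
Disallow: /app/
Disallow: /admin/

User-agent: GPTBot
Allow: /
"""


def parse_groups(text):
    """Return {agent: {"allow": [...], "disallow": [...]}} with lowercased agents."""
    groups = {}
    current_agents = []      # agents that the rules being read apply to
    reading_agents = False   # True while consecutive User-agent lines are seen
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if not reading_agents:
                # A User-agent line after rule lines starts a new group.
                current_agents = []
                reading_agents = True
            agent = value.lower()
            current_agents.append(agent)
            groups.setdefault(agent, {"allow": [], "disallow": []})
        elif field in ("allow", "disallow"):
            reading_agents = False
            if value:  # an empty Disallow value blocks nothing
                for agent in current_agents:
                    groups[agent][field].append(value)
    return groups


for agent, rules in parse_groups(ROBOTS_EXCERPT).items():
    disallowed = ", ".join(rules["disallow"]) or "No explicit Disallow rules."
    print(f"| {agent} | {', '.join(rules['allow'])} | {disallowed} |")
```

Lowercasing agent names explains why `Meta-ExternalAgent` and `meta-externalagent` show up as two identical rows: they are separate groups in the file that collapse to the same key.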
Blocked and Allowed Paths
| Item | Value |
|---|---|
| Blocked paths | /app/, /admin/, /api/ |
| Allowed paths | / |
| Crawl-delay | No Crawl-delay directive detected. |
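Because the default group lists both `Allow: /` and several Disallow paths, whether a URL is blocked depends on which rule takes precedence. The sketch below assumes the longest-match rule described in RFC 9309 (the most specific path wins, and Allow wins a tie). It ignores the `*` and `$` wildcards, `is_allowed` is a hypothetical helper, and the sample paths are invented for illustration.

```python
def is_allowed(path, allow, disallow):
    """Longest matching rule wins; Allow wins ties; no match means allowed."""
    best_len, allowed = -1, True
    for rule_allowed, patterns in ((True, allow), (False, disallow)):
        for pattern in patterns:
            if path.startswith(pattern):
                length = len(pattern)
                if length > best_len or (length == best_len and rule_allowed):
                    best_len, allowed = length, rule_allowed
    return allowed


# Rules of the default (*) group in the qck.sh robots.txt.
DEFAULT_ALLOW = ["/"]
DEFAULT_DISALLOW = ["/app/", "/admin/", "/api/"]

# Hypothetical paths, not taken from the report.
for path in ("/", "/pricing", "/app/dashboard", "/api/v1/links"):
    status = "allowed" if is_allowed(path, DEFAULT_ALLOW, DEFAULT_DISALLOW) else "blocked"
    print(f"{path}: {status}")
```

Under this rule `/app/dashboard` and `/api/v1/links` are blocked even though `Allow: /` appears first in the file, since `/app/` and `/api/` are longer matches than `/`.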
Sitemaps Detected
- https://qck.sh/sitemap-index.xml
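Sitemap references are independent of any User-agent group, so they can be collected with a simple line scan. This is a rough sketch under that assumption: `find_sitemaps` is a hypothetical helper, and `SAMPLE` is a shortened copy of the tail of the qck.sh file.

```python
def find_sitemaps(text):
    """Collect the URLs of all Sitemap lines, matching the field name case-insensitively."""
    sitemaps = []
    for raw in text.splitlines():
        line = raw.strip()
        if line.lower().startswith("sitemap:"):
            # Split on the first colon only, so the URL's own colons survive.
            sitemaps.append(line.split(":", 1)[1].strip())
    return sitemaps


SAMPLE = """\
# Sitemap and LLM index
Sitemap: https://qck.sh/sitemap-index.xml
# https://qck.sh/llms.txt
Host: qck.sh
"""

print(find_sitemaps(SAMPLE))
```

The commented-out `llms.txt` URLs are not picked up, because they are comments and not Sitemap directives.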
AI Crawler Policy
No explicit blocks were detected for common AI crawlers (GPTBot, ChatGPT-User, ClaudeBot, PerplexityBot, Google-Extended).
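One way such a check could work is to look up each AI crawler's group, fall back to the `*` group when none exists, and flag a blanket `Disallow: /` that is not offset by `Allow: /`. The sketch below is an assumption about the approach, not the tester's documented logic. `blocked_ai_crawlers` is a hypothetical helper, and the `groups` mappings are written by hand in the same shape as the User-agent Rules table.

```python
# Crawlers named in the AI Crawler Policy summary, lowercased.
AI_CRAWLERS = ["gptbot", "chatgpt-user", "claudebot", "perplexitybot", "google-extended"]

EMPTY_GROUP = {"allow": [], "disallow": []}


def blocked_ai_crawlers(groups, crawlers=AI_CRAWLERS):
    """Return crawlers whose applicable group has 'Disallow: /' without 'Allow: /'."""
    blocked = []
    for agent in crawlers:
        # Use the agent's own group if present, else the default (*) group.
        rules = groups.get(agent, groups.get("*", EMPTY_GROUP))
        if "/" in rules["disallow"] and "/" not in rules["allow"]:
            blocked.append(agent)
    return blocked


# Groups as reported for qck.sh: every listed AI crawler has Allow: / only.
qck_groups = {"*": {"allow": ["/"], "disallow": ["/app/", "/admin/", "/api/"]}}
for name in AI_CRAWLERS:
    qck_groups[name] = {"allow": ["/"], "disallow": []}

# Hypothetical contrast case: a site that blocks GPTBot outright.
strict_groups = {
    "*": {"allow": ["/"], "disallow": []},
    "gptbot": {"allow": [], "disallow": ["/"]},
}

print(blocked_ai_crawlers(qck_groups))
print(blocked_ai_crawlers(strict_groups))
```

For the qck.sh groups the result is an empty list, which matches the summary above. The contrast case reports only `gptbot`.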
Recommendations
- Document your AI crawler policy explicitly in robots.txt so future bots know how to treat your content.
- Ensure important pages, CSS and JavaScript assets are crawlable so search engines can fully render your site.