Robots.txt Tester
Inspect how thadhanisafety.com controls crawler access, blocked paths, sitemap references and AI crawler rules.
Robots.txt Status
Robots.txt Status
Present
Score
76
/100
ยท Needs review
View Full Robots.txt
Robots.txt Content Preview
User-agent: * Disallow: /wp-content/uploads/wc-logs/ Disallow: /wp-content/uploads/woocommerce_transient_files/ Disallow: /wp-content/uploads/woocommerce_uploads/ Disallow: /*?add-to-cart= Disallow: /*?*add-to-cart= Disallow: /wp-admin/ Allow: /wp-admin/admin-ajax.php # START YOAST BLOCK # --------------------------- User-agent: * Disallow: /?s= Disallow: /page/*/?s= Disallow: /search/ Disallow: /wp-json/ Disallow: /?rest_route= User-agent: AdsBot Disallow: / User-agent: CCBot Disallow: / User-agent: Google-Extended Disallow: / User-agent: GPTBot Disallow: / Sitemap: http://thadhanisafety.com/sitemap_index.xml # --------------------------- # END YOAST BLOCK
User-agent Rules
| User-agent(s) | Allowed paths | Disallowed paths |
|---|---|---|
| * |
|
|
| * | No explicit Allow rules. |
|
| adsbot | No explicit Allow rules. |
|
| ccbot | No explicit Allow rules. |
|
| google-extended | No explicit Allow rules. |
|
| gptbot | No explicit Allow rules. |
|
Blocked and Allowed Paths
| Blocked paths |
|
|---|---|
| Allowed paths |
|
| Crawl-delay | No Crawl-delay directive detected. |
Sitemaps Detected
AI Crawler Policy
At least one AI crawler (such as GPTBot, ChatGPT-User, ClaudeBot, PerplexityBot or Google-Extended) appears to be blocked by robots.txt.
Issues Found
- At least one user-agent has Disallow: / which blocks the entire site.
- Sitemap URL appears to be unreachable: http://thadhanisafety.com/sitemap_index.xml
- Blocking CSS or JS may prevent search engines from rendering pages correctly.
Recommendations
- Avoid blocking the entire site (Disallow: /); restrict only sensitive or low-value paths instead.
- Review your AI crawler policy for GPTBot, ChatGPT-User, ClaudeBot, PerplexityBot and Google-Extended to ensure it matches your content strategy.
- Ensure important pages, CSS and JavaScript assets are crawlable so search engines can fully render your site.