πŸ€–Search Engine Crawl Control

Free Robots.txt Generator

Create professional robots.txt files to control how search engines crawl your website. Block unwanted bots, set crawl delays, and improve your SEO instantly.

⚑ Quick Presets

βš™οΈ Robots.txt Configuration

Use * for all bots, or specify like "Googlebot", "Bingbot", "DuckDuckBot"

Paths crawlers should NOT access. Use / to block entire site.

Paths crawlers CAN access (overrides Disallow for specific URLs)

⚠️ Googlebot ignores crawl-delay. Use Google Search Console for Google crawl rate.

Add your XML sitemap location to help search engines discover all your pages

Add custom user-agent blocks or advanced rules (one per line)

πŸ“– How to Use Your Robots.txt File

  1. Configure your rules above using presets or custom settings
  2. Click "Generate Robots.txt" to create your file
  3. Click "Download" to save the robots.txt file
  4. Upload the file to your website's root directory (e.g., https://yourdomain.com/robots.txt)
  5. Test your robots.txt usingGoogle Search Console's robots.txt Tester
  6. Changes typically take effect within 24-48 hours

πŸ€– Common Bot User-Agents

GooglebotGoogle Search
BingbotBing Search
SlurpYahoo Search
DuckDuckBotDuckDuckGo
BaiduspiderBaidu Search
YandexBotYandex Search
facebookexternalhitFacebook
TwitterbotTwitter/X
LinkedInBotLinkedIn
GPTBotChatGPT / OpenAI
CCBotCommon Crawl
PetalBotHuawei Search
πŸ“‹ Common Rules

Popular Robots.txt Rules

Ready-to-use rules for common scenarios β€” just check and generate

Block Entire Site

User-agent: *
Disallow: /

Maintenance mode, staging sites

Allow Everything

User-agent: *
Allow: /

Default for most websites

Block Admin Area

User-agent: *
Disallow: /wp-admin/
Disallow: /admin/

WordPress, CMS admin

Block Login Pages

User-agent: *
Disallow: /login/
Disallow: /signin/

Authentication pages

Block Images

User-agent: *
Disallow: /images/
Disallow: /uploads/

Media directories

Googlebot Only

User-agent: Googlebot
Allow: /
User-agent: *
Disallow: /

Allow only Google

How to Use Your Robots.txt File

βš™οΈ
1

Configure Rules

Select presets or add custom rules for your website

πŸš€
2

Generate File

Click generate to create your robots.txt file

πŸ“
3

Upload to Root

Save as robots.txt and upload to your website root

βœ…
4

Test & Validate

Use Google Search Console to test your file

Important Notes & Best Practices

⚠️

Robots.txt Doesn't Guarantee Blocking

Search engines may still index pages if they find external links. Use noindex meta tags or password protection for sensitive content.

πŸ“

Syntax Matters

Robots.txt is case-sensitive. Use correct capitalization for user-agents and paths. One space after colon is required.

πŸ”—

Always Include Sitemap

Add your sitemap URL in robots.txt to help search engines discover all your pages efficiently.

πŸ“Š

Test Before Deploying

Always test your robots.txt file using Google Search Console's robots.txt Tester to avoid blocking important pages.

Frequently Asked Questions

What is the difference between Allow and Disallow?β–Ό

Disallow tells crawlers NOT to access specific URLs or directories. Allow explicitly permits crawling of specific URLs even within a disallowed directory. For example, you could disallow /private/ but allow /private/public-page.html.

Can I block specific search engines only?β–Ό

Yes! Use specific user-agent names like 'Googlebot' for Google, 'Bingbot' for Bing, 'Slurp' for Yahoo, or 'DuckDuckBot' for DuckDuckGo. Add rules under each specific user-agent.

What is Crawl-delay and should I use it?β–Ό

Crawl-delay tells bots to wait a specified number of seconds between requests. This is useful for reducing server load on busy sites. However, Googlebot ignores crawl-delay - use Google Search Console for crawl rate adjustment instead.

Where should I place my robots.txt file?β–Ό

The robots.txt file must be placed in the root directory of your website. For example: https://yourdomain.com/robots.txt. Search engines automatically check this location when crawling your site.

How long does it take for robots.txt changes to take effect?β–Ό

Search engines typically re-check robots.txt every few days. Google may take up to 24 hours to detect changes. You can force a re-crawl using Google Search Console's URL Inspection tool.

Can robots.txt prevent indexing of my pages?β–Ό

Robots.txt prevents crawling but doesn't guarantee that pages won't be indexed. If other sites link to your blocked pages, search engines might still index them without crawling. Use noindex meta tags for complete removal.