Whooked

  #1 (permalink)  
Old May 24th, 2008
th13rteen
Guide to Robots.txt

This little guide covers the basics of robots.txt in simple words.

What is it?

It's basically a plain text file named robots.txt which you upload to the root of your hosting account/server, and it tells search engine spiders which pages they may crawl and which ones they should not.
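Since the file has to sit at the root, here is a minimal Python sketch of where a spider would look for it, given any page on your site. The function name robots_txt_url and the example.com address are just placeholders for illustration.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url):
    """Return the URL where crawlers look for robots.txt for this page's site."""
    parts = urlsplit(page_url)
    # Keep only the scheme and host; the file always lives at the root path.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Placeholder address, not a real site of mine.
location = robots_txt_url("https://www.example.com/blog/post.html?id=7")
print(location)
```

So a robots.txt tucked away in a subfolder like /blog/ would not be picked up as the site's rules.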

Why use this?

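You don't have to. Use it only when you don't want some of your pages, or even the whole site, showing up in search engines, for example a private site. Keep in mind it's only a request that well-behaved spiders follow; it doesn't password-protect anything.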

What does it look like?

Here's a typical example of a robots.txt:

Code:
User-agent: *
Disallow: /cgi-bin/
Disallow: /images/
Disallow: /private-file.html
Disallow: /private/my-stuff.html

How do I get it to work?

User-agent: * <-- This tells which crawlers the rules below apply to. You can name a specific crawler (for example Googlebot) or apply the rules to all of them by using an asterisk, *.

Disallow: <-- This tells them which directories or files they should not crawl. If you don't want them to crawl your site at all, use a single forward slash, /.
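To see how the two lines work together, here is a rough sketch using Python's built-in urllib.robotparser. The crawler names BadBot and SomeOtherBot and the paths are made up for the example: BadBot is shut out of the whole site with a single slash, while everyone else is only kept out of /private/.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: one named crawler is blocked everywhere,
# all other crawlers are blocked from a single directory.
RULES = """\
User-agent: BadBot
Disallow: /

User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

# The named crawler matches its own group, where "/" covers every page.
badbot_home = parser.can_fetch("BadBot", "/index.html")

# Any other crawler falls back to the "*" group.
other_home = parser.can_fetch("SomeOtherBot", "/index.html")
other_private = parser.can_fetch("SomeOtherBot", "/private/notes.html")

print(badbot_home, other_home, other_private)
```

A crawler that finds a group with its own name uses that group; the asterisk group is the fallback for everybody else.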

Well, you first need to know which pages you don't want the spiders to crawl. From there it should be pretty simple.
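If you already have that list of pages, something like this could write the file for you. It's only a sketch: build_robots_txt is a helper name I made up, and the paths are the ones from the example, with file paths written without a trailing slash (my assumption about what was meant). The last part double-checks the result with Python's urllib.robotparser before you upload it.

```python
from urllib.robotparser import RobotFileParser


def build_robots_txt(disallowed_paths, user_agent="*"):
    """Return robots.txt text that asks `user_agent` to skip the given paths."""
    lines = [f"User-agent: {user_agent}"]
    for path in disallowed_paths:
        # Every rule path must start with a forward slash.
        if not path.startswith("/"):
            path = "/" + path
        lines.append(f"Disallow: {path}")
    return "\n".join(lines) + "\n"


# Directories end with a slash; single files do not.
robots_txt = build_robots_txt(
    ["/cgi-bin/", "/images/", "/private-file.html", "/private/my-stuff.html"]
)

# Sanity check with the standard-library parser before uploading the file.
checker = RobotFileParser()
checker.parse(robots_txt.splitlines())

print(robots_txt)
print(checker.can_fetch("*", "/images/logo.png"))
print(checker.can_fetch("*", "/index.html"))
```

Save the printed text as robots.txt and upload it to the root of your site.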

Let me know if you need help.

-th13rteen
  #2 (permalink)  
Old May 24th, 2008
elTico

Thanks for this again dude.. I really need this...
  #3 (permalink)  
Old May 24th, 2008
lineage2peru

Nice, thanks for this.