Stop Bots? How To Control and Identify MSNBot Using Robots.txt

Posted in E-Commerce, Technology by wayne.porter on November 30th, 2006

Naturally you are going to need to use robots.txt file, despite what engines say- this is not always obeyed nor will those block or throttle undesired agents.

Be sure when using that you put opening and closing brackets < and > around the META to use in its proper syntax.

Restrict MSNBot from indexing a page : META NAME=”msnbot” CONTENT=”noindex” /
Restrict all robots from indexing a page: META NAME=”*” CONTENT=”noindex” /

Restrict MSNBot from following links on page: META NAME=”msnbot” CONTENT=”nofollow” /

Restrict all bots from following links on a page: META NAME=”robots” CONTENT=”nofollow” /

Block MSNBot from indexing & following links: META NAME=”msnbot” CONTENT=”noindex,nofollow” /

Stop MSNBot from caching a page: META NAME=”msnbot” CONTENT=”nocache” /

or as an alternative: META NAME=”msnbot” CONTENT=”noarchive” /

Limit MSBOT crawl frequency

If you occasionally get high traffic from MSNBot, you can specify a crawl delay parameter in the robots.txt file to specify how often, in seconds, MSNBot can access your website. To do this, add this syntax to your robots.txt file:

User-agent: msnbot
Crawl-delay: 120

If you still find that MSNBot is placing too high a load on your web server, contact MSN Search Site Owner Support.

When you contact us about an issue, include the following information so that we can help you more quickly:

* The target website address that MSNBot put in the robots.txt file
* The date range when the issue occurred
* The access logs

Consider a free beta trial of botsense.com bot blocking control to control rogue bots and agents using .htaccess and apache.

Popularity: 2% [?]

One Response to “Stop Bots? How To Control and Identify MSNBot Using Robots.txt”

  1. Tu le Says:

    where can i find the file robots.txt ? I am very interested in trying to stop these bots ?

    Thanks,

    Tu

Leave a Reply

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Related Links



Close
E-mail It