Best way to stop crawler bots

March 26, 2015   /   by Marco  / Categories :  Technology

I recently discovered one of my VPS servers was running constantly at 1-2 CPU load and received notifications from the VPS provider that it was using more than my fair share of CPU resources and temporarily suspended the VPS to prevent it from impacting my neighbouring customers – which was fair enough.

Upon investigation there were several reasons why the CPU load was high and one of them being that the sight was being crawled by different bots, Google, Bing, Yahoo, Ahref, Yandex, Twitter and the list goes on. So in order to reduce the load I decided to investigate what the best way was to prevent all the bots except for Google to crawl through my websites (please note that on the same VPS I’m hosting multiple websites).

After some researching and testing, the best way I was able to stop all bots except for Google was to include the following in the robots.txt file:

User-Agent: *
Disallow: /

User-Agent: Googlebot
Allow: /

User-Agent: Googlebot-Mobile
Allow: /

User-Agent: Googlebot-Image
Allow: /

User-Agent: Mediapartners-Google
Allow: /

User-Agent: Adsbot-Google
Allow: /

I know some people may ask “Why only allow Google?” The answer to this question is because that’s the only search engine that I’ve noticed that has the highest referral visitors to my websites. I don’t see the point of having the other crawlers use up CPU and resources which could potentially slow down the website and I’d rather keep the website nice and clean with minimal user traffic. Also, I’ve noticed other crawlers are used for competitor analysis which I don’t really use.

robots text file

 

Do you have any other tips? Is this the best way to do this? If you have any other tips please let me know.

DO YOU LIKE WHAT YOU'VE READ?
Join our subscription list and receive our content right in your mailbox. If you like to receive some Great deals our Freebies then subscribe now!

Our Sponsors

  • Follow us:

OTHER ARTICLES YOU MAY LIKE


CHATGPT USEFUL FEATURE: TEMPORARY CHAT

CHATGPT USEFUL FEATURE: TEMPORARY CHAT

Everyday use of ChatGPT naturally trains you to think in threads. You ask a question, you get an answer, you refine the prompt, and the conversation accretes context. It’s one of the reasons ChatGPT feels more like a companion than a search box: it remembers what you said so it can respond appropriately next time […]

read more
HOW TO GET FREE UPGRADE TO PERPLEXITY.AI PRO FOR 12 MONTHS AND TESTING PRO FEATURES

HOW TO GET FREE UPGRADE TO PERPLEXITY.AI PRO FOR 12 MONTHS AND TESTING PRO FEATURES

If you spend time researching, comparing, or synthesising information online, you have probably encountered Perplexity.ai — a research-focused assistant that blends large language models with live web search and citation trails. The Pro plan supercharges that experience with priority access to powerful models, higher usage limits, and premium capabilities baked directly into the interface. Right […]

read more

Like our Page