|
|||||||
| Register | FAQ | Members List | Calendar | Search | Today's Posts | Mark Forums Read |
| Spiders, Crawlers and web robots Intelligence on search engine spider bots and identification, bad bots from spam botnets, content scrapers, tools to identify web robots, blocking malicious bots. |
![]() |
|
|
Thread Tools |
|
#1
|
||||
|
||||
|
grvcrawler/0.3
174.129.158.205 ec2-174-129-158-205.compute-1.amazonaws.com
grvcrawler/0.3 grvcrawler/0.3 is a bot running on the same IP that the Omgili.com forum search engine crawler has been run on, so I assume this might be a name change for the spider bot, but with no link to a spider bot identification page in their user agent to provide transparency to webmasters and server administrators, this bot will be banned by the uninformed, this is a great way for a botmaster or new search engine to shoot themselves in the foot. |
|
#2
|
||||
|
||||
|
You haven't blacklisted AWS yet? :P At what point do you say enough is enough?
|
|
#3
|
||||
|
||||
|
Well, if I ban the entire data center I would block Alexa and all of Amazon too most likely, I think the way things are going I will have blocked every IP c-network they have soon.
Once we get our new firewall system finalized I may have a better way to do things, it is not done yet. |
|
#4
|
||||
|
||||
|
174.129.158.205 ec2-174-129-158-205.compute-1.amazonaws.com
grvcrawler/0.3 grvcrawler was back again tonight quickly scanning our thread content, in keeping with our policy of banning bots that do not run a link back to a comprehensive spider ID page in their user agent, the IP c-network was banned. Sorry guys but you will have to do a lot better job of providing transparency before we will allow you to continue scanning content. |
![]() |
| Thread Tools | |
|
|