<?xml version="1.0" encoding="ISO-8859-1"?>

<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/">
	<channel>
		<title>Forum Posters Union - Spiders, Crawlers and web robots</title>
		<link>http://www.forumpostersunion.com</link>
		<description>Intelligence on search engine spider bots and identification, bad bots from spam botnets, content scrapers, tools to identify web robots, blocking malicious bots.</description>
		<language>en</language>
		<lastBuildDate>Fri, 10 Sep 2010 19:16:30 GMT</lastBuildDate>
		<generator>vBulletin</generator>
		<ttl>60</ttl>
		<image>
			<url>http://www.forumpostersunion.com/images/misc/rss.jpg</url>
			<title>Forum Posters Union - Spiders, Crawlers and web robots</title>
			<link>http://www.forumpostersunion.com</link>
		</image>
		<item>
			<title>X4d Backlinkchecker web robot</title>
			<link>http://www.forumpostersunion.com/showthread.php?t=14804&amp;goto=newpost</link>
			<pubDate>Sat, 04 Sep 2010 21:36:35 GMT</pubDate>
			<description>*04:20 PM Guest   Viewing Member List 
   85.214.19.188 findashop.de  
X4d Backlinkchecker  *

*No idea why this bot was scanning our member list,...</description>
			<content:encoded><![CDATA[<div><b>04:20 PM Guest   Viewing Member List <br />
   85.214.19.188 <i>findashop.de  </i><br />
X4d Backlinkchecker  </b><br />
<br />
<b>No idea why this bot was scanning our member list, could be another one of those poorly programmed SEO analysis bots !!! </b><br />
<br />
<b>PS: No link back to a <a href="http://www.forumpostersunion.com/showthread.php?t=3454" target="_blank">web crawler or spider ID page </a>in the user agent = BANNED !!</b><br />
<br />
<br />
IP Location:   Germany Berlin Strato Rechenzentrum Berlin  <br />
<br />
Resolve Host:  findashop.de  <br />
<br />
IP Address:  85.214.19.188       <br />
Reverse IP:  88 websites use this address. (examples: 3d-full-hd.net 3d-projektor.net 50plus-blog.net anbaubalkon-info.de)  <br />
<br />
inetnum:        85.214.16.0 - 85.214.139.255<br />
netname:        STRATO-RZG-DED2<br />
descr:          Strato Rechenzentrum, Berlin<br />
country:        DE<br />
admin-c:        SRDS-RIPE<br />
tech-c:         SRDS-RIPE<br />
remarks:        **************************************************  **********<br />
remarks:        * Please send abuse complaints to    *<br />
remarks:        * or fax +49-30-88615-755 ONLY.                            *<br />
remarks:        * Abuse reports to other e-mail addresses will be ignored. *<br />
remarks:        **************************************************  **********<br />
status:         ASSIGNED PA<br />
mnt-by:         STRATO-RZG-MNT<br />
source:         RIPE # Filtered<br />
<br />
role:           RIPE contact Strato Rechenzentrum AG Dedicated Server<br />
address:        Strato Rechenzentrum AG<br />
address:        Pascalstr. 10<br />
address:        D-10587 Berlin<br />
address:        Germany<br />
phone:          +49 30 39802-0<br />
abuse-mailbox:  <br />
admin-c:        XX1-RIPE<br />
tech-c:         WB14-RIPE<br />
tech-c:         CHSE-RIPE<br />
nic-hdl:        SRDS-RIPE<br />
remarks:        **************************************************  **********<br />
remarks:        * Please send abuse complaints to    *<br />
remarks:        * or fax +49-30-88615-755 ONLY.                            *<br />
remarks:        * Abuse reports to other e-mail addresses will be ignored. *<br />
remarks:        *                                                          *<br />
remarks:        * For peering requests or operational issues please look   *<br />
remarks:        * at the information in the AS6724 RIPE database object.   *<br />
remarks:        **************************************************  **********<br />
mnt-by:         STRATO-RZG-MNT<br />
source:         RIPE # Filtered<br />
<br />
route:        85.214.0.0/16<br />
descr:        Strato Rechenzentrum<br />
origin:       AS6724<br />
mnt-by:       STRATO-RZG-MNT<br />
source:       RIPE # Filtered<br />
<br />
route:          85.214.0.0/15<br />
descr:          Strato Rechenzentrum<br />
origin:         AS6724<br />
mnt-by:         STRATO-RZG-MNT<br />
source:         RIPE # Filtered</div>

]]></content:encoded>
			<category domain="http://www.forumpostersunion.com/forumdisplay.php?f=167">Spiders, Crawlers and web robots</category>
			<dc:creator>AnthonyCea</dc:creator>
			<guid isPermaLink="true">http://www.forumpostersunion.com/showthread.php?t=14804</guid>
		</item>
		<item>
			<title>atraxbot/0.3</title>
			<link>http://www.forumpostersunion.com/showthread.php?t=14737&amp;goto=newpost</link>
			<pubDate>Thu, 02 Sep 2010 21:55:58 GMT</pubDate>
			<description>*04:42 PM Guest    Viewing Thread 
 174.46.170.173 
Atrax Solutions atraxbot/0.3; http://www.atraxsolutions.com/atraxbot*

IP Location:   United...</description>
			<content:encoded><![CDATA[<div><b>04:42 PM Guest    Viewing Thread <br />
 174.46.170.173 <br />
Atrax Solutions atraxbot/0.3; <a href="http://www.atraxsolutions.com/atraxbot" target="_blank">http://www.atraxsolutions.com/atraxbot</a></b><br />
<br />
IP Location:   United States Kansas City Tw Telecom Holdings Inc  <br />
<br />
IP Address:  174.46.170.173       <br />
<br />
NetRange:       174.46.0.0 - 174.47.255.255<br />
CIDR:           174.46.0.0/15<br />
OriginAS:       <br />
NetName:        TWTC-NETBLK-17<br />
NetHandle:      NET-174-46-0-0-1<br />
Parent:         NET-174-0-0-0-0<br />
NetType:        Direct Allocation<br />
NameServer:     NS1.TWTELECOM.NET<br />
NameServer:     NS2.TWTELECOM.NET<br />
RegDate:        2009-01-14<br />
Updated:        2009-01-14<br />
Ref:            <a href="http://whois.arin.net/rest/net/NET-174-46-0-0-1" target="_blank">http://whois.arin.net/rest/net/NET-174-46-0-0-1</a><br />
<br />
OrgName:        tw telecom holdings, inc.<br />
OrgId:          TWTC<br />
Address:        10475 Park Meadows Drive<br />
City:           Littleton<br />
StateProv:      CO<br />
PostalCode:     80124<br />
Country:        US<br />
RegDate:        1999-03-17<br />
Updated:        2008-07-01<br />
Ref:            <a href="http://whois.arin.net/rest/org/TWTC" target="_blank">http://whois.arin.net/rest/org/TWTC</a><br />
<br />
ReferralServer: rwhois://rwhois.twtelecom.net:4321<br />
<br />
OrgAbuseHandle: TWTAD-ARIN<br />
OrgAbuseName:   tw telecom Abuse Desk<br />
OrgAbusePhone:  +1-800-898-6473 <br />
OrgAbuseEmail:  <br />
OrgAbuseRef:    <a href="http://whois.arin.net/rest/poc/TWTAD-ARIN" target="_blank">http://whois.arin.net/rest/poc/TWTAD-ARIN</a><br />
<br />
OrgTechHandle: NST12-ARIN<br />
OrgTechName:   NOC SWIP Team<br />
OrgTechPhone:  +1-800-898-6473 <br />
OrgTechEmail:  <br />
OrgTechRef:    <a href="http://whois.arin.net/rest/poc/NST12-ARIN" target="_blank">http://whois.arin.net/rest/poc/NST12-ARIN</a><br />
<br />
OrgNOCHandle: TDN1-ARIN<br />
OrgNOCName:   TWTC Data NOC<br />
OrgNOCPhone:  +1-800-898-6473 <br />
OrgNOCEmail:  <br />
OrgNOCRef:    <a href="http://whois.arin.net/rest/poc/TDN1-ARIN" target="_blank">http://whois.arin.net/rest/poc/TDN1-ARIN</a><br />
<br />
RAbuseHandle: TWTAD-ARIN<br />
RAbuseName:   tw telecom Abuse Desk<br />
RAbusePhone:  +1-800-898-6473 <br />
RAbuseEmail:  <br />
RAbuseRef:    <a href="http://whois.arin.net/rest/poc/TWTAD-ARIN" target="_blank">http://whois.arin.net/rest/poc/TWTAD-ARIN</a><br />
<br />
RTechHandle: ZT87-ARIN<br />
RTechName:   IP Manager<br />
RTechPhone:  +1-800-829-0420 <br />
RTechEmail:  <br />
RTechRef:    <a href="http://whois.arin.net/rest/poc/ZT87-ARIN" target="_blank">http://whois.arin.net/rest/poc/ZT87-ARIN</a><br />
<br />
RNOCHandle: TDN1-ARIN<br />
RNOCName:   TWTC Data NOC<br />
RNOCPhone:  +1-800-898-6473 <br />
RNOCEmail:  <br />
RNOCRef:    <a href="http://whois.arin.net/rest/poc/TDN1-ARIN" target="_blank">http://whois.arin.net/rest/poc/TDN1-ARIN</a><br />
<br />
== Additional Information From rwhois://rwhois.twtelecom.net:4321 ==<br />
<br />
network:Class-Name:network<br />
network:ID:9b636318-34c1-11de-a5e8-0015c5e45005<br />
network:Auth-Area:174.46.0.0/15<br />
network:Network-Name:GlobalEntity-174-46-170-160<br />
network:IP-Network:174.46.170.160/27<br />
network:Organization;I:425c8fde-34c0-11de-aa55-0015c5e45005<br />
network:Org-Name:nXn TECH<br />
network:Street-Address:8300 NORMAN CENTER DR<br />
network:City:BLOOMINGTON<br />
network:State:MN<br />
network:Postal-Code:55437<br />
network:Country-Code:us<br />
network:Phone:none<br />
network:Admin-Contact;I:none<br />
network:Tech-Contact;I:none<br />
network:Abuse-Contact;I:<br />
network:Updated:20090429070429000</div>

]]></content:encoded>
			<category domain="http://www.forumpostersunion.com/forumdisplay.php?f=167">Spiders, Crawlers and web robots</category>
			<dc:creator>AnthonyCea</dc:creator>
			<guid isPermaLink="true">http://www.forumpostersunion.com/showthread.php?t=14737</guid>
		</item>
		<item>
			<title>DoCoMo/1.0/D209i user agent?</title>
			<link>http://www.forumpostersunion.com/showthread.php?t=14646&amp;goto=newpost</link>
			<pubDate>Sun, 29 Aug 2010 15:00:07 GMT</pubDate>
			<description>I did a search on this useragent but came up with nothing definite;

DoCoMo/1.0/D209i

The IP for the one hit today is 200.223.17.114

Can anyone...</description>
			<content:encoded><![CDATA[<div>I did a search on this useragent but came up with nothing definite;<br />
<br />
DoCoMo/1.0/D209i<br />
<br />
The IP for the one hit today is 200.223.17.114<br />
<br />
Can anyone shed some light on this one please?</div>

]]></content:encoded>
			<category domain="http://www.forumpostersunion.com/forumdisplay.php?f=167">Spiders, Crawlers and web robots</category>
			<dc:creator>wingnut1</dc:creator>
			<guid isPermaLink="true">http://www.forumpostersunion.com/showthread.php?t=14646</guid>
		</item>
		<item>
			<title>Hailoobot/1.2</title>
			<link>http://www.forumpostersunion.com/showthread.php?t=14476&amp;goto=newpost</link>
			<pubDate>Fri, 20 Aug 2010 22:22:10 GMT</pubDate>
			<description>05:03 PM Guest   Viewing Index 
  38.125.95.250 
Mozilla/5.0 (compatible; Hailoobot/1.2; +http://www.hailoo.com/spider.html)


---Quote---
Web...</description>
			<content:encoded><![CDATA[<div>05:03 PM Guest   Viewing Index <br />
  38.125.95.250 <br />
Mozilla/5.0 (compatible; Hailoobot/1.2; +<a href="http://www.hailoo.com/spider.html" target="_blank">http://www.hailoo.com/spider.html</a>)<br />
<br />
<div style="margin:20px; margin-top:5px; ">
	<div class="smallfont" style="margin-bottom:2px">Quote:</div>
	<table cellpadding="6" cellspacing="0" border="0" width="100%">
	<tr>
		<td class="alt2">
			<hr />
			
				Web Crawling <br />
Like most Internet search engines, Hailoo is essentially an enormous index of the Internet. In order to create this index, Hailoo sends out &quot;web crawlers&quot; to explore and download interesting websites. A &quot;web crawler,&quot; also known as a web spider or robot, is a computer program which visits websites and extracts links from each page in order to find more pages to download. Each page is then added to Hailoo's index so that it can potentially appear in a search result. <br />
<br />
Unlike other Internet search engines, <b>Hailoo focuses specifically on the Arabic portion of the web</b>. Hailoo currently fetches more Arabic pages than any existing search engine. Our web crawlers are designed to specifically seek out Arabic web pages, or web pages which may be interesting to Middle Easterners or Arabic-speaking people. 
			
			<hr />
		</td>
	</tr>
	</table>
</div></div>

]]></content:encoded>
			<category domain="http://www.forumpostersunion.com/forumdisplay.php?f=167">Spiders, Crawlers and web robots</category>
			<dc:creator>AnthonyCea</dc:creator>
			<guid isPermaLink="true">http://www.forumpostersunion.com/showthread.php?t=14476</guid>
		</item>
		<item>
			<title>goblox Bot Search</title>
			<link>http://www.forumpostersunion.com/showthread.php?t=14407&amp;goto=newpost</link>
			<pubDate>Wed, 18 Aug 2010 04:09:57 GMT</pubDate>
			<description>*173.242.114.45 server2.web-leader.net 
goblox Bot Search * 

No idea what these guys are doing, but since they are running an unidentified bot (no...</description>
			<content:encoded><![CDATA[<div><b>173.242.114.45 server2.web-leader.net <br />
goblox Bot Search </b> <br />
<br />
No idea what these guys are doing, but since they are running an unidentified bot (no link back to a <a href="http://www.forumpostersunion.com/showthread.php?t=3454" target="_blank">spider identification page</a> in user agent) they are banned.</div>

]]></content:encoded>
			<category domain="http://www.forumpostersunion.com/forumdisplay.php?f=167">Spiders, Crawlers and web robots</category>
			<dc:creator>AnthonyCea</dc:creator>
			<guid isPermaLink="true">http://www.forumpostersunion.com/showthread.php?t=14407</guid>
		</item>
	</channel>
</rss>
