aus disallow

beispiel
1. htaccess
2. gesammelte ips

Navigation:
Frontpage / Content « htaccess
Disallow ist tot: http://disallow.de/blog/2006/07/05/disallow-ist-tot/


Schon bestehende Anti-Bot-htaccess-Dateien

Wer schon eine htaccess hat um Bots auszusperren möge diese doch bitte hier einstellen:


order deny,allow
deny from 38.0.0.0/8
deny from 38.112.0.0/13
deny from 59.186.77.108
deny from 63.88.212.
deny from 63.88.213.
deny from 64.111.192.0/20
deny from 64.124.122.
deny from 66.175.0.0/18
deny from 67.15.0.0/17
deny from 67.15.128.0/18
deny from 67.15.192.0/19
deny from 67.15.224.0/20
deny from 67.19.137.178
deny from 69.41.160.0/20
deny from 83.44.196.131
deny from 85.18.175.238
deny from 193.243.251.0/25
deny from 195.63.97.80
deny from 195.101.157.0/24
deny from 200.52.160/19
deny from 207.248.224/19
deny from 209.97.192.0/19
deny from 209.172.
deny from 212.65.242.237
deny from 212.19.35.224/28
deny from 212.121.162.
deny from 212.149.48.43
deny from 213.83.55.32/27
deny from 213.83.55.128/27
deny from 213.83.55.224/27
deny from 213.239.239.
deny from 217.20.112
deny from 217.20.113
deny from 217.20.114
deny from 217.20.115
deny from 217.20.116
deny from 217.20.117
deny from 217.20.118
deny from 217.20.119
deny from 217.172.186.195
deny from 217.218.0.0/17
deny from 217.218.128.0/19
deny from 217.218.160.0/20
deny from 217.218.176.0/21
deny from 217.218.184.0/22
deny from 217.218.188.0/23
deny from 217.218.190.0/24
deny from 217.218.191.0/32

Options +FollowSymlinks
RewriteEngine On
RewriteBase /

RewriteCond %{REMOTE_ADDR} ^63.148.99.2(2[4-9]|[3-4][0-9]|5[0-5])$ [OR] # Cyveillance spybot
RewriteCond %{REMOTE_ADDR} ^12\.148\.196\.(12[8-9]¦1[3-9][0-9]¦2[0-4][0-9]¦25[0-5])$ [OR] # NameProtect spybot
RewriteCond %{REMOTE_ADDR} ^12\.148\.209\.(19[2-9]¦2[0-4][0-9]¦25[0-5])$ [OR] # NameProtect spybot
RewriteCond %{REMOTE_ADDR} ^217\.73\.165\.40 [OR] # MSIE60-kennung aus ru
RewriteCond %{REMOTE_ADDR} ^70\.84\.[0-9]+\.[0-9]+$ [OR] # theplanet ISP
RewriteCond %{REMOTE_ADDR} ^70\.85\.([0-9]{1,2}|1([01][0-9]|2[0-7]))\.[0-9]+$ [OR] # theplanet ISP
RewriteCond %{REMOTE_ADDR} ^195\.166\.237\. [OR] # nigerianischer BettelbriefAdressenSammler
RewriteCond %{REMOTE_ADDR} ^64\.62\.(12[8-9]|1[3-9][0-9]|2[0-5][0-9])\. [OR] # OmniexplorerBot
RewriteCond %{REMOTE_ADDR} ^61\.135\130\.74 [OR] # sohu-search
RewriteCond %{REMOTE_ADDR} ^61\.135\131\.230 [OR] # sohu-search
RewriteCond %{REMOTE_ADDR} ^221\.(13[8-9]|1[4-6][0-9])\. [OR] # Korea Telecom
RewriteCond %{REMOTE_ADDR} ^66\.197\.(12[8-9]|1[3-9][0-9]|2[0-4][0-9]|25[0-5])\. [OR] # Dungeons Realm
RewriteCond %{HTTP_USER_AGENT} ^[A-Z]+$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*adressendeutschland.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^ArtfaceBot [OR]
RewriteCond %{HTTP_USER_AGENT} ^BlackWidow [OR]
RewriteCond %{HTTP_USER_AGENT} BorderManager [OR]
RewriteCond %{HTTP_USER_AGENT} BruinBot [OR]
RewriteCond %{HTTP_USER_AGENT} ^Bot\ mailto:craftbot@yahoo.com [OR]
RewriteCond %{HTTP_USER_AGENT} ^CherryPicker [OR]
RewriteCond %{HTTP_USER_AGENT} ^ChinaClaw [OR]
RewriteCond %{HTTP_USER_AGENT} ^CopyRightCheck [OR]
RewriteCond %{HTTP_USER_AGENT} ^ChinaClaw [OR]
RewriteCond %{HTTP_USER_AGENT} ^CMUImageBot/spider.pl [OR]
RewriteCond %{HTTP_USER_AGENT} ^Convera [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*Crawler.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^Crescent [OR]
RewriteCond %{HTTP_USER_AGENT} ^Custo [OR]
RewriteCond %{HTTP_USER_AGENT} ^DISCo [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*Downloader.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^Download\ Demon [OR]
RewriteCond %{HTTP_USER_AGENT} ^PingALink [OR]
RewriteCond %{HTTP_USER_AGENT} ^eCatch [OR]
RewriteCond %{HTTP_USER_AGENT} ^EirGrabber [OR]
RewriteCond %{HTTP_USER_AGENT} ^EmailCollector [OR]
RewriteCond %{HTTP_USER_AGENT} ^EmailSiphon [OR]
RewriteCond %{HTTP_USER_AGENT} ^EmailWolf [OR]
RewriteCond %{HTTP_USER_AGENT} ^EmeraldShield [OR]
RewriteCond %{HTTP_USER_AGENT} ^Express\ WebPictures [OR]
RewriteCond %{HTTP_USER_AGENT} ^ExtractorPro [OR]
RewriteCond %{HTTP_USER_AGENT} ^EyeNetIE [OR]
RewriteCond %{HTTP_USER_AGENT} EuripBot [OR]
RewriteCond %{HTTP_USER_AGENT} ^Faxobot [OR]
RewriteCond %{HTTP_USER_AGENT} ^FlashGet [OR]
RewriteCond %{HTTP_USER_AGENT} ^GetRight [OR]
RewriteCond %{HTTP_USER_AGENT} ^Go!Zilla [OR]
RewriteCond %{HTTP_USER_AGENT} ^Go-Ahead-Got-It [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*Grabber.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^GrabNet [OR]
RewriteCond %{HTTP_USER_AGENT} ^GornKer [OR]
RewriteCond %{HTTP_USER_AGENT} ^Grafula [OR]
RewriteCond %{HTTP_USER_AGENT} ^Holmes [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*HTTrack.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} TurnitinBot [OR]
RewriteCond %{HTTP_REFERER} iaea\.org [OR] # spambot
RewriteCond %{HTTP_USER_AGENT} ^ia_archiver [OR]
RewriteCond %{HTTP_USER_AGENT} ^Image\ Stripper [OR]
RewriteCond %{HTTP_USER_AGENT} ^Image\ Sucker [OR]
RewriteCond %{HTTP_USER_AGENT} Indy\ Library [OR] # spambot
RewriteCond %{HTTP_USER_AGENT} ^Infosearch [OR]
RewriteCond %{HTTP_USER_AGENT} ^InnerpriseBot [OR]
RewriteCond %{HTTP_USER_AGENT} ^InterGET [OR]
RewriteCond %{HTTP_USER_AGENT} ^Internet\ Ninja [OR]
RewriteCond %{HTTP_USER_AGENT} ^IRLbot [OR]
RewriteCond %{HTTP_USER_AGENT} ^Irvine [OR]
RewriteCond %{HTTP_USER_AGENT} ^JetCar [OR]
RewriteCond %{HTTP_USER_AGENT} ^JOC\ Web\ Spider [OR]
RewriteCond %{HTTP_USER_AGENT} DTS\ Agent [OR] # spambot
RewriteCond %{HTTP_USER_AGENT} ^Knowledge.com [OR]
RewriteCond %{HTTP_USER_AGENT} ^larbin [OR]
RewriteCond %{HTTP_USER_AGENT} ^LeechFTP [OR]
RewriteCond %{HTTP_USER_AGENT} ^lwp-trivial [OR]
RewriteCond %{HTTP_USER_AGENT} ^Mass\ Downloader [OR]
RewriteCond %{HTTP_USER_AGENT} ^MIDown\ tool [OR]
RewriteCond %{HTTP_USER_AGENT} ^Mister\ PiX [OR]
RewriteCond %{HTTP_USER_AGENT} ^Mnogosearch [OR]
RewriteCond %{HTTP_USER_AGENT} ^oBot [OR] # spybot
RewriteCond %{HTTP_USER_AGENT} ^Ocelli [OR]
RewriteCond %{HTTP_USER_AGENT} ^OmniExplorer [OR]
RewriteCond %{HTTP_USER_AGENT} ^McBot [OR] # oesterreichische softwarefirma
RewriteCond %{HTTP_USER_AGENT} ^Microsoft [OR]
RewriteCond %{HTTP_USER_AGENT} ^Mozilla.*\ obot [OR] # spybot
RewriteCond %{HTTP_USER_AGENT} ^NaverBot [OR]
RewriteCond %{HTTP_USER_AGENT} ^Navroad [OR]
RewriteCond %{HTTP_USER_AGENT} ^NearSite [OR]
RewriteCond %{HTTP_USER_AGENT} ^NetAnts [OR]
RewriteCond %{HTTP_USER_AGENT} ^NetSpider [OR]
RewriteCond %{HTTP_USER_AGENT} ^Net\ Vampire [OR]
RewriteCond %{HTTP_USER_AGENT} ^NetZIP [OR]
RewriteCond %{HTTP_USER_AGENT} ^NICErsPRO [OR]
RewriteCond %{HTTP_USER_AGENT} NPBot [OR] # NameProtect spybot
RewriteCond %{HTTP_USER_AGENT} NG/2.0 [OR]
RewriteCond %{HTTP_USER_AGENT} ^NG-Search [OR] # ein Arcor-Bot
RewriteCond %{HTTP_USER_AGENT} ^Octopus [OR]
RewriteCond %{HTTP_USER_AGENT} ^PageGrabber [OR]
RewriteCond %{HTTP_USER_AGENT} ^Papa\ Foto [OR]
RewriteCond %{HTTP_USER_AGENT} ^pavuk [OR]
RewriteCond %{HTTP_USER_AGENT} ^pcBrowser [OR]
RewriteCond %{HTTP_USER_AGENT} ^PEERbot [OR] # favicon-browser
RewriteCond %{HTTP_USER_AGENT} PicSpider [OR] # bildkiste de
RewriteCond %{HTTP_USER_AGENT} ^RedKernel [OR] # keine robotstxt
RewriteCond %{HTTP_USER_AGENT} ^RIN\.\ Web\ crawler [OR] # spambot
RewriteCond %{HTTP_USER_AGENT} RPT-HTTPClient [OR] # nicht robots
RewriteCond %{HTTP_USER_AGENT} compatible\ ;\ MSIE\ 6.0 [OR] # spambot (note extra space before semicolon)
RewriteCond %{HTTP_USER_AGENT} ^IE\ \d\.\d\ Compatible.*Browser$ [OR] # spambot
RewriteCond %{HTTP_USER_AGENT} Microsoft\ URL\ Control [OR] # spambot
RewriteCond %{HTTP_USER_AGENT} ^Shit [OR] # favicon-browser
RewriteCond %{HTTP_USER_AGENT} ^stat [OR] # statcrawler@gmail.com
RewriteCond %{HTTP_USER_AGENT} ^.*Sucker.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} sohu [OR] # search oder agent
RewriteCond %{HTTP_USER_AGENT} ^SSM [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*thebestofnet.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^Web\ Image\ Collector [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebAuto [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebBandit [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebCopier [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebFetch [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebGo\ IS [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebLeacher [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebReaper [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebSauger [OR]
RewriteCond %{HTTP_USER_AGENT} ^Website\ eXtractor [OR]
RewriteCond %{HTTP_USER_AGENT} ^Website\ Quester [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebStripper [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebWhacker [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebZIP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Wget [OR]
RewriteCond %{HTTP_USER_AGENT} ^Widow [OR]
RewriteCond %{HTTP_USER_AGENT} ^WorQmada [OR]
RewriteCond %{HTTP_USER_AGENT} ^Xaldon\ WebSpider [OR]
RewriteCond %{HTTP_USER_AGENT} ^Yotta [OR] # Car Search Engine
RewriteCond %{HTTP_USER_AGENT} ^Zeus [OR]
RewriteCond %{HTTP_USER_AGENT} \([^\)]+$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^[a-z0-9]+
RewriteCond %{HTTP_USER_AGENT} !^msnbot
RewriteCond %{HTTP_USER_AGENT} !^appie
RewriteRule ^.*$ - [F]



2.)

# Spambots nach User_agent aussperren
RewriteCond %{HTTP_USER_AGENT} ^.*Whacker.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*FileHound.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*TurnitinBot.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*JoBo.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*adressendeutschland.*$
RewriteCond %{HTTP_USER_AGENT} ^BlackWidow [OR]
RewriteCond %{HTTP_USER_AGENT} ^Bot\ mailto:craftbot@yahoo.com [OR]
RewriteCond %{HTTP_USER_AGENT} ^CherryPicker [OR]
RewriteCond %{HTTP_USER_AGENT} ^ChinaClaw [OR]
RewriteCond %{HTTP_USER_AGENT} ^DISCo [OR]
RewriteCond %{HTTP_USER_AGENT} ^Download\ Demon [OR]
RewriteCond %{HTTP_USER_AGENT} ^JetCar [OR]
RewriteCond %{HTTP_USER_AGENT} ^JOC\ Web\ Spider [OR]
RewriteCond %{HTTP_USER_AGENT} ^larbin [OR]
RewriteCond %{HTTP_USER_AGENT} ^LeechFTP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Mass\ Downloader [OR]
RewriteCond %{HTTP_USER_AGENT} ^NetSpider [OR]
RewriteCond %{HTTP_USER_AGENT} ^NICErsPRO [OR]
RewriteCond %{HTTP_USER_AGENT} ^Octopus [OR]
RewriteCond %{HTTP_USER_AGENT} ^Offline\ Explorer [OR]
RewriteCond %{HTTP_USER_AGENT} ^Offline\ Navigator [OR]
RewriteCond %{HTTP_USER_AGENT} ^PageGrabber [OR]
RewriteCond %{HTTP_USER_AGENT} ^ReGet [OR]
RewriteCond %{HTTP_USER_AGENT} ^Siphon [OR]
RewriteCond %{HTTP_USER_AGENT} ^SiteSnagger [OR]
RewriteCond %{HTTP_USER_AGENT} ^Surfbot [OR]
RewriteCond %{HTTP_USER_AGENT} ^tAkeOut [OR]
RewriteCond %{HTTP_USER_AGENT} ^VoidEYE [OR]
RewriteCond %{HTTP_USER_AGENT} ^Web\ Image\ Collector [OR]
RewriteCond %{HTTP_USER_AGENT} ^Web\ Sucker [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebAuto [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebCopier [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebFetch [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebReaper [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebSauger [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebStripper [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebZIP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Wget [OR]
RewriteCond %{HTTP_USER_AGENT} ^Zeus [OR]
RewriteCond %{HTTP_USER_AGENT} ^TurnitinBot [OR]
RewriteRule /* http://www.mvssolutions.com/spam.html [L,R]

-

Deny from .move.to
Deny from .fullspeed.to
Deny from .comweb.pl
Deny from .etap.pl
Deny from .net.pl
Deny from .mypiece.com
Deny from .guest.de
Deny from .pagina.de
Deny from .superbikeclub.com
Deny from .splinder.com
Deny from 206.251.160.
Deny from 207.44.128.
Deny from 66.179.0.
Deny from 202.181.
Deny from 207.44.196.34
Deny from 66.179.230.80

-

Deny from 195.225.177.80

-
Local:
2006-07-05 14:46:26 - Recent page revisions in XML format (this page)
Global:
Frontpage / Content - UserSettings, logged in as p54A4DE70.dip.t-dialin.net - RecentChanges - Recent page revisions in XML format (all pages)




Navigation:
Frontpage / Content « IPs
Disallow ist tot: http://disallow.de/blog/2006/07/05/disallow-ist-tot/


IPs für die Datenbank von Disallow

Hier können schonmal IPs gesammelt werden die in die Datenbank sollen:



IPs aus den Logs von soultcer.net inkl. Timestamp des letzten Besuchs

cfetch/1.0
204.14.48.4 (1125039400)


findlinks/0.961 (+http://wortschatz.uni-leipzig.de/findlinks/)
139.18.13.203 (1125093546)


msnbot/1.0 (+http://search.msn.com/msnbot.htm)
65.54.188.11 (1130757616)
65.54.188.12 (1130046926)


Googlebot
Googlebot/2.1 (+http://www.google.com/bot.html)
66.249.64.4 (1126600551)
66.249.64.6 (1129002132)
66.249.64.10 (1127103881)
66.249.64.13 (1130556454)
66.249.64.15 (1128655609)
66.249.64.25 (1128663530)
66.249.64.27 (1128567957)
66.249.64.28 (1129913581)
66.249.64.30 (1129948017)
66.249.64.33 (1128003061)
66.249.64.35 (1127707147)
66.249.64.36 (1125381950)
66.249.64.37 (1128937209)
66.249.64.44 (1130123637)
66.249.64.47 (1128480239)
66.249.64.50 (1130122802)
66.249.64.54 (1129608192)
66.249.64.55 (1129944404)
66.249.64.58 (1128961330)
66.249.64.66 (1129957036)
66.249.64.68 (1127045084)
66.249.64.79 (1128001928)
66.249.65.67 (1126344107)
66.249.64.77 (1129949288)
66.249.71.1 (1128742156)
66.249.71.3 (1129864267)
66.249.71.14 (1129688043)
66.249.71.15 (1130469379)
66.249.71.17 (1129691243)
66.249.71.18 (1128005671)
66.249.71.28 (1128018611)
66.249.71.29 (1127967317)
66.249.71.32 (1129908159)
66.249.71.39 (1128966204)
66.249.71.40 (1129899940)
66.249.71.42 (1130295936)
66.249.71.47 (1128307371)
66.249.71.50 (1130210348)
66.249.71.53 (1130036675)
66.249.71.55 (1130641242)
66.249.71.55 (1129691759)
66.249.71.56 (1130733201)
66.249.71.57 (1129347824)
66.249.71.67 (1129911562)
66.249.71.69 (1128965460)
66.249.71.70 (1129897059)
66.249.71.72 (1129947447)
66.249.71.73 (1128927807)
81.169.137.25 (1129460155) -
> vermutlich fake, da ein leerer Referer mitgesendet wurde, der echte Googlebot sendet überhaupt keinen Referer

Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
66.249.65.19 (1127444399)
66.249.65.131 (1127506932)
66.249.65.172 (1129107569)
66.249.65.175 (1130201098)
66.249.65.202 (1127551489)
66.249.66.14 (1129431344)
66.249.66.235 (1128582042)

Mediapartners-Google/2.1
66.249.65.79 (1128445065)
66.249.65.133 (1128172929)
66.249.65.168 (1128515344)
66.249.65.172 (1129224625)
66.249.65.175 (1130003257)
66.249.66.14 (1129918083)
66.249.66.81 (1128696894)
66.249.66.161 (1128063898)
66.249.72.173 (1130607588)
66.249.72.179 (1130438602)
66.249.72.244 (1130261837)

Mozilla/4.0 (compatible; Google Desktop)
Sofern das halt wirklich von Google kommt...
213.196.205.140 (1130546901)


Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 4.0; obot)
213.252.152.24 (1125585594)


Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)
68.142.249.24 (1128950821)
68.142.249.48 (1127295822)
68.142.249.95 (1129069143)
68.142.250.28 (1126442453)
68.142.250.80 (1130398141)
68.142.250.88 (1130624357)
68.142.250.119 (1128914441)
68.142.251.44 (1129407966)
68.142.251.55 (1128223068)
202.160.180.83 (1128654297)
66.196.91.123 (1128118320)
66.196.91.167 (1128799999)
66.196.101.88 (1129101080)
68.142.249.24 (1130739521)
68.142.249.95 (1130767263)
68.142.250.119 (1130330317)


Mozilla/4.0 (compatible; MSIE 6.0; DocBrown - www.metadoctor.de/docbrown.php)
217.160.167.139 (1130605793)


SurveyBot/2.3 (Whois Source)
64.246.161.190 (1127693006)
64.246.165.150 (1126529219)
64.246.165.180 (1127100948)
64.246.165.190 (1128295514)
64.246.165.210 (1130124971)
64.246.178.34 (1128903623)


Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; phpSitemapNG 1.5.0)
83.133.49.201 (1126987309)


ConveraCrawler/0.9d (+http://www.authoritativeweb.com/crawl)
63.241.61.8 (1126934245)


thumbshots-de-Bot (Version: 1.02, powered by www.thumbshots.de)
212.112.229.155 (1127215367)


Mozilla/4.0 (compatible; MSIE 5.0; Windows NT; Girafabot; girafabot at girafa dot com; http://www.girafa.com)
64.210.196.198 (1127308327)


Snappy/1.1 ( http://www.urltrends.com/ )
205.138.199.126 (1130074462)


InternetArchive/0.8-dev (Nutch; http://lucene.apache.org/nutch/bot.html; nutch-agent@lucene.apache.org)
207.241.238.11 (1127954688)


DataparkSearch/4.32 (+http://www.dataparksearch.org/)
82.135.28.34 (1128111331)


Mozilla/2.0 (compatible; Ask Jeeves/Teoma; +http://sp.ask.com/docs/about/tech_crawling.html)
65.214.44.77 (1128252979)
65.214.45.13 (1129043663)


Gigabot/2.0
64.62.168.18 (1128940820)
64.62.168.20 (1128940896)
64.62.168.30 (1128547048)
64.62.168.45 (1128946836)


psbot/0.1 (+http://www.picsearch.com/bot.html)
217.212.224.146 (1128594667)


Seekbot/1.0 (http://www.seekbot.net/bot.html) HTTPFetcher/2.1
212.18.3.146 (1128697718)


Microsoft URL Control - 6.00.8862
80.86.200.54 (1128967526)


SBIder/0.8-dev (SBIder; http://www.sitesell.com/sbider.html; http://support.sitesell.com/contact-support.html)
64.34.145.197 (1129043207)


sohu agent
220.181.26.112 (1129417327)


LinkWalker
209.167.50.22 (1129684868)


xirq/0.1-beta (xirq; http://www.xirq.com; xirq@xirq.com)
67.18.178.36 (1130263498)




Fundsachen aus den Logs von bull

Syntryx ANT Scout Chassis Pheromone
http://www.Syntryx.com/ ANT Chassis 9.36; Mozilla/4.0 compatible crawler
64.92.201.114


SBIder/0.8-dev (SBIder; http://www.sitesell.com/sbider.html; http://support.sitesell.com/contact-support.html)
64.34.145.197


Sensis Web Crawler (search_comments\\at\\sensis\\dot\\com\\dot\\au)
66.151.181.10


envolk[ITS]spider/1.6 (+http://www.envolk.com/envolkspider.html)
71.102.140.247


Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; ODP links test; http://tuezilla.de/test-odp-links-agent.html)
81.169.154.94


cfetch/1.0
38.112.6.182


RufusBot (Rufus Web Miner; http://64.124.122.252/feedback.html)
64.124.122.228


RedKernel WWW-Spider 2/0 (+http://www-spider.redkernel-softwares.com/)
66.55.143.162


Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)
66.55.138.98


KFSW-Bot (Version: 1.01, powered by KFSW, www.kfsw.de)
212.227.64.121


84.9.48.0 - 84.9.51.255
bulldog dsl, GB. Holt einzelne Seiten (ohne Bilder/CSS) mit stets variabler IP und User-agent. Nur in diesem IP-Bereich.


209.97.203.36
kehrt periodisch mit gefaktem UA "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)" wieder.
Gehört zu RackForce Hosting, der IP-Bereich kann mit deny from 209.97.192.0/19 gesperrt werden.


Mozilla/4.0 (compatible; MSIE 4.0; Windows NT; Site Server 3.0 Robot) ACR
208.236.180.41
gehört zu acr.org


Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; heritrix/1.3.0 +http://www.cs.washington.edu/research/networking/websys/)
205.175.111.5


Mozilla/4.0 (compatible; Win32; WinHttp.WinHttpRequest.5)
62.149.130.176
Die IP gehört zu Technorail/Aruba, einem italienischen Hoster (62.149.128.0/17)


IlTrovatore-Setaccio/1.2 (Italy search engine; http://www.iltrovatore.it/bot.html; bot@iltrovatore.it)
213.215.201.235


ExactSearch
71.245.214.192


Mozilla/6.0
193.63.239.165 - Oxford University Press.


iCCrawler (http://www.iccenter.net)
212.89.128.89


Hi! I'm CsCrawler, my homepage: http://www.kde.cs.uni-kassel.de/lehre/ss2005/googlespam/crawler.html RPT-HTTPClient/0.3-3
141.51.167.141


Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.5) Gecko/20041128 Firefox/1.0 (Debian package 1.0-4)
194.145.226.2 - nicht den User-agent blocken
Local:
2006-07-05 14:46:06 - Recent page revisions in XML format (this page)
Global:
Frontpage / Content - UserSettings, logged in as p54A4DE70.dip.t-dialin.net - RecentChanges - Recent page revisions in XML format (all pages)
Valid XHTML 1.0 Transitional :: Valid CSS :: Powered by WikkaWiki