Forums › Life › Computers, Gadgets & Technology › Forum, Blog & Community Software › Bad bot block list
I’ve decided to start limiting bot access to the site so I’ve disallowed them all baring facebook, yahoo, bing, msn and google in robots. txt, I’ve started blocking the IPs of bots who don’t respect robots.txt and expanded Who’s Online with better spider reporting. So if you’re ever in Who’s Online and see a bot you think should be banned please paste their details below such as:
Spinn3r Spider | 22:29 | Viewing Page Home | 174-36-241-156.robot.spinn3r.com Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.0.19; aggregator:Spinn3r (Spinn3r 3.1); http://spin |
Thanks!
Magpie Spider 23:09 /forums/calendar.php?do=getinfo&e=2899 Viewing Event
illumiNaughty ” The Return” DINO PSARES.SABERTOOTH.ZEUS.DRAVEN.NOSF.MAZIEG
94.228.34.233
magpie-crawler/1.1 (U; Linux amd64; en-GB; +Social Media Monitoring Tools | Brandwatch)
I’m not sure of what you mean by not respecting robots…Is it when the operate outside their normal ip range?
there’s a file robots.txt in the server (I’ve shown it below).
it tells robots where they are allowed on the site – and where they are not – if they disobey it they are booted.
i.e these are the droids we are looking for – and others can JTFO
User-agent: Googlebot-Mobile
Allow: /
User-agent: Googlebot-Mobile
Allow: /
User-agent: Yahoo! Slurp
Allow: /
user-Agent: Yahoo-MMCrawler
Allow: /
user-Agent: YahooSeeker
Allow: /
user-Agent: msnbot-media
Allow: /
user-Agent: facebookexternalhit
Allow: /
User-agent: Googlebot
Disallow: /forums/index.php
Disallow: /forums/members/list/
Disallow: /forums/admincp/
Disallow: /forums/clientscript/
Disallow: /forums/cpstyles/
Disallow: /forums/images/
Disallow: /forums/modcp/
Disallow: /forums/ajax.php
Disallow: /forums/cron.php
Disallow: /forums/editpost.php
Disallow: /forums/global.php
Disallow: /forums/inlinemod.php
Disallow: /forums/joinrequests.php
Disallow: /forums/login.php
Disallow: /forums/misc.php
Disallow: /forums/moderator.php
Disallow: /forums/newattachment.php
Disallow: /forums/newreply.php
Disallow: /forums/newthread.php
Disallow: /forums/online.php
Disallow: /forums/postings.php
Disallow: /forums/printthread.php
Disallow: /forums/private.php
Disallow: /forums/profile.php
Disallow: /forums/register.php
Disallow: /forums/report.php
Disallow: /forums/reputation.php
Disallow: /forums/search.php
Disallow: /forums/sendmessage.php
Disallow: /forums/showgroups.php
Disallow: /forums/showpost.php
Disallow: /forums/subscription.php
Disallow: /forums/threadrate.php
Disallow: /forums/usercp.php
Disallow: /forums/usernote.php
Disallow: /forums/faq.php
User-agent: Bingbot
Disallow: /forums/index.php
Disallow: /forums/members/list/
Disallow: /forums/admincp/
Disallow: /forums/clientscript/
Disallow: /forums/cpstyles/
Disallow: /forums/images/
Disallow: /forums/modcp/
Disallow: /forums/ajax.php
Disallow: /forums/cron.php
Disallow: /forums/editpost.php
Disallow: /forums/global.php
Disallow: /forums/inlinemod.php
Disallow: /forums/joinrequests.php
Disallow: /forums/login.php
Disallow: /forums/misc.php
Disallow: /forums/moderator.php
Disallow: /forums/newattachment.php
Disallow: /forums/newreply.php
Disallow: /forums/newthread.php
Disallow: /forums/online.php
Disallow: /forums/postings.php
Disallow: /forums/printthread.php
Disallow: /forums/private.php
Disallow: /forums/profile.php
Disallow: /forums/register.php
Disallow: /forums/report.php
Disallow: /forums/reputation.php
Disallow: /forums/search.php
Disallow: /forums/sendmessage.php
Disallow: /forums/showgroups.php
Disallow: /forums/showpost.php
Disallow: /forums/subscription.php
Disallow: /forums/threadrate.php
Disallow: /forums/usercp.php
Disallow: /forums/usernote.php
Disallow: /forums/faq.php
User-agent: *
Disallow: /
Isn’t it possible in the Admin Panel to just allow the ones you want, or do the unwanted ones get in anyway?
Like Oddvar I’m not totally sure what you mean :crazy_diz
This one ???
Baidu Spider
07:41 showthread.php?t=35973&pagenumber= Viewing Thread
Nimai Le Santo (RIP): An Overdose and his Father’s Reaction
123.125.71.106
Mozilla/5.0 (compatible; Baiduspider/2.0; +???????????Baiduspider)
some bot is is also here as guests, last night I saw one looking in thrash for penis pills….it was stopped though. Infact there are many bots in here, I think they follow when people are asking bing, yahoo, google etc….
Whitevector Crawler Spider
12:08
forumdisplay.php?f=29&pagenumber= Viewing Forum The Law
217.149.51.37
Whitevector Crawler (+Whitevector)
In Who’s Online use display only Bots, sort by Last Activity Date and select Show User Agent… 🙂
not quite a bad bot but it made me laugh…
MSNBot Spider 21:30 showthread.php?t=28033&pagenumber= Viewing Thread
Microsoft wants to record your life… Check this out!!!
207.46.192.64
What is this?
Guest 10:10
Viewing Who’s Online
173.193.228.66
libwww-perl/5.837
And how to deal with this? ;
GeoTime – Geo-Temporal Information Visualization
Metropolitan Police trials GeoTime tracking software | Security Management | ZDNet UK
Police Buys GeoTime Software, Will Track Suspects Online | ITProPortal.com
Is there a way to sort this out?
@!sinner69! 437330 wrote:
What is this?
Guest 10:10
Viewing Who’s Online
173.193.228.66
libwww-perl/5.837
thats one of our external IP’s for the main site. I think its our own bot, and part of the software to make the site work..
@!sinner69! 437333 wrote:
And how to deal with this? ;
GeoTime – Geo-Temporal Information Visualization
Metropolitan Police trials GeoTime tracking software | Security Management | ZDNet UK
Police Buys GeoTime Software, Will Track Suspects Online | ITProPortal.com
Is there a way to sort this out?
this is up to the users to be careful with their devices. That said on forums there is little correct geographical info in a post (IP addresses can be hundreds of miles off) unless people are silly enough to repeatedly brag about crime.
in fact what gets people nicked from forum posts is not feds and smart technology but folk what lurk on forums and grass up stuff and we have been good at warning the users not to incriminate themselves.
0
Voices
36
Replies
Tags
This topic has no tags
Forums › Life › Computers, Gadgets & Technology › Forum, Blog & Community Software › Bad bot block list