Gossamer Forum
Home : General : Internet Technologies :

WebStripper

Quote Reply
WebStripper
Has anyone got experience with WebStripper?

I just noticed in my apache logs, that somebody using webstripper has downloaded my whole site, including all dynamic pages on the forum. Shouldn't a well behaved robot know that spidering dynamic pages does not make much sense?

Ivan
-----
Iyengar Yoga Resources / GT Plugins
Quote Reply
Re: [yogi] WebStripper In reply to
Site strippers like that should be banned. It takes the whole fun out of making a distinct site, that no-one else has. I mean, whats the point in having a 100% copied site? Unimpressed

Regarding stopping them. I'm not sure where...but I'm sure Paul came up with some code to put in Apache that would stop them being allowed to spider your server/sites Smile

Andy (mod)
andy@ultranerds.co.uk
Want to give me something back for my help? Please see my Amazon Wish List
GLinks ULTRA Package | GLinks ULTRA Package PRO
Links SQL Plugins | Website Design and SEO | UltraNerds | ULTRAGLobals Plugin | Pre-Made Template Sets | FREE GLinks Plugins!
Quote Reply
Re: [yogi] WebStripper In reply to
Code:
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} ^.*EmailSiphon [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^.*Wget [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^.*EmailWolf [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^.*HTTrack [NC]
RewriteRule ^.*$ http://www.aol.com [L]

Just keep adding naughty bots to the list - don't use [OR] for the last one in the list.

It's too late now your site has been grabbed but you can at least prevent anything more.

Last edited by:

Paul: Feb 7, 2003, 2:10 AM
Quote Reply
Re: [Paul] WebStripper In reply to
What does the last line do? Does this bounce the stripper to whatever url is shown below?

Quote:
RewriteRule ^.*$ http://www.aol.com [L]

Thanks Smile

------------------------------------------

Last edited by:

DogTags: Feb 7, 2003, 5:50 AM
Quote Reply
Re: [DogTags] WebStripper In reply to
Yeah - if it tries to grab your site it will just grab aol.com a load of times ;)
Quote Reply
Re: [Paul] WebStripper In reply to
Ya know, I should have asked the following, too:

Is the bounce line necessary, or can you just have the stripper lines?

I think I'd prefer preventing the strippers, but not bouncing to anywhere.

Thanks Smile

------------------------------------------
Quote Reply
Re: [yogi] WebStripper In reply to
Excellent thread over here

=> http://www.webmasterworld.com/forum13/687-1-15.htm

Last edited by:

Thomas.: Feb 7, 2003, 6:48 AM
Quote Reply
Re: [Thomas.] WebStripper In reply to
Thanks for the link, but that page is for members only....

Ivan
-----
Iyengar Yoga Resources / GT Plugins
Quote Reply
Re: [DogTags] WebStripper In reply to
I think changing the URL to a dash should work, ie -