Hi,
OK, sounds good. What you could do is log in via SSH and then "tail" the access_log to see if there are any references to Baiduspider:
Code:
cd /path/to/logs
tail -n20 -f access_log | grep "Baidu"
That would be a more proactive way to see if they are still getting through :)
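If you would rather get a total than watch the log live, something along these lines might help. It is only a rough sketch: count_bot_hits is a made-up helper name (not part of any package), and it assumes the bot's name shows up somewhere in each log line, which is the case for the usual Apache "combined" log format.

```shell
# Hypothetical helper (name is an assumption): count the log lines that mention a bot.
# Usage: count_bot_hits <logfile> <pattern>
count_bot_hits() {
    # -i: match case-insensitively, -c: print the number of matching lines
    # --: stop option parsing, so a pattern starting with "-" is still treated as a pattern
    grep -ic -- "$2" "$1"
}
```

You would run it from the logs directory, e.g. count_bot_hits access_log "Baiduspider". If the number keeps growing between runs, they are still getting through.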
Quote:
One other thing - I see that Googlebot also reads pages that have the part "guest=1131854" in the URL. Can it be prevented somehow?
I'm not sure how accurate this article is (as it is from 2006), but you should certainly be able to exclude the "guest" parameter from being read by Google:
http://cutroni.com/...ry-string-variables/
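A different approach from the one in that article would be a wildcard rule in robots.txt. This is only a sketch, assuming the parameter is always written as "guest=" in the URL and that you are happy for Googlebot to skip those URLs entirely. Google supports the "*" wildcard in robots.txt, but not every crawler does, so treat it as something to test rather than a guaranteed fix:

```
# Assumption: every URL to be skipped contains the text "guest="
User-agent: Googlebot
Disallow: /*guest=
```

Bear in mind that this matches any URL containing "guest=", including a parameter whose name merely ends in "guest".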
Cheers
Andy (mod)
andy@ultranerds.co.uk