This morning I received a very unpleasant email with this content:
Your web hosting account for mywebsite.org has been deactivated, as of 03/11/2016. (reason: site causing performance problems)
etc. etc. etc. ...
What???
After I contacted the hosting company and after a short "investigation", it became clear that it was a monster named "Baiduspider" that caused all the problems.
Here are a few lines from the access log:
"GET /forum/gforum.cgi?username=bozo202;guest=2662521&t=search_engine HTTP/1.0"
"GET /forum/gforum.cgi?username=ANICO;guest=1737188&t=search_engine HTTP/1.0"
"GET /forum/gforum.cgi?do=user_list;sb=user_username;so=ASC;first=D;guest=3208431&t=search_engine HTTP/1.0"
"GET /forum/gforum.cgi?do=search;guest=2184932&t=search_engine HTTP/1.0"
etc. etc. etc. ...
Please note that "guest=*******" is present in every single line.
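To make that pattern concrete, here is a minimal Python sketch that picks such requests out of a log. It assumes each entry is just the quoted request line as pasted above; the function names are my own, and the idea that "guest" plus "t=search_engine" marks a crawler hit is only my reading of these few lines, not something the forum software documents.

```python
import re
from urllib.parse import urlsplit


def parse_request_line(line):
    """Split a quoted request line into (method, path, params)."""
    method, target, _protocol = line.strip().strip('"').split(" ")
    parts = urlsplit(target)
    params = {}
    # The forum URLs mix ';' and '&' as parameter separators.
    for pair in re.split(r"[;&]", parts.query):
        if not pair:
            continue
        key, _, value = pair.partition("=")
        params[key] = value
    return method, parts.path, params


def looks_like_crawler_hit(line):
    """Assumption: a 'guest' id plus t=search_engine marks a crawler request."""
    _method, _path, params = parse_request_line(line)
    return "guest" in params and params.get("t") == "search_engine"


if __name__ == "__main__":
    sample = '"GET /forum/gforum.cgi?do=search;guest=2184932&t=search_engine HTTP/1.0"'
    print(parse_request_line(sample))
    print(looks_like_crawler_hit(sample))
```

Counting how many lines of the real log satisfy looks_like_crawler_hit would show how much of the load comes from these guest URLs.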
The bottom line is that I (read "we") need to stop: "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)"
How?
Many people have the same problem as me but it seems that "Baiduspider" is unstoppable!
Any idea how to solve this efficiently? Unfortunately, it seems that in this case neither robots.txt nor .htaccess is very helpful.
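To show what I mean by "stop", here is a rough Python sketch of refusing requests by User-Agent at the application level. It is only an illustration under assumptions: my forum is a CGI script, not a WSGI application, so block_baiduspider and is_baiduspider are hypothetical names and this is not something I can drop into my site as-is. It also only catches clients that actually announce themselves as Baiduspider.

```python
def is_baiduspider(user_agent):
    """Case-insensitive check for the crawler's name in the User-Agent header."""
    return "baiduspider" in (user_agent or "").lower()


def block_baiduspider(app):
    """Wrap a WSGI app so requests announcing Baiduspider get a 403."""
    def middleware(environ, start_response):
        if is_baiduspider(environ.get("HTTP_USER_AGENT")):
            start_response("403 Forbidden", [("Content-Type", "text/plain")])
            return [b"Forbidden\n"]
        # Everything else goes through to the wrapped application.
        return app(environ, start_response)
    return middleware
```

The point of the sketch is that the refusal is cheap: the expensive forum code never runs for a blocked request.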
Thanks in advance :)