yep.. my spider is programmed to not accept .cgi URLs with a ? after them, to avoid those problems.. except i still have to filter out goto.com sites.. TOOO MANY!
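A rough sketch of that kind of filter in Python, not the actual spider code. The names should_skip and BLOCKED_HOSTS are made up for illustration, and treating goto.com as a host to drop outright is an assumption about what "filter out" means here.

```python
from urllib.parse import urlsplit

# Assumption: hosts to drop outright. goto.com is the only one named in the post.
BLOCKED_HOSTS = {"goto.com"}

def should_skip(url):
    """Return True if the spider should not accept this URL."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    # A .cgi script called with a non-empty query string (the "?" case).
    if parts.path.lower().endswith(".cgi") and parts.query:
        return True
    # The host is a blocked domain or a subdomain of one.
    return any(host == b or host.endswith("." + b) for b in BLOCKED_HOSTS)
```

With this, should_skip("http://example.com/search.cgi?q=1") is True, while the same URL without the query string is accepted.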
i also delete all sites that do not actually reside on the domain in the url... so if i get something like
http://something.cjb.net
since cjb forwards.. i delete that site.. and add the site that it forwards to
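One way the forwarding check could look in Python, as a sketch only. host_of and url_to_index are hypothetical names, and final_url is assumed to be whatever address the spider ended up on after fetching the page.

```python
from urllib.parse import urlsplit

def host_of(url):
    """Lowercased hostname of a URL, or an empty string if there is none."""
    return (urlsplit(url).hostname or "").lower()

def url_to_index(requested_url, final_url):
    """Pick the URL to keep: the forward target if the host changed."""
    if host_of(requested_url) != host_of(final_url):
        # The site lives somewhere else, so drop the forwarder and keep the target.
        return final_url
    return requested_url
```

For plain HTTP redirects, final_url could come from urllib.request.urlopen(url).geturl(). A forward done with a frame or a meta refresh instead of an HTTP redirect would not show up that way and would need the page itself parsed.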
jerry