Gossamer Forum

Link harvesting or what?

Last night I downloaded copies of all my hit counter files and then deleted the originals from my site. This afternoon I looked in the folder and they were all there again. I know this isn't due to heavy traffic, because I only average about 1,000 hits a month. Judging by the times the files were created (all within seconds or minutes of each other) and the last IP address logged, it appears someone is harvesting the links from my directory, or it's just a robot spidering my site. The IP that appears in the logs is 209.202.148.41. Does anyone know if this IP address is associated with any search engine?

This bugs me because I caught a competing site harvesting my links before...

--Drew
Re: Link harvesting or what?
Drew, if you run a tracert, you can find out the source:

Code:

Tracing route to 209.202.148.41 over a maximum of 30 hops

 1  239 ms  266 ms  185 ms max250.earthnet.net [204.144.146.250]
 2  144 ms  594 ms  504 ms earthnet-gw.earthnet.net [199.45.146.254]
 3 1051 ms 1290 ms 1173 ms border-from-26-earthnet.boulder.co.coop.net [207.174.144.61]
 4 1292 ms 1024 ms 1288 ms core-gw-eth-11-4.boulder.co.coop.net [199.45.137.142]
 5 1305 ms 1295 ms 1294 ms bordercore3-serial1-0-0.Denver.cw.net [166.48.101.253]
 6  904 ms 1029 ms 1018 ms corerouter1.WillowSprings.cw.net [204.70.9.135]
 7 1293 ms 1147 ms 1287 ms core9.WillowSprings.cw.net [204.70.9.57]
 8 1310 ms 1028 ms  616 ms ibr01-s10-1-0.okbr01.exodus.net [216.32.132.181]
 9 1045 ms  799 ms  775 ms bbr02-g1-0.okbr01.exodus.net [216.34.183.66]
10  298 ms  418 ms  286 ms bbr01-p5-0.wlhm01.exodus.net [216.32.132.210]
11  770 ms  774 ms  658 ms dcr04-g2-0.wlhm01.exodus.net [64.14.70.66]
12  654 ms  405 ms  656 ms csr02-ve243.wlhm01.exodus.net [64.14.70.106]
13  490 ms  667 ms  781 ms 209.202.148.41

Trace complete.
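
If you'd rather not read the hops by eye, here is a rough Python sketch that pulls the last named hop out of tracert output and guesses the owning network from its hostname. The function names (`last_named_hop`, `parent_domain`) are made up for illustration, and keeping only the last two labels of a hostname is a naive assumption that won't hold for every domain.

```python
import re

# The last three hop lines, copied from the trace above.
TRACE = """\
11 770 ms 774 ms 658 ms dcr04-g2-0.wlhm01.exodus.net [64.14.70.66]
12 654 ms 405 ms 656 ms csr02-ve243.wlhm01.exodus.net [64.14.70.106]
13 490 ms 667 ms 781 ms 209.202.148.41
"""

# Hop number, timings, then "hostname [ip]" at the end of the line.
HOP = re.compile(
    r"^\s*(\d+)\s+.*?\s([A-Za-z0-9.-]+)\s+\[(\d+\.\d+\.\d+\.\d+)\]\s*$"
)

def last_named_hop(trace_text):
    """Return (hop_number, hostname, ip) for the last hop that shows a hostname."""
    found = None
    for line in trace_text.splitlines():
        m = HOP.match(line)
        if m:
            found = (int(m.group(1)), m.group(2), m.group(3))
    return found

def parent_domain(hostname):
    """Naive guess at the owning domain: keep the last two labels."""
    return ".".join(hostname.split(".")[-2:])

hop = last_named_hop(TRACE)
if hop:
    print(hop[1], "->", parent_domain(hop[1]))
```

The final hop has no hostname, so the sketch falls back to the router just before it, which is usually a fair hint at who hosts the address.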


I also tried an nslookup on the IP address you provided and got the following result:

Code:

**.****.com can't find 209.202.148.41: Non-existent host/domain
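
The same reverse lookup can be scripted. This is only a sketch using Python's standard `socket.gethostbyaddr`; the helper name `reverse_lookup` and its `resolver` parameter are my own additions so the lookup can be swapped out for testing. An address with no PTR record, like the one above, comes back as `None`.

```python
import socket

def reverse_lookup(ip, resolver=socket.gethostbyaddr):
    """Return the hostname registered for ip, or None if there is no reverse record."""
    try:
        # gethostbyaddr returns (hostname, aliaslist, ipaddrlist)
        hostname, _aliases, _addresses = resolver(ip)
        return hostname
    except (socket.herror, socket.gaierror):
        # Raised when the resolver finds no host for this address
        return None

# Example call (needs network access, so it is left commented out):
# print(reverse_lookup("209.202.148.41"))
```

A `None` result only means nobody set up reverse DNS for the address; it doesn't tell you whether the visitor is a legitimate search engine or not.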


One resource that you might want to use to ask about harvesting links from your site is:

http://www.searchengineforums.com/

If you search for the IP address there, you may find threads where it has been posted before. Users of that forum post potential attacks and the IP addresses involved.

Also, check your server logs and see which paths this IP address has requested.
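
Something like this would pull those requests out of an access log. It is a sketch that assumes your server writes Common Log Format; the sample lines, their dates, and the paths are invented for illustration, and `paths_for_ip` is a hypothetical helper name.

```python
import re

# Assumed Common Log Format:
# host ident user [time] "METHOD path PROTOCOL" status size
LOG_LINE = re.compile(
    r'^(\S+) \S+ \S+ \[([^\]]+)\] "(\S+) (\S+)[^"]*" (\d{3}) \S+'
)

def paths_for_ip(lines, ip):
    """Return (timestamp, path) pairs for every request made by ip, in log order."""
    hits = []
    for line in lines:
        m = LOG_LINE.match(line)
        if m and m.group(1) == ip:
            hits.append((m.group(2), m.group(4)))
    return hits

# Invented sample lines, not taken from any real log.
SAMPLE = [
    '209.202.148.41 - - [01/Jan/2001:14:02:11 -0700] "GET /links/ HTTP/1.0" 200 5120',
    '10.0.0.7 - - [01/Jan/2001:14:02:12 -0700] "GET /index.html HTTP/1.0" 200 2048',
    '209.202.148.41 - - [01/Jan/2001:14:02:13 -0700] "GET /links/music/ HTTP/1.0" 200 4096',
]

for when, path in paths_for_ip(SAMPLE, "209.202.148.41"):
    print(when, path)
```

With a real log you would pass an open file instead of `SAMPLE`. Requests a second or two apart that walk every category in order look a lot more like a robot than a person.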

Regards,

Eliot Lee


Re: Link harvesting or what?
I've had spiders crawl my site before from the 209.202.148.32 range. Close, but it's probably not the same spider/person.
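
For what it's worth, here is a quick way to check how close two addresses are, using Python's standard `ipaddress` module. The /24 default is purely an assumption on my part; nothing here says how the provider actually divides its address space, and `same_network` is just an illustrative name.

```python
import ipaddress

def same_network(ip_a, ip_b, prefix=24):
    """True if ip_b falls inside the /prefix network that contains ip_a."""
    # strict=False lets us pass a host address instead of a network address
    network = ipaddress.ip_network(f"{ip_a}/{prefix}", strict=False)
    return ipaddress.ip_address(ip_b) in network

print(same_network("209.202.148.41", "209.202.148.32"))      # True: same /24
print(same_network("209.202.148.41", "209.202.148.32", 29))  # False: different /29
```

Sharing a block only suggests the same hosting provider, not the same customer.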
Later,
Paul

http://www.fullmoonshining.com for Pearl Jam Fans
Re: Link harvesting or what?
I think it's the same set of spiders. A tracert on that IP brings up exodus.net, just like the IP I posted. It appears exodus.net is the hosting provider for those IPs, so I'm going to ask them and see if they can tell me anything.

Searches in several spider IP databases haven't turned up anything yet.

Thanks a lot :)

--Drew