Gossamer Forum

A tough robots.txt question

We have a test server that maps to the URL http://abc.sitename.com. We move all our files there for testing, and once we are done, we mirror the content from the test server to our live server, http://www.sitename.com.

My problem is that I need the test server to be visible to the outside world, but I can't allow any spiders to index it. On the live site, however, I do want spiders to index everything. The problem comes from the mirroring process: every file on the test server is mirrored to the live server. So if I just used a simple robots.txt file that said:

User-agent: *
Disallow: /

that file would end up on the live server after mirroring and block spiders there as well. Is there any way to specify an exact URL in a robots.txt file, like this:

User-agent: *
Disallow: http://abc.sitename.com/

I'm hoping there's a way to disallow everything on the test server, yet allow everything on the live server.
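
For what it's worth, here is a rough sketch of one way to get that behavior. Crawlers fetch /robots.txt separately from each hostname, and Disallow values are matched against URL paths rather than full URLs, so the choice has to be made per host when the file is served. This assumes robots.txt can be generated per request instead of being a static file that gets mirrored, which nothing in this thread confirms. The helper names robots_txt_for_host and may_crawl are made up for illustration, and the check uses Python's standard urllib.robotparser.

```python
from urllib.robotparser import RobotFileParser

# Hostname of the test server, taken from the post above.
TEST_HOST = "abc.sitename.com"

# An empty Disallow value means "nothing is disallowed".
BLOCK_ALL = "User-agent: *\nDisallow: /\n"
ALLOW_ALL = "User-agent: *\nDisallow:\n"


def robots_txt_for_host(host: str) -> str:
    """Hypothetical helper: choose the robots.txt body from the Host header."""
    # Drop an optional ":port" suffix and normalize case before comparing.
    hostname = host.split(":")[0].strip().lower()
    return BLOCK_ALL if hostname == TEST_HOST else ALLOW_ALL


def may_crawl(host: str, path: str, agent: str = "ExampleBot") -> bool:
    """Check what a well-behaved spider would conclude for this host."""
    parser = RobotFileParser()
    parser.parse(robots_txt_for_host(host).splitlines())
    return parser.can_fetch(agent, f"http://{host}{path}")


if __name__ == "__main__":
    for host in ("abc.sitename.com", "www.sitename.com"):
        print(host, may_crawl(host, "/index.html"))
```

A web handler for /robots.txt could return robots_txt_for_host(request_host) on both machines, so the same mirrored code answers differently depending on which hostname the spider asked for. If the file has to stay static, the alternative is to exclude robots.txt from the mirroring step, which depends on the mirroring tool in use.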
Subject | Author | Views | Date
Thread: A tough robots.txt question | ngoodman | 3683 | Apr 29, 2003, 8:08 AM
Post: Re: [ngoodman] A tough robots.txt question | hennagaijin | 3549 | Apr 30, 2003, 6:00 AM