Gossamer Forum
Home : General : Internet Technologies :

Re: [Andy] Internal Search Engine

Quote Reply
Re: [Andy] Internal Search Engine In reply to
I spotted this after a quick read through the docs:

Quote:
#
# If there are particular pages that you definitely do NOT want to index, you
# can use the exclude_urls attribute. The value is a list of string patterns.
# If a URL matches any of the patterns, it will NOT be indexed. This is
# useful to exclude things like virtual web trees or database accesses. By
# default, all CGI URLs will be excluded. (Note that the /cgi-bin/ convention
# may not work on your web server. Check the path prefix used on your web
# server.)
#
exclude_urls: /cgi-bin/ .cgi

#
# Since ht://Dig does not (and cannot) parse every document type, this
# attribute is a list of strings (extensions) that will be ignored during
# indexing. These are *only* checked at the end of a URL, whereas
# exclude_url patterns are matched anywhere.
#
bad_extensions: .wav .gz .z .sit .au .zip .tar .hqx .exe .com .gif \
.jpg .jpeg .aiff .class .map .ram .tgz .bin .rpm .mpg .mov .avi .css

Will that not do what you want? It looks like you can exclude folders, files and even specific file extensions.

I really just glanced over the docs so you might want to dig (no pun intended :) ) into it a little deeper to see if it will work for you.

~Charlie
Subject Author Views Date
Thread Internal Search Engine Andy 8211 Dec 19, 2003, 3:22 AM
Post Re: [Andy] Internal Search Engine
Wil 7938 Dec 19, 2003, 3:34 AM
Thread Re: [Andy] Internal Search Engine
Chaz 8000 Dec 19, 2003, 6:38 AM
Thread Re: [Chaz] Internal Search Engine
Andy 8004 Dec 19, 2003, 6:40 AM
Thread Re: [Andy] Internal Search Engine
Chaz 8010 Dec 19, 2003, 6:49 AM
Thread Re: [Chaz] Internal Search Engine
Andy 8013 Dec 19, 2003, 6:54 AM
Thread Re: [Andy] Internal Search Engine
Alex 7992 Dec 19, 2003, 3:07 PM
Post Re: [Alex] Internal Search Engine
Andy 7977 Dec 20, 2003, 2:21 AM