Gossamer Forum

dmoz import

dmoz import
Hi,

This question may have already been covered, if so, my apologies.

Links SQL beta 4: is there an importer for DMOZ content? Or are we still using the two-step process, i.e. import into Links SQL 1.x and then import 1.x into 2.x?

Thanks


-
Eric Geiler
Links SQL User
Re: dmoz import
Yes, I think you still have to import to LinksSQL1.13 and then into NG BETA4.

Paul Wilson.
(Don't blame me if I'm wrong!)
Re: dmoz import
I think there will be a new script. Alex said there would be an importer script, in a post in one of the threads about dmoz that I started.

http://www.ASciFi.com/ - The Science Fiction Portal
Re: dmoz import
In Reply To:
i think there will be a new script
It has been written and tested as a part of nph-import.cgi. It's not yet available to the masses ;) but whenever the next beta release comes out (should be soon) it will be included.

Note that although it is a .cgi file, this import will NOT be available via the web interface to nph-import.cgi (due to the amount of time parsing the entire DMOZ file takes), only through the shell interface.

I'm not sure when the next beta release will be out, but I can tell you that we're getting close to having everything finished and ready to go.

Jason Rhinelander
Gossamer Threads
jason@gossamer-threads.com
Re: dmoz import
Alex said tomorrow (although tomorrow has been the word for the previous 3 days :) ). Anyway, glad to hear it is going to be in it. Will be very useful.

Can you confirm which log file it is designed to use?

http://www.ASciFi.com/ - The Science Fiction Portal
Re: dmoz import
In Reply To:
Can you confirm which log file it is designed to use?
What do you mean by "log file"? Do you mean the dmoz file? You can download the file from dmoz directly (http://www.dmoz.com/rdf.html); the import can handle the file gzip compressed or uncompressed. Note that some browsers (IE, for example) will uncompress the file while downloading it; if you end up with a 6xxMB or larger file, then it's not compressed.

The actual file you need is content.rdf.u8.gz.
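
If you are not sure whether your browser already uncompressed the download, file size is only a rough guide. A more direct check is to look at the first two bytes of the file, since every gzip stream starts with 0x1f 0x8b. The following is a minimal Python sketch of that check, not part of Links SQL or nph-import.cgi; the helper names is_gzip and open_dump are hypothetical, and decoding as UTF-8 is an assumption that fits the u8 file.

```python
import gzip

# Every gzip stream begins with these two bytes (RFC 1952).
GZIP_MAGIC = b"\x1f\x8b"


def is_gzip(path):
    """Return True if the file at `path` starts with the gzip magic bytes."""
    with open(path, "rb") as f:
        return f.read(2) == GZIP_MAGIC


def open_dump(path):
    """Open a dump as text, whether or not it is gzip compressed.

    Assumption: the content is UTF-8; undecodable bytes are replaced
    rather than raising an error.
    """
    if is_gzip(path):
        return gzip.open(path, "rt", encoding="utf-8", errors="replace")
    return open(path, "r", encoding="utf-8", errors="replace")
```

For example, is_gzip("content.rdf.u8.gz") returning False would suggest the browser uncompressed the file on the way down, even though the name still ends in .gz.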

Jason Rhinelander
Gossamer Threads
jason@gossamer-threads.com
Re: dmoz import
Thanks. Yeah, it was just whether to use the u8 file or not that I was not sure about.

http://www.ASciFi.com/ - The Science Fiction Portal
Re: dmoz import
It'll work with both the u8 one and the non-u8 one.

Jason Rhinelander
Gossamer Threads
jason@gossamer-threads.com
Re: dmoz import
Even better. Any idea when it will be released along with beta5? "Tomorrow" as a date is getting confusing, especially as I am in a different time zone :)

http://www.ASciFi.com/ - The Science Fiction Portal
Re: dmoz import
Beta 5 was released last night

Jason Rhinelander
Gossamer Threads
jason@gossamer-threads.com
Re: dmoz import
excellent.. i missed it. thanks so much

http://www.ASciFi.com/ - The Science Fiction Portal
Re: dmoz import
Sorry, wrong one.

I type this:

perl nph-import.cgi --rdf-category="Top/Arts" --import RDF --source=/home/ascifi/www/structure.rdf.u8.gz.txt --rdf-add-date="2001-01-10"

and get:

GT::SQL (17722): Wrong argument passed to this subroutine. Usage: The defs directory '' does not exist, or is not a directory. at /home/ascifi/www/cgi-bin/dir/admin/Links/Import/RDFS2.pm line 68.