Gossamer Forum
Home : Products : Gossamer Links : Discussions :

DMOZ import help

Quote Reply
DMOZ import help
I am working on the dmoz .rdf import. The file size of the .rdf is in the 700 MB range (uncompressed). The import has been running for 6 days now and no where near completion. Last check, Catlinks are at 667,426 and Categories are at 54,205.

My question is a matter of performance. Does this at all sound odd that the import has been running for 6 days and only 1/4 of the way complete? At this rate, were talking 2-3 weeks for total import (fingers are crossed that it doesn't unexpectedly terminate along the way).

CPU resources are averaging 90-95% and memory only 10-12%. This leads me to believe it's not a performance issue on my end as the memory isn't strapped and now using disk cache.

The machine: 400 MHZ PII, 196 MB Ram, newest Perl, newest MySQL and mod_perl.

Any ideas on how to increase performance and lower import time?

The import will soon be running for an entire week. Does this sound odd?


Subject Author Views Date
Thread DMOZ import help lisco 2355 Feb 20, 2001, 4:10 PM
Post Re: DMOZ import help
Paul 2298 Feb 20, 2001, 4:21 PM
Thread Re: DMOZ import help
Alex 2295 Feb 20, 2001, 4:53 PM
Post Re: DMOZ import help
pugdog 2284 Feb 20, 2001, 8:01 PM
Thread Re: DMOZ import help
lisco 2269 Feb 21, 2001, 12:39 PM
Post Re: DMOZ import help
Alex 2248 Feb 23, 2001, 9:41 PM