May 17, 2000, 9:12 AM
Veteran / Moderator (6956 posts)
May 17, 2000, 9:12 AM
Post #2 of 4
Views: 1795
The ODP file is 100+ meg COMPRESSED. It uncompresses to almost 600 meg.
You can download it to your local machine, and cut it up, but you need an editor that can handle files of arbitrary size. I've given up with doing that on Windows, since the memory leaks are so bad the system just crashes after doing it. On Unix, JOE (and probably Emacs, though I don't know since I haven't used it in almost 20 years) can handle files of arbitrary size. To make it easier, I've cut the file into 50 meg pieces, then edit each one separately. Fortunately, for the pieces I wanted none crossed a 'split' boundary. If they did, you'd have to put the two files together, or split them differently.
I did a bunch of searching the net for RDF references, and of course, windows crashed before I could get them posted. But, there is a lot of information out there if you search for RDF format, but _none_ allows parsing/cuttting up of the DMOZ file, or an RDF file of that size.