I just had a bit of spare time before my dinner, so I wrote a script to split the 1GB DMOZ data file into separate files. It will parse any category you desire and dump it into a separate file.
It should be run from the command prompt (if using Windows) or a telnet/ssh window (if using *nix).
Usage is pretty simple....
Code:
perl rdf_parse.pl --rdf=/path/to/content.rdf.u8 --out=/path/to/new.file --cat=Top/Arts
Just do:
perl rdf_parse.pl
...and you'll get a summary of options.
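For anyone curious how a split like this can be done without loading the whole 1GB file into memory, here is a rough Python sketch. It is not the Perl script itself, just an illustration of the idea. It assumes the dump is made of <Topic r:id="..."> and <ExternalPage about="..."> blocks, with each page naming its category in a <topic> element, and that a category also covers its subcategories; the names filter_category, in_category and split_file are made up for this example.

```python
import re
from typing import Iterable, Iterator

# Assumed record shapes in content.rdf.u8 (not taken from the post):
#   <Topic r:id="Top/Arts"> ... </Topic>
#   <ExternalPage about="..."> ... <topic>Top/Arts</topic> ... </ExternalPage>
TOPIC_ID = re.compile(r'<Topic r:id="([^"]*)"')
PAGE_TOPIC = re.compile(r'<topic>([^<]*)</topic>')
RECORD_START = ("<Topic ", "<ExternalPage ")
RECORD_END = ("</Topic>", "</ExternalPage>")


def in_category(name: str, cat: str) -> bool:
    """True if name is cat itself or one of its subcategories."""
    return name == cat or name.startswith(cat + "/")


def filter_category(lines: Iterable[str], cat: str) -> Iterator[str]:
    """Yield the lines of every Topic/ExternalPage block under cat."""
    block = []      # lines of the record currently being read
    keep = False    # whether that record belongs to cat
    for line in lines:
        stripped = line.lstrip()
        if not block and not stripped.startswith(RECORD_START):
            continue  # header, footer, or blank line between records
        block.append(line)
        match = TOPIC_ID.search(line) or PAGE_TOPIC.search(line)
        if match and in_category(match.group(1), cat):
            keep = True
        if stripped.startswith(RECORD_END):
            if keep:
                yield from block
            block, keep = [], False


def split_file(rdf_path: str, out_path: str, cat: str) -> None:
    """Stream rdf_path and write only the records under cat to out_path."""
    with open(rdf_path, encoding="utf-8") as src, \
            open(out_path, "w", encoding="utf-8") as dst:
        dst.writelines(filter_category(src, cat))
```

Only one record is held in memory at a time, so the size of the input file does not matter much. Calling split_file("/path/to/content.rdf.u8", "/path/to/new.file", "Top/Arts") would mirror the command shown above, under the same assumptions about the file layout.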
Hope someone finds it useful.