Hello,
I have developed a Windows based program that will extract the dmoz directory from the net rather than the RDF dump. The program was developed to be a small client side extractor and spider suitable for pulling subcategories from the dmoz ... example if you had a photography site and wanted to just pull the photography directory from the dmoz as a starting point.
The program essentially lets you navigate to the desired category then auto extracts down through the subcategories. All the records are then entered to a database keeping the Description and Title. The program further is capable of spidering each site for keywords and email and then converting the entire db into Links 2.0 format files.
Anyway I have the program to the point that I need some input with respect to functionality and such. If anyone has any interest please email me and I'll send you the download URL.
Thanks.
I have developed a Windows based program that will extract the dmoz directory from the net rather than the RDF dump. The program was developed to be a small client side extractor and spider suitable for pulling subcategories from the dmoz ... example if you had a photography site and wanted to just pull the photography directory from the dmoz as a starting point.
The program essentially lets you navigate to the desired category then auto extracts down through the subcategories. All the records are then entered to a database keeping the Description and Title. The program further is capable of spidering each site for keywords and email and then converting the entire db into Links 2.0 format files.
Anyway I have the program to the point that I need some input with respect to functionality and such. If anyone has any interest please email me and I'll send you the download URL.
Thanks.