
vineet.yadav.iiit at gmail
May 13, 2012, 11:55 AM
Fwd: Indexing Wikipedia with Solr/Lucene
Hi all,

I want to build a Lucene/Solr index of the Wikipedia XML dump. I followed the Solr DataImportHandler example (http://wiki.apache.org/solr/DataImportHandler#Example:_Indexing_wikipedia) to index the dump. Because categories and external links are embedded in the wikitext of each page, that example indexes them only as part of the article text, not as separate fields. I would like to extract categories, external links, and similar elements and store each in its own field.

Could anyone advise on how to do this?

Thanks,
Vineet Yadav
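To make the request concrete, here is a minimal sketch of one way the extraction step might look, using only java.util.regex. The class name WikiFields and its methods are hypothetical and are not part of Solr or Lucene. The patterns assume English-language category tags of the form [[Category:Name]] or [[Category:Name|sort key]] and bracketed external links of the form [http://... label]; bare URLs, templates, and links inside <ref> tags are not handled, so a real wikitext parser would likely be more robust.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of Solr or Lucene.
class WikiFields {
    // Matches [[Category:Name]] and [[Category:Name|sort key]]; group 1 is the name.
    private static final Pattern CATEGORY = Pattern.compile(
        "\\[\\[\\s*Category\\s*:\\s*([^\\]|]+?)\\s*(?:\\|[^\\]]*)?\\]\\]",
        Pattern.CASE_INSENSITIVE);

    // Matches bracketed external links such as [http://example.org label];
    // group 1 is the URL. Bare URLs are deliberately not matched.
    private static final Pattern EXTERNAL_LINK = Pattern.compile(
        "\\[(https?://[^\\s\\]]+)[^\\]]*\\]");

    // Returns category names in order of appearance.
    static List<String> categories(String wikitext) {
        return collect(CATEGORY, wikitext);
    }

    // Returns external link URLs in order of appearance.
    static List<String> externalLinks(String wikitext) {
        return collect(EXTERNAL_LINK, wikitext);
    }

    // Collects capture group 1 of every match of the pattern in the text.
    private static List<String> collect(Pattern pattern, String text) {
        List<String> out = new ArrayList<>();
        Matcher m = pattern.matcher(text);
        while (m.find()) {
            out.add(m.group(1));
        }
        return out;
    }
}
```

The returned lists could then be added as values of separate multi-valued fields (for example, fields named category and external_link, both names being assumptions) at whatever point the page text is available before the document is indexed.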