
Mailing List Archive: Lucene: Java-User

Fwd: Indexing Wikipedia with Solr/Lucene

 

 



vineet.yadav.iiit at gmail

May 13, 2012, 11:55 AM


Hi all,
I want to create a Lucene/Solr index of the Wikipedia XML dump. I used the Solr
example (http://wiki.apache.org/solr/DataImportHandler#Example:_Indexing_wikipedia)
to index the dump. Because categories and external links are embedded in the
Wikipedia article text, I am not able to index them separately. I want to
extract categories, external links, etc. from the text and store them in
separate fields.
Could anyone please give me some advice?
Thanks
Vineet Yadav
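
To make the request concrete, here is a minimal sketch of the kind of extraction I have in mind, written as a standalone preprocessing step in Python rather than as part of the Solr example. The function name `split_wikitext` and the output keys `categories` and `external_links` are hypothetical, and the regular expressions assume the common `[[Category:Name|sort key]]` and `[http://url label]` wikitext forms only; they do not handle templates or bare URLs.

```python
import re

# Matches [[Category:Name]] and [[Category:Name|sort key]]; captures only the name.
CATEGORY_RE = re.compile(
    r"\[\[\s*Category\s*:\s*([^\]|]+?)\s*(?:\|[^\]]*)?\]\]",
    re.IGNORECASE,
)

# Matches bracketed external links such as [http://example.org label];
# captures only the URL.
EXTERNAL_LINK_RE = re.compile(r"\[(https?://[^\s\]]+)(?:\s+[^\]]*)?\]")


def split_wikitext(text):
    """Return category names and external link URLs found in raw wikitext.

    Hypothetical helper: the returned keys are illustrative field names,
    not fields defined by the Solr Wikipedia example.
    """
    categories = [m.group(1) for m in CATEGORY_RE.finditer(text)]
    external_links = [m.group(1) for m in EXTERNAL_LINK_RE.finditer(text)]
    return {"categories": categories, "external_links": external_links}


if __name__ == "__main__":
    sample = (
        "Lucene is a search library.\n"
        "* [http://lucene.apache.org/ Official site]\n"
        "[[Category:Search engines]]\n"
        "[[Category:Apache Software Foundation|Lucene]]\n"
    )
    print(split_wikitext(sample))
```

Each list could then be sent to its own multi-valued field, but I do not know the recommended way to do the equivalent inside the Solr import itself.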

