Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Lucene: Java-User

Supported File Formats - PDF, MHT

 

 

Lucene java-user RSS feed   Index | Next | Previous | View Threaded


naman.gupta at gmail

Feb 12, 2008, 11:13 PM

Post #1 of 2 (445 views)
Permalink
Supported File Formats - PDF, MHT

Hey

Does lucene support the files in pdf and mht file formats. I wasnt able to
retrieve any results after creating an index of such files. This is the
first time i am using lucene.

Thanks

Naman K Gupta


jpstotz at gmx

Feb 12, 2008, 11:55 PM

Post #2 of 2 (406 views)
Permalink
Re: Supported File Formats - PDF, MHT [In reply to]

Naman Gupta wrote:
> Does lucene support the files in pdf and mht file formats. I wasnt able to
> retrieve any results after creating an index of such files.

Well, the answer is simple: Lucene itself does not support any file
format. You need a file parser that converts your files to a plain text
representation. For details please see the Lucene FAQ:

http://wiki.apache.org/lucene-java/LuceneFAQ#head-c45f8b25d786f4e384936fa93ce1137a23b7e422

Jan

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
For additional commands, e-mail: java-user-help [at] lucene

Lucene java-user RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.