Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Lucene: c-dev

lucene searching in pdf

 

 

Lucene c-dev RSS feed   Index | Next | Previous | View Threaded


anton.feldmann at uni-bielefeld

Mar 20, 2006, 12:13 PM

Post #1 of 1 (3581 views)
Permalink
lucene searching in pdf

I am writing a program to search into an PDF document. I have problems
with generate an index file outof a lot of pdf documents. I want that i
can store more than one pdfFile into the indexFile and i want to that
the program is giving back the 1. file (apsolutepath) 2. word and lexem
3. score 4. and line how do i get n pdf documents in one indexfile
stored by 1, 2, 4?
i wrote a program that make an index of my filesystem and i can search
in the filesystem to find files. i can not read pdf files and pars them
with lucene.

i want to have an analyzer for all language lucene works with.

IndexWriter write = new IndexWriter(index, new GermanAnalyzer(),
true);

i use only the germananalyzer.

cheers

anton feldmann

Lucene c-dev RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact lists@gossamer-threads.com
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.