Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: kinosearch: discuss

lucene indexes

 

 

kinosearch discuss RSS feed   Index | Next | Previous | View Threaded


eric_morgan at infomotions

Jan 26, 2008, 7:22 AM

Post #1 of 3 (876 views)
Permalink
lucene indexes

Can KinoSearch (version 0.162) read Lucene (version 2.3.0) indexes?
At first glance, it seems the answer is no.

--
Eric Lease Morgan


_______________________________________________
KinoSearch mailing list
KinoSearch [at] rectangular
http://www.rectangular.com/mailman/listinfo/kinosearch


marvin at rectangular

Jan 26, 2008, 7:41 AM

Post #2 of 3 (818 views)
Permalink
Re: lucene indexes [In reply to]

On Jan 26, 2008, at 7:22 AM, Eric Lease Morgan wrote:

> Can KinoSearch (version 0.162) read Lucene (version 2.3.0) indexes?
> At first glance, it seems the answer is no.

The only release of KS that could read a Lucene (version 1.4.3) index
was 0.05, and that was only for pure ASCII source material.

The Lucene file format is gnarly -- it uses the illegal aberration
"modified UTF-8" for text encoding, it's compromised by exceedingly
complex optimizations, etc. The format wasn't originally designed to
be public; the spec was published as an afterthought. Developments
since 1.4.3 have made it even harder to work with.

Marvin Humphrey
Rectangular Research
http://www.rectangular.com/



_______________________________________________
KinoSearch mailing list
KinoSearch [at] rectangular
http://www.rectangular.com/mailman/listinfo/kinosearch


eric_morgan at infomotions

Jan 26, 2008, 7:48 AM

Post #3 of 3 (820 views)
Permalink
Re: lucene indexes [In reply to]

On Jan 26, 2008, at 10:41 AM, Marvin Humphrey wrote:

> The only release of KS that could read a Lucene (version 1.4.3)
> index was 0.05, and that was only for pure ASCII source material.
>
> The Lucene file format is gnarly -- it uses the illegal aberration
> "modified UTF-8" for text encoding, it's compromised by exceedingly
> complex optimizations, etc. The format wasn't originally designed
> to be public; the spec was published as an afterthought.
> Developments since 1.4.3 have made it even harder to work with.

Alas, sigh.

BTW, I see that version 0.20 of KinoSearch supports sorting. Cool!

--
Eric Lease Morgan



_______________________________________________
KinoSearch mailing list
KinoSearch [at] rectangular
http://www.rectangular.com/mailman/listinfo/kinosearch

kinosearch discuss RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.