
Mailing List Archive: Lucene: General
Index Ratio
 



MelbourneBeerBaron at gmail

Jun 24, 2009, 5:47 PM



Hi, I just completed a batch test index of ~1100 documents of various file
types, and I noticed that the original documents take up about 145MB but my
index is only 1.7MB. I remember reading somewhere that an index is typically
about 20-30% of the size of the original documents, but mine is a little over
1%! I'm not complaining; it just struck me as odd, especially as I have a lot
of archive files and emails with attachments that I parse as well. Has anyone
else experienced something like this? I'm just curious.

Cheers. Brett.
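
For anyone wanting to check the same figure on their own data, here is a minimal sketch of the arithmetic in Python. The helper names (dir_size_bytes, index_ratio) are made up for illustration and are not part of Lucene; the only numbers used are the two sizes quoted in the post.

```python
import os


def dir_size_bytes(path):
    """Sum the sizes of all regular files under path, recursively."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            total += os.path.getsize(os.path.join(root, name))
    return total


def index_ratio(source_size, index_size):
    """Return the index size as a fraction of the source size.

    Both arguments must use the same unit (bytes, MB, ...).
    """
    if source_size <= 0:
        raise ValueError("source size must be positive")
    return index_size / source_size


# Sizes quoted in the post, both in MB, so the units cancel.
ratio = index_ratio(145, 1.7)
print(f"{ratio:.2%}")  # prints 1.17%
```

To measure real directories instead, something like index_ratio(dir_size_bytes(docs_dir), dir_size_bytes(index_dir)) should work, where docs_dir and index_dir are placeholders for wherever the source documents and the index live.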


 
 

