Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Lucene: Java-User

What is the percent of size of lucene's index ?

 

 

Lucene java-user RSS feed   Index | Next | Previous | View Threaded


isaacrc82 at gmail

Jul 23, 2008, 9:00 AM

Post #1 of 3 (185 views)
Permalink
What is the percent of size of lucene's index ?

I need to know what is the percent of size of lucene's index respect the
information I'm going to index, I have read some articles that say if a I
index 120 Gb of information the index will grow until 40 Gb, that means the
percent is 30 %, Could somebody tell me how can be proved that ?
Is there any official document of apache lucene where says that ?
I hope somebody can help me.
Thanks.
Ariel


ian.lea at gmail

Jul 23, 2008, 9:11 AM

Post #2 of 3 (180 views)
Permalink
Re: What is the percent of size of lucene's index ? [In reply to]

I think there are too many variables to give a simple answer.

How much of your data are you storing? Indexing? Compressing?

Get a representative sample of your data and try it out.


--
Ian.


On Wed, Jul 23, 2008 at 5:00 PM, Ariel <isaacrc82[at]gmail.com> wrote:
> I need to know what is the percent of size of lucene's index respect the
> information I'm going to index, I have read some articles that say if a I
> index 120 Gb of information the index will grow until 40 Gb, that means the
> percent is 30 %, Could somebody tell me how can be proved that ?
> Is there any official document of apache lucene where says that ?
> I hope somebody can help me.
> Thanks.
> Ariel
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe[at]lucene.apache.org
For additional commands, e-mail: java-user-help[at]lucene.apache.org


mhall at informatics

Jul 23, 2008, 9:11 AM

Post #3 of 3 (178 views)
Permalink
Re: What is the percent of size of lucene's index ? [In reply to]

You can also use Luke after you've created your indexes to get their
exact size, and other interesting data points.

Like Ian said though, the decisions you make on a field by field basis
will make your index size vary quite a bit, so probably the best thing
you could do is simply try it out, and then examine it.

Matt

Ian Lea wrote:
> I think there are too many variables to give a simple answer.
>
> How much of your data are you storing? Indexing? Compressing?
>
> Get a representative sample of your data and try it out.
>
>
> --
> Ian.
>
>
> On Wed, Jul 23, 2008 at 5:00 PM, Ariel <isaacrc82[at]gmail.com> wrote:
>
>> I need to know what is the percent of size of lucene's index respect the
>> information I'm going to index, I have read some articles that say if a I
>> index 120 Gb of information the index will grow until 40 Gb, that means the
>> percent is 30 %, Could somebody tell me how can be proved that ?
>> Is there any official document of apache lucene where says that ?
>> I hope somebody can help me.
>> Thanks.
>> Ariel
>>
>>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe[at]lucene.apache.org
> For additional commands, e-mail: java-user-help[at]lucene.apache.org
>
>

--
Matthew Hall
Software Engineer
Mouse Genome Informatics
mhall[at]informatics.jax.org
(207) 288-6012



---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe[at]lucene.apache.org
For additional commands, e-mail: java-user-help[at]lucene.apache.org

Lucene java-user RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact lists@gossamer-threads.com
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.