<?xml version="1.0" encoding="iso-8859-1" ?>
<?xml-stylesheet title="XSL_formatting" type="text/xsl" href="/images/lists/rssstyle2.xsl"?>
<rss version="2.0">
<channel>
<title>Lucene | Java-User</title>
<description>Mailing List Archive by Gossamer Threads</description>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/</link>
<language>en-us</language>
<copyright>(c) Gossamer Threads Inc. All rights reserved.</copyright>
<lastBuildDate>11 Oct  2008 17:25:07 -0800</lastBuildDate>
<ttl>120</ttl>
<image>
<title>Gossamer Threads | Lucene | Java-User</title>
<width>75</width>
<height>23</height>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/</link>
<url>http://www.gossamer-threads.com/images/lists/rss_logo.jpg</url>
</image>
<item>
<title>Lucene 2.4.0 release</title>
<description>Release 2.4.0 of Lucene is now available! With 2.4.0 we have relaxed the backwards compatibility policy of the Fieldable interface: we now allow chan</description>
<pubDate>11 Oct  2008 08:50:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66225</link>
</item><item>
<title>Is lucene right for us</title>
<description>Hello All, At my university we have over 20,000 small file ranging from 20k to 500k per directory and we would like to index them. I was wondering if</description>
<pubDate>11 Oct  2008 07:22:28 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66223</link>
</item><item>
<title>Retrieving Top Terms for a subset of the index (or for all results of a query)</title>
<description>Hello everyone. I&amp;#039;ve been fiddeling with the idea of retrieving the top  terms from a subset of the index (i.e. top terms from the documents  retrie</description>
<pubDate>11 Oct  2008 03:36:04 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66221</link>
</item><item>
<title>Re: Indexing Scalability, Multiwriter?</title>
<description>Glen,  Thank you for the details there. Its really great what you&amp;#039;ve done and I will study it some more! I too though about using multiple writers i</description>
<pubDate>11 Oct  2008 02:21:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66220</link>
</item><item>
<title>Re: Indexing Scalability, Multiwriter?</title>
<description>IndexWriter is thread-safe and has been for a while (http://www.mail-archive.com/lucene-dev@jakarta.apache.org/msg00157.html) so you don&amp;#039;t have to wor</description>
<pubDate>10 Oct  2008 19:17:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66219</link>
</item><item>
<title>Indexing Scalability, Multiwriter?</title>
<description>Hi gang,  Wondering how folks have address scaled up indexing. I saw old threads about using clustered webapp with JNDI singleton index writer due to</description>
<pubDate>10 Oct  2008 14:05:16 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66218</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>Actually looking at this a little deeper maybe Lucene could/should automatically be doing this &amp;quot;short&amp;quot; optimisation here? Given a comparitively small</description>
<pubDate>10 Oct  2008 09:22:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66214</link>
</item><item>
<title>RE: Question regarding sorting and memory consumption in lucene</title>
<description>I have had a similar problem. What I do is load all the date field values at index startup, convert dates (timestamps) to a Julian date (# of seconds</description>
<pubDate>10 Oct  2008 08:57:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66213</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>That&amp;#039;s a really good idea Mark! :) Thanks! Will try to see if can make a quick change with your suggestion.  (Too bad quick isn&amp;#039;t really a word in my</description>
<pubDate>10 Oct  2008 08:45:02 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66212</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>Update: The statement &amp;quot;...cost is field size (10 bytes ?) times number of documents&amp;quot; is wrong. What you actually have is the cost of the unique strin</description>
<pubDate>10 Oct  2008 08:07:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66209</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>Yes, I understand that, and I did mean the number of documents, but I read  in the javadoc that: &amp;quot;For String fields, the cache is larger: in additio</description>
<pubDate>10 Oct  2008 07:58:14 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66208</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>I think you have your memory cost calculation wrong. The cost is field size (10 bytes ?) times number of documents NOT number of unique terms. The cac</description>
<pubDate>10 Oct  2008 07:43:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66207</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>Unfortunately no, since the documents that are added may come form a new  &amp;quot;source&amp;quot; containing old documents aswell..:/ I tried deploying our webappli</description>
<pubDate>10 Oct  2008 07:25:29 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66206</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>Assuming content is added in chronological order and with no updates to existing docs couldn&amp;#039;t you rely on internal Lucene document id to give a chron</description>
<pubDate>10 Oct  2008 06:18:46 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66205</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>I&amp;#039;ll follow up on my own question... Let&amp;#039;s say that we have 4 years of data, meaning that there will be roughly  4 * 365 = 1460 unique terms for our</description>
<pubDate>10 Oct  2008 05:51:50 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66202</link>
</item><item>
<title>Re: Only last field indexed</title>
<description>True, I guess I was thinking of things from a search-only perspective when I claimed they were identical... But you&amp;#039;re absolutely right in that you ca</description>
<pubDate>10 Oct  2008 05:47:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66201</link>
</item><item>
<title>Re: Release 2.4 on ibiblio</title>
<description>The release bits are indeed propagating through all mirrors, but I&amp;#039;m  going to wait until tomorrow to do the announcement, to make sure all  mirrors</description>
<pubDate>10 Oct  2008 05:31:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66200</link>
</item><item>
<title>Release 2.4 on ibiblio</title>
<description>Hi there, I&amp;#039;ve just noticed that there is already a 2.4 release available on ibiblio  (http://mirrors.ibiblio.org/pub/mirrors/maven2/org/apache/luce</description>
<pubDate>10 Oct  2008 05:27:40 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66199</link>
</item><item>
<title>Question regarding sorting and memory consumption in lucene</title>
<description>Hello, I&amp;#039;ve read a lot of threads now on memory consumption and sorting,  and I think I have a pretty good understanding of how things work, but I</description>
<pubDate>10 Oct  2008 05:09:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66198</link>
</item><item>
<title>Re: Buzz measurement - Aggregate functions</title>
<description>Ah, sorry. Just saw the bit about the free text query too. A FieldCache is the answer here I suspect in order to quickly retrieve the date values for</description>
<pubDate>10 Oct  2008 03:04:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66196</link>
</item><item>
<title>Re: Buzz measurement - Aggregate functions</title>
<description>Assuming your date data is held as YYYYMMDD and you want daily totals....     Term startTerm=new Term(&amp;quot;date&amp;quot;,&amp;quot;20080101&amp;quot;);     TermEnum termEn</description>
<pubDate>10 Oct  2008 02:40:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66195</link>
</item><item>
<title>Buzz measurement - Aggregate functions</title>
<description>Hi. Anyone have an idea of how I would create a query which finds the data backing a trend graph where date is X and num(docs) is on Y axis ? This i</description>
<pubDate>10 Oct  2008 02:12:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66194</link>
</item><item>
<title>Re: wizard for search in Lucene</title>
<description>From what I can understand, you want to insert the word &amp;quot;history&amp;quot; and then  get proposed &amp;quot;related&amp;quot; terms in combination with your input query. In ess</description>
<pubDate>10 Oct  2008 00:33:58 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66193</link>
</item><item>
<title>Re: Single searcher vs Multi Searcher</title>
<description>Hi Ganesh, Your situation seems pretty straight. I did not really split my database (storage), just that while indexing, I indexed the data into &amp;#039;n&amp;#039;</description>
<pubDate>09 Oct  2008 21:49:16 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66192</link>
</item><item>
<title>RE: Only last field indexed</title>
<description>Kalani, They are put into the document as separate fields. That&amp;#039;s why there are getFields() and deleteFields() methods on Document. John G. -----Or</description>
<pubDate>09 Oct  2008 19:29:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66191</link>
</item>
</channel>
</rss>
