<?xml version="1.0" encoding="iso-8859-1" ?>
<?xml-stylesheet title="XSL_formatting" type="text/xsl" href="/images/lists/rssstyle2.xsl"?>
<rss version="2.0">
<channel>
<title>Lucene | Java-User</title>
<description>Mailing List Archive by Gossamer Threads</description>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/</link>
<language>en-us</language>
<copyright>(c) Gossamer Threads Inc. All rights reserved.</copyright>
<lastBuildDate>12 Oct  2008 21:12:37 -0800</lastBuildDate>
<ttl>120</ttl>
<image>
<title>Gossamer Threads | Lucene | Java-User</title>
<width>75</width>
<height>23</height>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/</link>
<url>http://www.gossamer-threads.com/images/lists/rss_logo.jpg</url>
</image>
<item>
<title>Re: Enumerating all the terms of a particular field</title>
<description>Someone just asked this question a week ago (unfortunately they asked it on the wrong list)... http://www.nabble.com/Can-I-filter-the-results-returne</description>
<pubDate>12 Oct  2008 18:59:27 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66248</link>
</item><item>
<title>Re: Searching sets of documents</title>
<description>all folders which match &amp;quot;A AND Y&amp;quot;, do you search for file name? If yes, A or Y in &amp;quot;A AND Y&amp;quot; is a String too, so you can do it by: construct a Lucene</description>
<pubDate>12 Oct  2008 17:27:54 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66247</link>
</item><item>
<title>Enumerating all the terms of a particular field</title>
<description>Hello,  How can I get a list of all the terms for a particular field? Is the right approach to extend FilteredTermEnum? Thanks!!</description>
<pubDate>12 Oct  2008 15:13:01 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66246</link>
</item><item>
<title>Detecting why a collection of documents matched a query</title>
<description>Hello, I noticed that indexSearcher.explain() method is not supposed to be run for a large collection of documents, so I am looking for an alternativ</description>
<pubDate>12 Oct  2008 14:50:27 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66245</link>
</item><item>
<title>Searching sets of documents</title>
<description>Hi, I want to search for sets of documents. For instance I index some folders with documents in it and now I do not want to find certain documents bu</description>
<pubDate>12 Oct  2008 11:11:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66240</link>
</item><item>
<title>Re: Access Scoring Values of Lucene for Post-Processing</title>
<description>Have a look at the o.a.lucene.search.function package and the  ValueSourceQuery. You will probably be able to factor in those pieces  during scorin</description>
<pubDate>12 Oct  2008 08:40:02 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66235</link>
</item><item>
<title>Re: Retrieving Top Terms for a subset of the index (or for all results of a query)</title>
<description>How large of a subset are you talking? You might look at the FilteredTermEnum class, but you will probably  have to do some work on it to extend it</description>
<pubDate>12 Oct  2008 08:38:51 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66234</link>
</item><item>
<title>Re: Is lucene right for us</title>
<description>Lucene should work quite well for this, you&amp;#039;ll just need some  infrastructure around it to get the file and extract the contents (see  Lucene&amp;#039;s Tika</description>
<pubDate>12 Oct  2008 07:53:51 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66233</link>
</item><item>
<title>Re: Lucene 2.4.0 release</title>
<description>Congratulations!  A truly stellar achievement. Can&amp;#039;t wait to dive in! On Sat, 2008-10-11 at 11:50 -0400, Michael McCandless wrote: &amp;gt; Release 2.4.0</description>
<pubDate>11 Oct  2008 19:28:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66228</link>
</item><item>
<title>Lucene 2.4.0 release</title>
<description>Release 2.4.0 of Lucene is now available! With 2.4.0 we have relaxed the backwards compatibility policy of the Fieldable interface: we now allow chan</description>
<pubDate>11 Oct  2008 08:50:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66225</link>
</item><item>
<title>Is lucene right for us</title>
<description>Hello All, At my university we have over 20,000 small files ranging from 20k to 500k per directory and we would like to index them. I was wondering if</description>
<pubDate>11 Oct  2008 07:22:28 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66223</link>
</item><item>
<title>Retrieving Top Terms for a subset of the index (or for all results of a query)</title>
<description>Hello everyone. I&amp;#039;ve been fiddling with the idea of retrieving the top  terms from a subset of the index (i.e. top terms from the documents  retrie</description>
<pubDate>11 Oct  2008 03:36:04 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66221</link>
</item><item>
<title>Re: Indexing Scalability, Multiwriter?</title>
<description>Glen,  Thank you for the details there. It&amp;#039;s really great what you&amp;#039;ve done and I will study it some more! I too thought about using multiple writers i</description>
<pubDate>11 Oct  2008 02:21:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66220</link>
</item><item>
<title>Re: Indexing Scalability, Multiwriter?</title>
<description>IndexWriter is thread-safe and has been for a while (http://www.mail-archive.com/lucene-dev@jakarta.apache.org/msg00157.html) so you don&amp;#039;t have to wor</description>
<pubDate>10 Oct  2008 19:17:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66219</link>
</item><item>
<title>Indexing Scalability, Multiwriter?</title>
<description>Hi gang,  Wondering how folks have addressed scaled-up indexing. I saw old threads about using clustered webapp with JNDI singleton index writer due to</description>
<pubDate>10 Oct  2008 14:05:16 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66218</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>Actually looking at this a little deeper maybe Lucene could/should automatically be doing this &amp;quot;short&amp;quot; optimisation here? Given a comparatively small</description>
<pubDate>10 Oct  2008 09:22:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66214</link>
</item><item>
<title>RE: Question regarding sorting and memory consumption in lucene</title>
<description>I have had a similar problem. What I do is load all the date field values at index startup, convert dates (timestamps) to a Julian date (# of seconds</description>
<pubDate>10 Oct  2008 08:57:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66213</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>That&amp;#039;s a really good idea Mark! :) Thanks! Will try to see if can make a quick change with your suggestion.  (Too bad quick isn&amp;#039;t really a word in my</description>
<pubDate>10 Oct  2008 08:45:02 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66212</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>Update: The statement &amp;quot;...cost is field size (10 bytes ?) times number of documents&amp;quot; is wrong. What you actually have is the cost of the unique strin</description>
<pubDate>10 Oct  2008 08:07:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66209</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>Yes, I understand that, and I did mean the number of documents, but I read  in the javadoc that: &amp;quot;For String fields, the cache is larger: in additio</description>
<pubDate>10 Oct  2008 07:58:14 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66208</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>I think you have your memory cost calculation wrong. The cost is field size (10 bytes ?) times number of documents NOT number of unique terms. The cac</description>
<pubDate>10 Oct  2008 07:43:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66207</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>Unfortunately no, since the documents that are added may come from a new  &amp;quot;source&amp;quot; containing old documents as well..:/ I tried deploying our webappli</description>
<pubDate>10 Oct  2008 07:25:29 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66206</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>Assuming content is added in chronological order and with no updates to existing docs couldn&amp;#039;t you rely on internal Lucene document id to give a chron</description>
<pubDate>10 Oct  2008 06:18:46 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66205</link>
</item><item>
<title>Re: Question regarding sorting and memory consumption in lucene</title>
<description>I&amp;#039;ll follow up on my own question... Let&amp;#039;s say that we have 4 years of data, meaning that there will be roughly  4 * 365 = 1460 unique terms for our</description>
<pubDate>10 Oct  2008 05:51:50 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66202</link>
</item><item>
<title>Re: Only last field indexed</title>
<description>True, I guess I was thinking of things from a search-only perspective when I claimed they were identical... But you&amp;#039;re absolutely right in that you ca</description>
<pubDate>10 Oct  2008 05:47:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66201</link>
</item><item>
<title>Re: Release 2.4 on ibiblio</title>
<description>The release bits are indeed propagating through all mirrors, but I&amp;#039;m  going to wait until tomorrow to do the announcement, to make sure all  mirrors</description>
<pubDate>10 Oct  2008 05:31:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66200</link>
</item><item>
<title>Release 2.4 on ibiblio</title>
<description>Hi there, I&amp;#039;ve just noticed that there is already a 2.4 release available on ibiblio  (http://mirrors.ibiblio.org/pub/mirrors/maven2/org/apache/luce</description>
<pubDate>10 Oct  2008 05:27:40 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66199</link>
</item><item>
<title>Question regarding sorting and memory consumption in lucene</title>
<description>Hello, I&amp;#039;ve read a lot of threads now on memory consumption and sorting,  and I think I have a pretty good understanding of how things work, but I</description>
<pubDate>10 Oct  2008 05:09:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66198</link>
</item><item>
<title>Re: Buzz measurement - Aggregate functions</title>
<description>Ah, sorry. Just saw the bit about the free text query too. A FieldCache is the answer here I suspect in order to quickly retrieve the date values for</description>
<pubDate>10 Oct  2008 03:04:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66196</link>
</item><item>
<title>Re: Buzz measurement - Aggregate functions</title>
<description>Assuming your date data is held as YYYYMMDD and you want daily totals....     Term startTerm=new Term(&amp;quot;date&amp;quot;,&amp;quot;20080101&amp;quot;);     TermEnum termEn</description>
<pubDate>10 Oct  2008 02:40:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66195</link>
</item><item>
<title>Buzz measurement - Aggregate functions</title>
<description>Hi. Anyone have an idea of how I would create a query which finds the data backing a trend graph where date is X and num(docs) is on Y axis ? This i</description>
<pubDate>10 Oct  2008 02:12:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66194</link>
</item><item>
<title>Re: wizard for search in Lucene</title>
<description>From what I can understand, you want to insert the word &amp;quot;history&amp;quot; and then  get proposed &amp;quot;related&amp;quot; terms in combination with your input query. In ess</description>
<pubDate>10 Oct  2008 00:33:58 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66193</link>
</item><item>
<title>Re: Single searcher vs Multi Searcher</title>
<description>Hi Ganesh, Your situation seems pretty straight. I did not really split my database (storage), just that while indexing, I indexed the data into &amp;#039;n&amp;#039;</description>
<pubDate>09 Oct  2008 21:49:16 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66192</link>
</item><item>
<title>RE: Only last field indexed</title>
<description>Kalani, They are put into the document as separate fields. That&amp;#039;s why there are getFields() and deleteFields() methods on Document. John G. -----Or</description>
<pubDate>09 Oct  2008 19:29:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66191</link>
</item><item>
<title>Re: wizard for search in Lucene</title>
<description>Albert Juhe: &amp;gt; &amp;gt; Hi, &amp;gt; &amp;gt; I want to make a wizard that can help to find n-grams terms. &amp;gt; For example: &amp;gt; If i want to search History, after write it t</description>
<pubDate>09 Oct  2008 09:49:26 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66186</link>
</item><item>
<title>wizard for search in Lucene</title>
<description>Hi, I want to make a wizard that can help to find n-grams terms. For example: If i want to search History, after write it the system propose you the</description>
<pubDate>09 Oct  2008 07:33:06 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66175</link>
</item><item>
<title>custom tag scoring question</title>
<description>We have a custom &amp;quot;tagger&amp;quot; application which identifies certain entities (such as companies, etc.) and applies a &amp;quot;relevance&amp;quot; value to each entity based</description>
<pubDate>08 Oct  2008 08:48:12 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66162</link>
</item><item>
<title>Access Scoring Values of Lucene for Post-Processing</title>
<description>Hi everybody, I am using Lucene for searching items in a online shop. E.g. I search the shop for &amp;quot;shirt&amp;quot; I get a resultset from lucene. Now I want to</description>
<pubDate>08 Oct  2008 08:15:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66161</link>
</item><item>
<title>Re: Only last field indexed</title>
<description>Kalani: You are mistaken to think this is not possible. In fact it is explicitly supported, and I&amp;#039;ve done it in many situations. These two are entire</description>
<pubDate>08 Oct  2008 05:58:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66157</link>
</item><item>
<title>Wildcard query ...</title>
<description>Let&amp;#039;s say my index has two fields (Type and Description). Type is either 0 or 1 and Description is a string (up to 250 characters). I&amp;#039;d like to exec</description>
<pubDate>08 Oct  2008 05:44:56 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66156</link>
</item><item>
<title>Re: Single searcher vs Multi Searcher</title>
<description>Hello Anshum, In my case i have to add /modify records to the current index database and there will be only delete in older index DB. I will not ha</description>
<pubDate>08 Oct  2008 05:09:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66155</link>
</item><item>
<title>Re: Only last field indexed</title>
<description>You are doing this kind of a thing? document.add(new Field(&amp;quot;id&amp;quot;, id1, Field.Store.YES, Field.Index.TOKENIZED)); document.add(new Field(&amp;quot;id&amp;quot;, id2, Fie</description>
<pubDate>08 Oct  2008 02:01:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66152</link>
</item><item>
<title>Re: advice on using Lucene for sorting based on payloads</title>
<description>Thanks very much for your response, and for pointing me in the direction towards Function Queries - you saved me a ton of time! You&amp;#039;re right, that see</description>
<pubDate>07 Oct  2008 22:42:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66148</link>
</item><item>
<title>Re: advice on using Lucene for sorting based on payloads</title>
<description>Not sure if I fully get it, but bear with me... Inline below. On Oct 6, 2008, at 11:37 PM, Alexander Devine wrote: &amp;gt; Hi Luceners, &amp;gt; &amp;gt; I have a part</description>
<pubDate>07 Oct  2008 19:11:06 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66147</link>
</item><item>
<title>Re: Re-tokenized fields disappear</title>
<description>See below (and your other mail) On Tue, Oct 7, 2008 at 1:59 PM, John G &amp;lt;griffij@sosstaffing.com&amp;gt; wrote: &amp;gt; &amp;gt; Thanks Erick, &amp;gt; &amp;gt; Yes PerFieldAnalyzerWr</description>
<pubDate>07 Oct  2008 11:25:00 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66142</link>
</item><item>
<title>Re: Only last field indexed</title>
<description>Let&amp;#039;s see the indexing code. It is perfectly reasonable to add data to a field multiple times, so I suspect you&amp;#039;re doing something wrong..... What ev</description>
<pubDate>07 Oct  2008 11:23:51 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66141</link>
</item><item>
<title>Re: Re-tokenized fields disappear</title>
<description>Thanks Erick, Yes PerFieldAnalyzerWrapper is my friend :&amp;gt;). Another related question, I&amp;#039;m putting these values into a document in fields with the sa</description>
<pubDate>07 Oct  2008 10:59:44 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66140</link>
</item><item>
<title>Only last field indexed</title>
<description>Guys, I&amp;#039;m adding multiple fields with the same name to a document as Store.YES, Indexed.TOKENIZED and it seems that only the last field entered is in</description>
<pubDate>07 Oct  2008 10:28:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66139</link>
</item><item>
<title>Re: ArrayIndexOutOfBoundsException in FastCharStream.readChar</title>
<description>I found it. My reader was returning 0 at the end of the stream instead of -1. Doh.   Thanks again for the suggestions. They did ultimately lead me to</description>
<pubDate>07 Oct  2008 08:50:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66136</link>
</item><item>
<title>spellcheck: issues</title>
<description>Hello, I&amp;#039;ve been exploring usage of the spellcheck feature via solr 1.3. I have it working, but there are some issues I&amp;#039;m seeing that make it less us</description>
<pubDate>07 Oct  2008 07:56:12 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66133</link>
</item><item>
<title>Re: ArrayIndexOutOfBoundsException in FastCharStream.readChar</title>
<description>Thanks for the tip. I tried your experiment and, sure enough, it works just fine, so it&amp;#039;s not the contents but obviously some other behavior of my cus</description>
<pubDate>07 Oct  2008 07:43:06 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66132</link>
</item><item>
<title>Re: Re-tokenized fields disappear</title>
<description>This is going to get really sticky given StandardAnalyzer. Let&amp;#039;s say that you have codesearch:B05 1 codesearch:B05 2 codesearch:B05 3 When you index</description>
<pubDate>07 Oct  2008 05:52:24 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66130</link>
</item><item>
<title>Re: Single searcher vs Multi Searcher</title>
<description>There were all of the links I could find: http://findmeajob.wordpress.com/2007/07/31/lucene-singlesearcher-vs-multisearcher/ http://archives.devshed.c</description>
<pubDate>07 Oct  2008 05:25:47 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66129</link>
</item><item>
<title>Re: Single searcher vs Multi Searcher</title>
<description>Hello Anusham, My intention is to shard the index after every 7 days (week). After 30 days, (4th week) the first DB may get deleted. At any point o</description>
<pubDate>07 Oct  2008 02:59:33 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66127</link>
</item><item>
<title>Re: ArrayIndexOutOfBoundsException in FastCharStream.readChar</title>
<description>If you capture the exact text produced by the reader, and wrap it in a  StringReader and pass that to StandardAnalyzer, do you then see the  same ex</description>
<pubDate>07 Oct  2008 02:12:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66125</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>Hi Edwin, Would love to post the code, but I don&amp;#039;t have the Terracotta source code at the moment as I only downloaded the binaries. But Terracotta i</description>
<pubDate>06 Oct  2008 22:35:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66123</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>Hi Paul, now i&amp;#039;m curious to know how the overridden java.util.Vector implements the clone method. Any chance of posting a snippet? ;)  Cheers, Edwin</description>
<pubDate>06 Oct  2008 21:34:06 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66121</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>Seems like I found the culprit to the problem. Because I am using Terracotta to do JVM clustering, it has overridden many classes of java.util.*.</description>
<pubDate>06 Oct  2008 20:51:38 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66120</link>
</item><item>
<title>Re-tokenized fields disappear</title>
<description>My previous question may be moot but as is it is still a problem. Here&amp;#039;s a little more info on my problem. The same named fields contain two pieces of</description>
<pubDate>06 Oct  2008 20:38:38 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66119</link>
</item><item>
<title>advice on using Lucene for sorting based on payloads</title>
<description>Hi Luceners, I have a particular sorting problem and I wanted some advice on what the best implementation approach would be. We currently use Lucene</description>
<pubDate>06 Oct  2008 20:37:18 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66118</link>
</item><item>
<title>Re-tokenized fields disappear</title>
<description>Guys,   I have documents with multiple stored, tokenized fields of the same name but different values in them such as:   &amp;quot;codesearch&amp;quot;, &amp;quot;B01&amp;quot; &amp;quot;cod</description>
<pubDate>06 Oct  2008 19:39:28 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66115</link>
</item><item>
<title>Re: ArrayIndexOutOfBoundsException in FastCharStream.readChar</title>
<description>I upgraded to the latest, 3.3.2 and had the same problem, even though it was clearly a different lexer reading the text.   I did find some problems w</description>
<pubDate>06 Oct  2008 15:56:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66110</link>
</item><item>
<title>Re: bunch of newbie queries, PS</title>
<description>Hoss --  Thanks for reply --  &amp;gt;Paul: your subject line seems to indicate that your email is a followup &amp;gt;to previous questions, however i don&amp;#039;t see</description>
<pubDate>06 Oct  2008 15:09:59 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66109</link>
</item><item>
<title>Re: Field names with : in it.</title>
<description>Hello and thanks for the response. Yes the title should have been values. I&amp;#039;ll take at look at the link you provided. Thanks again! Erick Erickson</description>
<pubDate>06 Oct  2008 14:22:30 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66108</link>
</item><item>
<title>Re: Field names with : in it.</title>
<description>You have to escape it in a query. And did you mean to title your post as field *names*? because I can&amp;#039;t imagine how that would work. Field *values*, s</description>
<pubDate>06 Oct  2008 13:29:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66101</link>
</item><item>
<title>Field names with : in it.</title>
<description>It appears if a search field value entered contains a &amp;#039;:&amp;#039; causes some issues since if I have a field named a.b.c and the user enters a value of foo t</description>
<pubDate>06 Oct  2008 12:54:12 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66100</link>
</item><item>
<title>Re: ArrayIndexOutOfBoundsException in FastCharStream.readChar</title>
<description>No particular reason. It is just what I had loaded last and hadn&amp;#039;t upgraded. It sounds like there might be good reason to do that now.   Thanks for th</description>
<pubDate>06 Oct  2008 12:20:51 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66099</link>
</item><item>
<title>RE: ArrayIndexOutOfBoundsException in FastCharStream.readChar</title>
<description>Hi Edwin, I don&amp;#039;t know specifically what&amp;#039;s causing the exception you&amp;#039;re seeing, but note that in Lucene 2.3.0+, the JavaCC-generated version of Stand</description>
<pubDate>06 Oct  2008 12:18:20 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66094</link>
</item><item>
<title>Re: Single searcher vs Multi Searcher</title>
<description>Hi Ganesh, About the memory consumption while sorting, it would end up using similar amounts, perhaps even more.. like in the case of regular parallel</description>
<pubDate>06 Oct  2008 11:48:15 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66093</link>
</item><item>
<title>Re: ArrayIndexOutOfBoundsException in FastCharStream.readChar</title>
<description>Oh, and in case it matters, I&amp;#039;m using Lucene 2.2.0. Ed  ----- Original Message ----  I am stumped and have not seen any other reference to this pr</description>
<pubDate>06 Oct  2008 11:32:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66091</link>
</item><item>
<title>Re: bunch of newbie queries, PS</title>
<description>: Subject: bunch of newbie queries, PS Paul: your subject line seems to indicate that your email is a followup to previous questions, however i don</description>
<pubDate>06 Oct  2008 11:24:53 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66090</link>
</item><item>
<title>ArrayIndexOutOfBoundsException in FastCharStream.readChar</title>
<description>I am stumped and have not seen any other reference to this problem. I am getting the following exception on everything I try to index. Does anyone kno</description>
<pubDate>06 Oct  2008 11:23:58 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66092</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>Hi Edwin, Yes, I am running the TestVector against the same class path as my actual application. My class path contains the Lucene JAR files. This</description>
<pubDate>06 Oct  2008 09:26:23 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66084</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>Hi Paul, When you say that your dummy TestVector class works, are you running it using the same classpaths that you used to run your actual applicati</description>
<pubDate>06 Oct  2008 09:04:23 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66083</link>
</item><item>
<title>RE: Memory eaten up by String, Term and TermInfo?</title>
<description>Hi, Probably off-topic, but just like to plug a bit on my blog post here: http://tinyurl.com/4vytcc :p (incidentally, Java GC is one of my favourite</description>
<pubDate>06 Oct  2008 08:58:59 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66082</link>
</item><item>
<title>RE: Memory eaten up by String, Term and TermInfo?</title>
<description>Oh yes, I forgot to mention that MaxPermSize is very useful, and may be another key that solves my problem. I haven&amp;#039;t tried UseConcMarkSweepGC and the</description>
<pubDate>06 Oct  2008 08:11:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66078</link>
</item><item>
<title>Re: Document larger than setRAMBufferSizeMB()</title>
<description>OK it sounds like you need to increase the RAM your JVM is allowed to  use, or, make your documents smaller. Mike Aditi Goyal wrote: &amp;gt; Thanks for</description>
<pubDate>06 Oct  2008 08:06:04 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66077</link>
</item><item>
<title>RE: Memory eaten up by String, Term and TermInfo?</title>
<description>I played around with GC quite a bit in our app and found the following java settings to help a lot (Used with jboss, but should be good for any jvm).</description>
<pubDate>06 Oct  2008 05:48:20 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66076</link>
</item><item>
<title>Re: Document larger than setRAMBufferSizeMB()</title>
<description>Thanks for showing interest Mike. The OOME comes in the middle of setting a value of one of the field in the doc. That field has a fairly large value.</description>
<pubDate>06 Oct  2008 05:17:59 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66075</link>
</item><item>
<title>RE: Advise for Mediabase with Lucene</title>
<description>Hello Mathias,  IMHO sounds like you are planning to re-invent the wheel while all things you want (AFAICS) are already largely available as open sou</description>
<pubDate>06 Oct  2008 01:35:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66074</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>I have tried to recompile Lucene 2.3.2 under my environment (SUN JDK 1.6) and it still doesn&amp;#039;t work. I am beginning to think that maybe there is some</description>
<pubDate>05 Oct  2008 23:16:38 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66071</link>
</item><item>
<title>Re: Single searcher vs Multi Searcher</title>
<description>Hello Anshum, My index is growing 1 million documents per day. Initially i planned to have a single database but the sorting of one or more fields c</description>
<pubDate>05 Oct  2008 21:36:47 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66070</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>Thank you all for your input. However, this is what I have discovered so far:   - I tried running the test that Edwin suggested and it failed for m</description>
<pubDate>05 Oct  2008 19:45:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66068</link>
</item><item>
<title>RE: Memory eaten up by String, Term and TermInfo?</title>
<description>I have confirmed that the OutOfMemoryError is not Lucene&amp;#039;s problem. It&amp;#039;s just because the JVM failed to perform GC when necessary, and I don&amp;#039;t know why. T</description>
<pubDate>05 Oct  2008 04:55:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66059</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>I think, very likely, you have another copy of java.util.Vector loaded, and this one tries to be too clever with its implementation of clone (instanti</description>
<pubDate>03 Oct  2008 23:21:40 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66053</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>Hi Paul, The clone() in SegmentInfos is correct. The best practice of clone is to delegate the clone to the super class (if you look at the source co</description>
<pubDate>03 Oct  2008 23:11:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66052</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>:    SegmentInfos sis = (SegmentInfos) super.clone(); : We see that it is trying to cast a Vector into SegmentInfos which explains : the ClassCas</description>
<pubDate>03 Oct  2008 23:08:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66051</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>I am using Sun&amp;#039;s JRE 1.6.0_02 on Windows XP. Actually... are you sure it would work? java.util.Vector has the following clone() method: public Object</description>
<pubDate>03 Oct  2008 15:38:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66048</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>That&amp;#039;s Sun&amp;#039;s JRE? That should be fine, unless there&amp;#039;s something seriously wrong with its java.util.Vector implementation. But, this is an excepti</description>
<pubDate>03 Oct  2008 15:16:00 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66047</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>I am using Java 1.6.0_02. Is this a problem? On Fri, Oct 3, 2008 at 5:35 PM, Michael McCandless &amp;lt; lucene@mikemccandless.com&amp;gt; wrote: &amp;gt; &amp;gt; Which Java</description>
<pubDate>03 Oct  2008 15:07:12 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66046</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>Which Java environment are you running? super.clone() from SegmentInfos should produce a new SegmentInfos  object. It seems like in your case it&amp;#039;s</description>
<pubDate>03 Oct  2008 14:35:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66045</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>Hi Mike, I am actually using the Compass Search Engine which in turn makes use of Lucene. They are doing the following in their code:     Index</description>
<pubDate>03 Oct  2008 14:24:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66044</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>Can you describe what led up to this exception? I.e., what calls you made to Lucene before this. Mike Paul Chan wrote: &amp;gt; I think I know what the p</description>
<pubDate>03 Oct  2008 13:54:51 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66043</link>
</item><item>
<title>Re: ClassCastException when writing to index writer</title>
<description>I think I know what the problem is looking at the code: In SegmentInfos.java (line 321): class SegmentInfos extends Vector {   public Object clone</description>
<pubDate>03 Oct  2008 13:41:23 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66042</link>
</item><item>
<title>ClassCastException when writing to index writer</title>
<description>Hi, I am using Lucene 2.3.2 and I encounter the following exception when I try to insert an object into the index. Caused by: java.lang.ClassCastExce</description>
<pubDate>03 Oct  2008 12:40:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66041</link>
</item><item>
<title>Re: Extracting Dates</title>
<description>David, this is not really a Lucene issue. Here is some Perl code that you could either use or rewrite in Java if you need it in Java: http://search.c</description>
<pubDate>03 Oct  2008 09:49:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66029</link>
</item><item>
<title>Re: Single searcher vs Multi Searcher</title>
<description>Hi Ganesh, I have experimented with sharded indexes and they seem to benefit me (at least in my case). I would like to know a few things before I answe</description>
<pubDate>03 Oct  2008 09:18:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66028</link>
</item><item>
<title>Re: Document larger than setRAMBufferSizeMB()</title>
<description>Note that large stored fields do not use up any RAM in IndexWriter&amp;#039;s  RAM buffer because these stored fields are immediately written to the  directo</description>
<pubDate>03 Oct  2008 04:27:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66021</link>
</item><item>
<title>Re: Document larger than setRAMBufferSizeMB()</title>
<description>A single document of 16 MB seems big. I think you are trying to store the entire document content. If so, drop the stored field and store it</description>
<pubDate>03 Oct  2008 03:11:06 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66019</link>
</item><item>
<title>Re: Document larger than setRAMBufferSizeMB()</title>
<description>First off, IndexWriter&amp;#039;s RAM buffer size is &amp;quot;approximate&amp;quot;: after each doc is added, we check if the RAM consumed is greater than our budget, and i</description>
<pubDate>03 Oct  2008 03:05:50 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/66018</link>
</item>
</channel>
</rss>
