<?xml version="1.0" encoding="iso-8859-1" ?>
<?xml-stylesheet title="XSL_formatting" type="text/xsl" href="/images/lists/rssstyle2.xsl"?>
<rss version="2.0">
<channel>
<title>Lucene | Java-User</title>
<description>Mailing List Archive by Gossamer Threads</description>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/</link>
<language>en-us</language>
<copyright>(c) Gossamer Threads Inc. All rights reserved.</copyright>
<lastBuildDate>12 Feb  2012 08:49:59 -0800</lastBuildDate>
<ttl>120</ttl>
<image>
<title>Gossamer Threads | Lucene | Java-User</title>
<width>75</width>
<height>23</height>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/</link>
<url>http://www.gossamer-threads.com/images/lists/rss_logo.jpg</url>
</image>
<item>
<title>Re: confirm unsubscribe from java-user@lucene.apache.org</title>
<description>--- On Thu, 9/2/12, Christof Schablinski &amp;lt;christof.schablinski@devoteam.com&amp;gt; wrote:  From: Christof Schablinski &amp;lt;christof.schablinski@devoteam.com&amp;gt; S</description>
<pubDate>12 Feb  2012 07:16:38 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145731</link>
</item><item>
<title>Re: Why read past EOF</title>
<description>I&amp;#039;m glad the timed deletion policy is working on NFS! Thanks for bringing closure, Mike McCandless http://blog.mikemccandless.com On Fri, Feb 10,</description>
<pubDate>11 Feb  2012 06:47:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145678</link>
</item><item>
<title>Re: Nested BlockJoinQuery</title>
<description>Your requirement does not sound like a good fit for the nested stuff but is probably more one for conventional faceting. I would characterise the use</description>
<pubDate>11 Feb  2012 06:45:28 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145677</link>
</item><item>
<title>Re: Index writing performance of 3.5</title>
<description>Tried changing the merge policy but it had no effect on the test times. But I can rule out ReiserFS as the culprit now too, since I was able to run wi</description>
<pubDate>10 Feb  2012 21:10:26 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145643</link>
</item><item>
<title>Re: Why read past EOF</title>
<description>Thanks for your advice and patient. I modify &amp;quot;present&amp;quot;,and use stress testing two day(loop search and index),the &amp;quot;read past EOF&amp;quot; didn&amp;#039;t appeared yet.</description>
<pubDate>10 Feb  2012 18:58:47 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145641</link>
</item><item>
<title>norm for a document in a CustomScoreQuery</title>
<description>I was looking to the possibility that _some_ subqueries might discount (actually remove) field norms. I&amp;#039;m trying out the view that in general while l</description>
<pubDate>10 Feb  2012 14:59:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145638</link>
</item><item>
<title>Nested BlockJoinQuery</title>
<description>I&amp;#039;m trying to learn more about using BlockJoinQuery in our search application and I came across this blog post by Mike McCandless: http://blog.mikemcc</description>
<pubDate>10 Feb  2012 14:31:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145671</link>
</item><item>
<title>Re: Filter and IndexSearcher in Lucene 4.0 (trunk)</title>
<description>See the question was so trivial that you actually missed it :)  The problem is that the docs are filtered (which is is great) but the stats (BasicSta</description>
<pubDate>10 Feb  2012 10:58:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145611</link>
</item><item>
<title>RE: Filter and IndexSearcher in Lucene 4.0 (trunk)</title>
<description>Whats the problem? ----- Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: uwe@thetaphi.de  &amp;gt; -----Original Message--</description>
<pubDate>10 Feb  2012 10:27:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145610</link>
</item><item>
<title>Re: Filter and IndexSearcher in Lucene 4.0 (trunk)</title>
<description>Hi, I apologise upfront for the trivial question. I have an IndexSearcher and I am applying a FieldCacheTermsFilter filter on it to only retrieve doc</description>
<pubDate>10 Feb  2012 09:43:29 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145594</link>
</item><item>
<title>Re: Access next token in a stream</title>
<description>Στις 9/2/2012 11:12 μμ, ο/η Steven A Rowe έγραψε: &amp;gt; Damerian, &amp;gt; &amp;gt; When I said &amp;quot;clear the previous token&amp;quot;, I was referring to the pseudo-</description>
<pubDate>09 Feb  2012 14:14:54 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145504</link>
</item><item>
<title>RE: Access next token in a stream</title>
<description>Damerian,  When I said &amp;quot;clear the previous token&amp;quot;, I was referring to the pseudo-code I gave in my first response to you. There is no built-in meth</description>
<pubDate>09 Feb  2012 14:12:27 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145503</link>
</item><item>
<title>Re: Access next token in a stream</title>
<description>Στις 9/2/2012 10:51 μμ, ο/η Steven A Rowe έγραψε: &amp;gt; Damerian, &amp;gt; &amp;gt; The technique I mentioned would work for you with a little tweaking: w</description>
<pubDate>09 Feb  2012 13:59:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145502</link>
</item><item>
<title>RE: Access next token in a stream</title>
<description>Damerian,  The technique I mentioned would work for you with a little tweaking: when you see consecutive capitalized tokens, then just set the CharT</description>
<pubDate>09 Feb  2012 13:51:33 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145501</link>
</item><item>
<title>Re: Access next token in a stream</title>
<description>Στις 9/2/2012 8:54 μμ, ο/η Steven A Rowe έγραψε: &amp;gt; Hi Damerian, &amp;gt; &amp;gt; One way to handle your scenario is to hold on to the previous token,</description>
<pubDate>09 Feb  2012 13:15:17 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145500</link>
</item><item>
<title>RE: Access next token in a stream</title>
<description>Hi Damerian,  One way to handle your scenario is to hold on to the previous token, and only emit a token after you reach at least the second token (</description>
<pubDate>09 Feb  2012 11:54:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145477</link>
</item><item>
<title>Access next token in a stream</title>
<description>Hello i want to implement my custom filter, my wuestion is quite simple but i cannot find a solution to it no matter how i try: How can i access the</description>
<pubDate>09 Feb  2012 11:18:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145476</link>
</item><item>
<title>Re: confirm unsubscribe from java-user@lucene.apache.org</title>
<description>Mit freundlichen Gren Christof Schablinski Devoteam Danet GmbH, Waldburgstrasse 17 - 19, 70563 Stuttgart, Germany Phone: +49 6151 868 8730, Fax: +</description>
<pubDate>09 Feb  2012 07:32:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145439</link>
</item><item>
<title>Re: analyzer per document</title>
<description>I would use a different field per language and use PerFieldAnalyzer indeed. This is also important for queries whose language is not always clear. pa</description>
<pubDate>09 Feb  2012 06:11:33 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145419</link>
</item><item>
<title>Re: Index writing performance of 3.5</title>
<description>one major thing that changed from 3.0.3 to 3.5 is that we use TieredMergePolicy by default. can you try to use the same merge policy on both 3.0.3 and</description>
<pubDate>09 Feb  2012 04:13:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145399</link>
</item><item>
<title>Re: analyzer per document</title>
<description>Why don&amp;#039;t you store each &amp;quot;file&amp;quot; in a single document, add a field for each &amp;quot;line&amp;quot; and use a PerFieldAnalyzerWrapper? Francisco A. Lozano  On Thu, F</description>
<pubDate>09 Feb  2012 04:11:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145398</link>
</item><item>
<title>Re: IndexWriter in 3.5</title>
<description>Yes, this changed at some point. In recent releases nothing is written to the index unless you close(), or maybe commit(), the writer.  -- Ian.  On</description>
<pubDate>09 Feb  2012 04:09:24 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145397</link>
</item><item>
<title>IndexWriter in 3.5</title>
<description>Hello all, In 3.0.3 the following code works fine but in 3.5, it throws exception &amp;quot;No segments found&amp;quot;. In case of 3.0.3, Just creating writer will cr</description>
<pubDate>09 Feb  2012 04:02:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145396</link>
</item><item>
<title>analyzer per document</title>
<description>Hello All, I have a requirement of using different analyzer per document. How can we do this? My analyzer would be locale specific.  I have a file</description>
<pubDate>09 Feb  2012 04:01:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145395</link>
</item><item>
<title>Fwd: Delete words in a specific increment Position with Lucene</title>
<description>-------- &#039;ρχικό Μήνυμα -------- Θέμα:    Delete words in a specific increment Position with Lucene -μερομηνία:  Tue, 07</description>
<pubDate>09 Feb  2012 03:34:41 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145394</link>
</item><item>
<title>Index writing performance of 3.5</title>
<description>Hello, I am currently evaluating Lucene 3.5.0 for upgrading from 3.0.3, and in the context of my usage, the most important parameter is index writing</description>
<pubDate>08 Feb  2012 20:28:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145360</link>
</item><item>
<title>RE: Please explain DisjunctionMaxQuery JavaDoc.</title>
<description>&amp;gt; -----Original Message----- &amp;gt; From: Paul Allan Hill [mailto:paul@metajure.com] &amp;gt; Sent: Wednesday, February 08, 2012 2:42 PM &amp;gt; To: java-user@lucene.ap</description>
<pubDate>08 Feb  2012 15:35:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145348</link>
</item><item>
<title>Please explain DisjunctionMaxQuery JavaDoc.</title>
<description>What the heck does is the JavaDoc for DisjunctionMaxQuery saying: &amp;quot;A query that generates the union of documents produced by its subqueries, and that</description>
<pubDate>08 Feb  2012 14:42:11 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145347</link>
</item><item>
<title>Working with MemoryIndex results</title>
<description>Hello, I&amp;#039;m using a MemoryIndex in order to search a block of in-memory text using a lucene query. I&amp;#039;m able to search the text, produce a result, and</description>
<pubDate>08 Feb  2012 13:36:44 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145333</link>
</item><item>
<title>Re: slow speed of searching</title>
<description>thanks a lot On Wed, Feb 8, 2012 at 9:48 PM, Ian Lea &amp;lt;ian.lea@gmail.com&amp;gt; wrote: &amp;gt; http://wiki.apache.org/lucene-java/ImproveSearchingSpeed &amp;gt; &amp;gt; (the</description>
<pubDate>08 Feb  2012 06:18:56 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145274</link>
</item><item>
<title>Re: slow speed of searching</title>
<description>http://wiki.apache.org/lucene-java/ImproveSearchingSpeed (the 3rd item is Use a local filesystem!) -- Ian.  On Wed, Feb 8, 2012 at 12:44 PM, Cheng</description>
<pubDate>08 Feb  2012 05:48:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145273</link>
</item><item>
<title>Re: how to create directory on a remote server protected by password</title>
<description>Don&amp;#039;t. Likely to cause more problems than it&amp;#039;s worth. See recent thread on &amp;quot;Why read past EOF&amp;quot;. But if you really feel you must, either write your</description>
<pubDate>08 Feb  2012 05:46:46 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145272</link>
</item><item>
<title>Re: NRTManager and AlreadyClosedException</title>
<description>are you closing the NRTManager while other threads still accessing the SearcherManager? simon On Wed, Feb 8, 2012 at 1:48 PM, Cheng &amp;lt;zhoucheng2008@g</description>
<pubDate>08 Feb  2012 05:09:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145271</link>
</item><item>
<title>Re: NRTManager and AlreadyClosedException</title>
<description>I use it exactly the same way. So there must be other reason causing the problem. On Wed, Feb 8, 2012 at 8:21 PM, Ian Lea &amp;lt;ian.lea@gmail.com&amp;gt; wrote:</description>
<pubDate>08 Feb  2012 04:48:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145270</link>
</item><item>
<title>slow speed of searching</title>
<description>Hi, I have about 6.5 million documents which lead to 1.5G index. The speed of search a couple terms, like &amp;quot;dvd&amp;quot; and &amp;quot;price&amp;quot;, causes about 0.1 second.</description>
<pubDate>08 Feb  2012 04:44:11 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145269</link>
</item><item>
<title>Re: NRTManager and AlreadyClosedException</title>
<description>Releasing a searcher is not the same as closing the searcher manager, if that is what you mean. The searcher should indeed be released, but once only</description>
<pubDate>08 Feb  2012 04:21:20 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145268</link>
</item><item>
<title>how to create directory on a remote server protected by password</title>
<description>Hi, I want to create a writer on a folder (&amp;quot;fsdir&amp;quot;) in a remote server (&amp;quot;10.161.1.23&amp;quot;), which has user id &amp;quot;xyz&amp;quot; and password &amp;quot;pwd&amp;quot;. How can I do so?</description>
<pubDate>08 Feb  2012 04:12:33 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145248</link>
</item><item>
<title>Re: NRTManager and AlreadyClosedException</title>
<description>You are right. There is a method by which I do searching. At the end of the method, I release the index searcher (not the searchermanager). Since thi</description>
<pubDate>08 Feb  2012 04:09:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145247</link>
</item><item>
<title>Re: NRTManager and AlreadyClosedException</title>
<description>Are you closing the SearcherManager? Calling release() multiple times? From the exception message the first sounds most likely.  -- Ian.  On Wed,</description>
<pubDate>08 Feb  2012 03:51:00 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145246</link>
</item><item>
<title>Re: Why read past EOF</title>
<description>Hmm, there&amp;#039;s a problem with the logic here (sorry: this is my fault -- my prior suggestion is flat out wrong!). The problem is... say you commit once</description>
<pubDate>08 Feb  2012 02:57:16 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145245</link>
</item><item>
<title>Re: How best to handle a reasonable amount to data (25TB+)</title>
<description>On Feb 8, 2012, at 10:14 AM, Danil ŢORIN wrote: &amp;gt; For example if you only query data for 1 month intervals, and you &amp;gt; partition by date, you can cal</description>
<pubDate>08 Feb  2012 01:30:49 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145234</link>
</item><item>
<title>Re: How best to handle a reasonable amount to data (25TB+)</title>
<description>On Feb 8, 2012, at 10:14 AM, Danil ŢORIN wrote: &amp;gt; For example if you only query data for 1 month intervals, and you &amp;gt; partition by date, you can cal</description>
<pubDate>08 Feb  2012 01:30:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145233</link>
</item><item>
<title>Re: How best to handle a reasonable amount to data (25TB+)</title>
<description>It also depends on your queries. For example if you only query data for 1 month intervals, and you partition by date, you can calculate in which shar</description>
<pubDate>08 Feb  2012 01:14:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145232</link>
</item><item>
<title>Re: How best to handle a reasonable amount to data (25TB+)</title>
<description>it&amp;#039;s up to your machines. in our application, we indexs about 30,000,000(30M)docs/shard, and the response time is about 150ms. our machine has about 4</description>
<pubDate>07 Feb  2012 23:39:55 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145225</link>
</item><item>
<title>NRTManager and AlreadyClosedException</title>
<description>Hi, I am using NRTManager and NRTManagerReopenThread. Though I don&amp;#039;t close either writer or the reopen thread, I receive AlreadyClosedException as fo</description>
<pubDate>07 Feb  2012 21:20:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145211</link>
</item><item>
<title>RE: How best to handle a reasonable amount to data (25TB+)</title>
<description>Well, I am sooo embarrassed: I haven&amp;#039;t stuffed this badly in quite a while. But in the end, 13 shards is the right number. My calculator work was OK,</description>
<pubDate>07 Feb  2012 19:42:24 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145203</link>
</item><item>
<title>Re: How best to handle a reasonable amount to data (25TB+)</title>
<description>I&amp;#039;m all confused. 100M X 13 shards = 1.3G records, not 1.25 T But I get it 1.5 x 10^7 x 12 x 7 = 1.26 x 10 ^ 9 = 1.26 Billion, or am I off base again</description>
<pubDate>07 Feb  2012 18:38:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145202</link>
</item><item>
<title>Re: Why read past EOF</title>
<description>public class PostponeCommitDeletionPolicy implements IndexDeletionPolicy {     private final static long deletionPostPone = 600000;     publi</description>
<pubDate>07 Feb  2012 18:15:26 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145186</link>
</item><item>
<title>RE: How best to handle a reasonable amount to data (25TB+)</title>
<description>Oops again! Turns out I got to the right result earlier by the wrong means! I found this reference (http://www.dejavutechnologies.com/faq-solr-lucene.</description>
<pubDate>07 Feb  2012 18:07:49 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145185</link>
</item><item>
<title>RE: How best to handle a reasonable amount to data (25TB+)</title>
<description>Whoops! Very poor basic maths, I should have written it down. I was thinking 13 shards. But yes, 13,000 is a bit different. Now I&amp;#039;m in even more need</description>
<pubDate>07 Feb  2012 17:19:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145184</link>
</item><item>
<title>Applying LUCENE-3653 patch to Lucene 3.0.3</title>
<description>Hi, My company is using an older version of Lucene (3.0.3). In my profiling results with 3.0.3, I have found that my app&amp;#039;s threads were blocked due t</description>
<pubDate>07 Feb  2012 13:45:28 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145148</link>
</item><item>
<title>Re: How best to handle a reasonable amount to data (25TB+)</title>
<description>I&amp;#039;m curious what the nature of your data is such that you have 1.25 trillion documents. Even at 100M/shard, you&amp;#039;re still talking 12,500 shards. The &amp;quot;</description>
<pubDate>07 Feb  2012 05:39:20 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145061</link>
</item><item>
<title>Re: Custom Payload Analyzer and Query</title>
<description>How does searching with PayloadSpanUtil/PayloadTermQuery/etc work to exclude/filter the matching terms based on the payload within a query itself, the</description>
<pubDate>07 Feb  2012 02:53:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145054</link>
</item><item>
<title>Re: Custom Payload Analyzer and Query</title>
<description>2012/2/6 Ian Lea &amp;lt;ian.lea@gmail.com&amp;gt; &amp;gt; Not sure if you got an answer to this or not. Don&amp;#039;t recall seeing one &amp;gt; and gmail threading says not. &amp;gt; &amp;gt; &amp;gt; I</description>
<pubDate>07 Feb  2012 01:11:26 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145052</link>
</item><item>
<title>RE: How best to handle a reasonable amount to data (25TB+)</title>
<description>Thanks for the response. Actually, I am more concerned with trying to use an Object Store for the indexes. The next concern is the use of a local inde</description>
<pubDate>06 Feb  2012 20:17:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145032</link>
</item><item>
<title>Re: Why read past EOF</title>
<description>ok,thanks. I modify my program like you suggest.But another problem appear: java.lang.ArrayIndexOutOfBoundsException: -1     at org.apache.lucene</description>
<pubDate>06 Feb  2012 20:01:15 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145031</link>
</item><item>
<title>Re: Need to enforce logging of Lucene queries</title>
<description>Solr already logs the queries themselves although there isn&amp;#039;t any way that I know of to associate that with a user. Although in Solr land, it seems t</description>
<pubDate>06 Feb  2012 15:44:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145027</link>
</item><item>
<title>RE: recording a universal ID from DocID in a CustomScoreQuery</title>
<description>To complete this thread, I read the document itself with a 1 field fieldSelector, so as not to bother with anything but exactly what I needed at this</description>
<pubDate>06 Feb  2012 15:12:06 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145026</link>
</item><item>
<title>Need to enforce logging of Lucene queries</title>
<description>I have a set of Lucene indexes for which I need to log all accesses and possibly queries. I can use kernel-level auditing to record file accesses, bu</description>
<pubDate>06 Feb  2012 14:45:47 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145025</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>Will do. On Tue, Feb 7, 2012 at 12:52 AM, Michael McCandless &amp;lt; lucene@mikemccandless.com&amp;gt; wrote: &amp;gt; You tell NRTCachingDirectory how much RAM it&amp;#039;s al</description>
<pubDate>06 Feb  2012 08:54:46 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144990</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>You tell NRTCachingDirectory how much RAM it&amp;#039;s allowed to use, and it then caches newly flushed segments in a private RAMDirectory. But you should fi</description>
<pubDate>06 Feb  2012 08:52:23 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144989</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>Good point. I should remove the commits. Any difference between NRTCashingDirectory and RAMDirectory? how to define the &amp;quot;small&amp;quot;? On Tue, Feb 7, 2012</description>
<pubDate>06 Feb  2012 08:46:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144988</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>You shouldn&amp;#039;t call IW.commit when using NRT; that&amp;#039;s the point of NRT (making changes visible w/o calling commit). Only call commit when you require t</description>
<pubDate>06 Feb  2012 08:42:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144987</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>Agree. On Mon, Feb 6, 2012 at 11:53 PM, Uwe Schindler &amp;lt;uwe@thetaphi.de&amp;gt; wrote: &amp;gt; Hi Cheng, &amp;gt; &amp;gt; all pros and cons are explained in those articles wri</description>
<pubDate>06 Feb  2012 07:57:22 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144983</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>My original question is if there exists a way to configure writer when to writer to FSDirectory. I think there may be something in the IndexWriterConf</description>
<pubDate>06 Feb  2012 07:55:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144982</link>
</item><item>
<title>RE: Configure writer to write to FSDirectory?</title>
<description>Hi Cheng, all pros and cons are explained in those articles written by Mike! As soon as there are harddisks in the game, there is a slowdown, what do</description>
<pubDate>06 Feb  2012 07:53:39 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144981</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>Well, yes. What would you expect? From the javadocs for IndexWriter.commit() Commits all pending changes (added &amp;amp; deleted documents, segment merges</description>
<pubDate>06 Feb  2012 07:50:06 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144980</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>I meant that when I use NRTManager and use commit(), the speed is slower than when I use RAMDirectory. In my case, NRTManager instance not only perfo</description>
<pubDate>06 Feb  2012 07:49:56 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144979</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>Uwe, when I meant speed is slow, I didn&amp;#039;t refer to instant visibility of changes, but that the changes may be synchronized with FSDirectory when I use</description>
<pubDate>06 Feb  2012 07:45:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144978</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>What exactly do you mean by the &amp;quot;speed is slower&amp;quot;? Time taken to update the index? Time taken for updates to become visible in search results? Time</description>
<pubDate>06 Feb  2012 07:41:26 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144977</link>
</item><item>
<title>RE: Configure writer to write to FSDirectory?</title>
<description>Please review the following articles about NRT, absolutely instant updates that are visible as they are done are almost impossible (even with RAMDirec</description>
<pubDate>06 Feb  2012 07:40:09 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144976</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>Ian, I encountered an issue that I need to frequently update the index. The NRTManager seems not very helpful on this front as the speed is slower th</description>
<pubDate>06 Feb  2012 07:27:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144975</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>That really helps! I will try it out. Thanks. On Mon, Feb 6, 2012 at 10:12 PM, Ian Lea &amp;lt;ian.lea@gmail.com&amp;gt; wrote: &amp;gt; You would use NRTManagerReopenT</description>
<pubDate>06 Feb  2012 06:24:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144974</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>You would use NRTManagerReopenThread as a standalone thread, not plugged into your Executor stuff. It is a utility class which you don&amp;#039;t have to use.</description>
<pubDate>06 Feb  2012 06:12:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144964</link>
</item><item>
<title>Re: Custom Payload Analyzer and Query</title>
<description>Not sure if you got an answer to this or not. Don&amp;#039;t recall seeing one and gmail threading says not. &amp;gt; Is the use of payloads I&amp;#039;ve described appropri</description>
<pubDate>06 Feb  2012 05:54:56 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144963</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>I don&amp;#039;t understand this following portion: IndexWriter iw = new IndexWriter(whatever - some standard disk index); NRTManager nrtm = new NRTManager(iw</description>
<pubDate>06 Feb  2012 04:31:09 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144962</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>If you can use NRTManager and SearcherManager things should be easy and blazingly fast rather than unbearably slow. The latter phrase is not one ofte</description>
<pubDate>06 Feb  2012 04:17:01 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144950</link>
</item><item>
<title>Re: recording a universal ID from DocID in a CustomScoreQuery</title>
<description>int doc will be for the subreader, not for the entire index. oal.search.Collector has setNextReader(IndexReader reader, int docBase) which you might s</description>
<pubDate>06 Feb  2012 03:53:52 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144949</link>
</item><item>
<title>Re: weightage of each word according to precedence in document</title>
<description>At least it doesn&amp;#039;t give the same score for a doc which doesn&amp;#039;t have all the terms which I think at one point you claimed. So to try and simplify thi</description>
<pubDate>06 Feb  2012 03:13:04 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144948</link>
</item><item>
<title>Re: Apache Lucene file search</title>
<description>Hi        The issue of searching file name is resolved with some modifications in SearchFiles.java . A field named path has been added in the</description>
<pubDate>06 Feb  2012 02:00:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144941</link>
</item><item>
<title>Re: How best to handle a reasonable amount to data (25TB+)</title>
<description>it sounds not an issue of lucene but the logic of your app. if you&amp;#039;re afraid too many docs in one index you can make multiple indexes. And then search</description>
<pubDate>05 Feb  2012 22:29:27 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144939</link>
</item><item>
<title>How best to handle a reasonable amount to data (25TB+)</title>
<description>Hi, I have a little bit of an unusual set of requirements, and I am looking for advice. I have researched the archives, and seen some relevant posts,</description>
<pubDate>05 Feb  2012 18:50:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144934</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>I was trying to, but don&amp;#039;t know how to even I read some of your blogs. On Sun, Feb 5, 2012 at 10:22 PM, Michael McCandless &amp;lt; lucene@mikemccandless.co</description>
<pubDate>05 Feb  2012 16:15:54 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144922</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>Are you using near-real-time readers? (IndexReader.open(IndexWriter)) Mike McCandless http://blog.mikemccandless.com On Sun, Feb 5, 2012 at 9:03 A</description>
<pubDate>05 Feb  2012 06:22:33 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144898</link>
</item><item>
<title>Re: Configure writer to write to FSDirectory?</title>
<description>Hi Uwe, My challenge is that I need to update/modify the indexes frequently while providing the search capability. I was trying to use FSDirectory, b</description>
<pubDate>05 Feb  2012 06:03:58 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144886</link>
</item><item>
<title>RE: Configure writer to write to FSDirectory?</title>
<description>Hi Cheng, It seems that you use a RAMDirectory for *caching*, otherwise it makes no sense to write changes back. In recent Lucene versions, this is n</description>
<pubDate>05 Feb  2012 00:14:46 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144869</link>
</item><item>
<title>Configure writer to write to FSDirectory?</title>
<description>Hi, I build an RAMDirectory on a FSDirectory, and would like the writer associated with the RAMDirectory to periodically write to hard drive. Is thi</description>
<pubDate>04 Feb  2012 22:56:15 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144868</link>
</item><item>
<title>Re: weightage of each word according to precedence in document</title>
<description>hi lan, sorry for late reply , it is simple search with default similarity only, here it gives same score for doc which has both token that is abcd</description>
<pubDate>04 Feb  2012 02:11:44 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144842</link>
</item><item>
<title>recording a universal ID from DocID in a CustomScoreQuery</title>
<description>My Index does NOT have a simple UID, it uses the file PATH to the file as the unique key. I was implementing a CustomScoreQuery which not only tweaked</description>
<pubDate>03 Feb  2012 16:09:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144829</link>
</item><item>
<title>Re: PayloadNearQuery and AveragePayloadFunction</title>
<description>All term queries, including payload queries, deal only with words from the query that exist in a document. They don&amp;#039;t know what other terms are in a m</description>
<pubDate>03 Feb  2012 09:28:22 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144805</link>
</item><item>
<title>Re: PayloadNearQuery and AveragePayloadFunction</title>
<description>Hi Peter Thanks for your reply. I guess I found the problem.  scorePayload function is only called for query terms. Problem was, when I was retrievin</description>
<pubDate>03 Feb  2012 08:50:14 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144804</link>
</item><item>
<title>Re: Why read past EOF</title>
<description>Instead of .getVersion() you should use .getTimestamp()... version is not &amp;quot;really&amp;quot; a timestamp. (Though, really, you should store your own timestamp</description>
<pubDate>03 Feb  2012 08:49:49 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144803</link>
</item><item>
<title>Performance improvements for fuzzy queries ?</title>
<description>Using Lucene 3.5, I created a query parser based on the dismax parser but in order to get matches on misspellings ecetra I additionally do a fuzzy</description>
<pubDate>03 Feb  2012 07:01:42 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144794</link>
</item><item>
<title>Re: PayloadNearQuery and AveragePayloadFunction</title>
<description>AveragPayloadFunction is just what it sounds like: return numPayloadsSeen &amp;gt; 0 ? (payloadScore / numPayloadsSeen) : 1; What values are you seeing retur</description>
<pubDate>03 Feb  2012 05:35:01 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144779</link>
</item><item>
<title>Re: PayloadNearQuery and AveragePayloadFunction</title>
<description>Hi Peter I have checked payload associated with terms, and they are fine in the index. I was not clear enough I believe. When I say interested in clas</description>
<pubDate>03 Feb  2012 01:13:42 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144772</link>
</item><item>
<title>Re: Why read past EOF</title>
<description>eg,I implement IndexDeletionPolicy and the onCommit():     public void onCommit(List&amp;lt;? extends IndexCommit&amp;gt; commits) {         // Note th</description>
<pubDate>02 Feb  2012 19:17:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144762</link>
</item><item>
<title>Re: Why read past EOF</title>
<description>Thanks,you suggest me to creat a my IndexDeletionPolicy,I check KeepOnlyLastCommitDeletionPolicy.onCommit,it invoke CommitPoint.delete(),but it only :</description>
<pubDate>02 Feb  2012 19:13:50 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144761</link>
</item><item>
<title>Re: PayloadNearQuery and AveragePayloadFunction</title>
<description>I don&amp;#039;t quite follow what you&amp;#039;re doing, but is it possible that your payloads are not on the desired terms when you indexed them? The first explanatio</description>
<pubDate>02 Feb  2012 13:39:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144747</link>
</item><item>
<title>Re: Join between indexes</title>
<description>Thanks, that&amp;#039;s a very nice feature.  Wouldit also enable joining on the docId level, meaning that part ofa documentis kept in some index and anoth</description>
<pubDate>02 Feb  2012 12:56:44 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144746</link>
</item><item>
<title>RE: lucene-3.0.3</title>
<description>Hi Everybody,  lucene-3.0.3. will handle outlook files, DOCX and .EXLX files while searching a text??  We have taken indexfiles.java and searchfiles</description>
<pubDate>02 Feb  2012 10:02:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144714</link>
</item>
</channel>
</rss>

