<?xml version="1.0" encoding="iso-8859-1" ?>
<?xml-stylesheet title="XSL_formatting" type="text/xsl" href="/images/lists/rssstyle2.xsl"?>
<rss version="2.0">
<channel>
<title>Lucene | Java-User</title>
<description>Mailing List Archive by Gossamer Threads</description>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/</link>
<language>en-us</language>
<copyright>(c) Gossamer Threads Inc. All rights reserved.</copyright>
<lastBuildDate>25 Nov  2009 03:07:42 -0800</lastBuildDate>
<ttl>120</ttl>
<image>
<title>Gossamer Threads | Lucene | Java-User</title>
<width>75</width>
<height>23</height>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/</link>
<url>http://www.gossamer-threads.com/images/lists/rss_logo.jpg</url>
</image>
<item>
<title>RE: How to implement a GivenCharFilter using incrementToken</title>
<description>I do not understand your request completely, maybe you tell us some more requirements of your implementation. The example you have given is invalid,</description>
<pubDate>25 Nov  2009 02:05:56 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/90046</link>
</item><item>
<title>RE: Need help regarding implementation of autosuggest using jquery</title>
<description>Hi Dhivya, you can iterate all terms in the index using a TermEnum, that can be retrieved using IndexReader.terms(Term startTerm). If you are intere</description>
<pubDate>25 Nov  2009 01:54:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/90045</link>
</item><item>
<title>Need help regarding implementation of autosuggest using jquery</title>
<description>Hi all,   Am using lucene 2.3.2 as a search engine in my e-paper site. So that i want the user to search the news. I achieved that objective but now a</description>
<pubDate>24 Nov  2009 23:05:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/90040</link>
</item><item>
<title>How to implement a GivenCharFilter using incrementToken</title>
<description>Hi,   I find it is very hard to implement a GivenCharFilter(extends TokenFilter)using incrementToken. My requirment is like this: I want to analyze</description>
<pubDate>24 Nov  2009 20:15:30 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/90032</link>
</item><item>
<title>Re: autosuggest - in the sense of autocomplete</title>
<description>On Mon, Nov 23, 2009 at 12:14 PM, Uwe Schindler &amp;lt;uwe@thetaphi.de&amp;gt; wrote: &amp;gt; If you just want to autocomplete the current term the user enters, &amp;gt; initi</description>
<pubDate>24 Nov  2009 17:56:30 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/90030</link>
</item><item>
<title>Re: Is Lucene a good choice for PB scale mailbox search?</title>
<description>For what it&amp;#039;s worth, AOL uses a Solr cluster to handle searches for @aol users. Each user has his own index. Otis -- Sematext is hiring -- http://se</description>
<pubDate>24 Nov  2009 15:20:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/90029</link>
</item><item>
<title>Re: Lucene IndexOutput</title>
<description>No, createOutput is designed to always create a new file. This fits with Lucene because Lucene is &amp;quot;write once&amp;quot; (never writes to the same file more th</description>
<pubDate>24 Nov  2009 12:51:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/90020</link>
</item><item>
<title>Re: Searching while optimizing</title>
<description>OK, I&amp;#039;ll add that to the javadocs; thanks. But the fact that you weren&amp;#039;t closing the old readers was probably also tying up lots of disk space... Mi</description>
<pubDate>24 Nov  2009 12:39:41 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/90019</link>
</item><item>
<title>Re: Searching while optimizing</title>
<description>Hi, this is good information. as I read your post I realized that I am supposed to commit after an optimize, which is something I do not currently do.</description>
<pubDate>24 Nov  2009 12:31:27 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/90018</link>
</item><item>
<title>Re: Searching while optimizing</title>
<description>On Tue, Nov 24, 2009 at 9:08 AM, vsevel &amp;lt;v.sevel@lombardodier.com&amp;gt; wrote: &amp;gt; Hi, just to make sure I understand correctly... After an optimize, withou</description>
<pubDate>24 Nov  2009 08:59:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89985</link>
</item><item>
<title>Re: NearSpansUnordered payloads</title>
<description>I am also having a hard time understanding the NewSpansUnordered isPayloadAvilable() method. For my test case where 2 tokens are at the same position</description>
<pubDate>24 Nov  2009 07:55:46 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89958</link>
</item><item>
<title>customized SpanQuery Payload usage</title>
<description>Hello, For certain span queries I construct problematically by piecing together my own SpanTermQueries I would like to enforce that Payload data is n</description>
<pubDate>24 Nov  2009 06:56:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89957</link>
</item><item>
<title>Re: SpanQuery for Terms at same position</title>
<description>yes that indeed works for me. thanks,  On Mon, Nov 23, 2009 at 5:50 PM, Paul Elschot &amp;lt;paul.elschot@xs4all.nl&amp;gt;wrote: &amp;gt; Op maandag 23 november 2009 2</description>
<pubDate>24 Nov  2009 06:17:09 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89934</link>
</item><item>
<title>Re: Searching while optimizing</title>
<description>Hi, just to make sure I understand correctly... After an optimize, without any reader, my index takes 30Gb on the disk. Are you saying that if I can e</description>
<pubDate>24 Nov  2009 06:08:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89933</link>
</item><item>
<title>Re: did you mean issue</title>
<description>On Nov 24, 2009, at 7:09 AM, m.harig wrote: &amp;gt; &amp;gt; What should i do now , could you make me clear ?? &amp;gt;  How are you constructing the SpellChecker?</description>
<pubDate>24 Nov  2009 05:13:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89932</link>
</item><item>
<title>Re: RamDirectory and FS at the same moment</title>
<description>Thanks a lot! of course performance :) I used 2.41 and now with 2.9.1 it works:)  Michael McCandless pisze: &amp;gt; Are you sure you even need to do this (</description>
<pubDate>24 Nov  2009 04:49:56 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89931</link>
</item><item>
<title>Re: did you mean issue</title>
<description>What should i do now , could you make me clear ??  Grant Ingersoll-6 wrote: &amp;gt; &amp;gt; &amp;gt; On Nov 24, 2009, at 1:16 AM, m.harig wrote: &amp;gt; &amp;gt;&amp;gt; &amp;gt;&amp;gt; String[] su</description>
<pubDate>24 Nov  2009 04:09:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89904</link>
</item><item>
<title>Re: did you mean issue</title>
<description>On Nov 24, 2009, at 1:16 AM, m.harig wrote: &amp;gt; &amp;gt; String[] suggestions = spellChecker.suggestSimilar(&amp;quot;hoem&amp;quot;, 3,indexReader, &amp;gt; &amp;quot;contents&amp;quot;, true); &amp;gt; &amp;gt;</description>
<pubDate>24 Nov  2009 04:01:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89903</link>
</item><item>
<title>Re: Searching while optimizing</title>
<description>I don&amp;#039;t really like that name, for the same reason (&amp;quot;create&amp;quot; and &amp;quot;new&amp;quot; imply that an entirely new reader is being created, which is far more costly th</description>
<pubDate>24 Nov  2009 02:22:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89902</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>Erick, Thanks for all your help so far. I&amp;#039;ll try and see if upgrading to 2.9.1 won&amp;#039;t cause us too much changes and if it&amp;#039;s stable enough. If upgradin</description>
<pubDate>24 Nov  2009 02:17:33 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89877</link>
</item><item>
<title>RE: Searching while optimizing</title>
<description>How about newReader()? ----- Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: uwe@thetaphi.de  &amp;gt; -----Original Messa</description>
<pubDate>24 Nov  2009 02:02:58 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89876</link>
</item><item>
<title>Re: Searching while optimizing</title>
<description>On Tue, Nov 24, 2009 at 1:44 AM, vsevel &amp;lt;v.sevel@lombardodier.com&amp;gt; wrote: &amp;gt; &amp;gt; 1) correct: I am using IndexWriter.getReader(). I guess I was assuming t</description>
<pubDate>24 Nov  2009 01:59:55 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89875</link>
</item><item>
<title>Re: RamDirectory and FS at the same moment</title>
<description>Are you sure you even need to do this (manage separate RAMDir)? Ie what&amp;#039;s driving this (performance)? IndexWriter already has its own RAM buffer for</description>
<pubDate>24 Nov  2009 01:45:02 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89874</link>
</item><item>
<title>Re: RamDirectory and FS at the same moment</title>
<description>Hi Rafal, If what I understand about your implementation is correct, you could try a parallelmultisearcher http://lucene.apache.org/java/2_9_1/api/co</description>
<pubDate>23 Nov  2009 23:16:15 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89866</link>
</item><item>
<title>Re: Is Lucene a good choice for PB scale mailbox search?</title>
<description>fulin tang wrote: &amp;gt; We are going to add full-text search for our mailbox service . &amp;gt; &amp;gt; The problem is we have more than 1 PB mails there , and obvious</description>
<pubDate>23 Nov  2009 22:56:50 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89865</link>
</item><item>
<title>RamDirectory and FS at the same moment</title>
<description>Hi all! i&amp;#039;ve just started my adventure with Lucene i&amp;#039;ve got one question  regarding indexing. Does Lucene have got built-in mechanism to store indexe</description>
<pubDate>23 Nov  2009 22:51:23 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89864</link>
</item><item>
<title>Re: Searching while optimizing</title>
<description>1) correct: I am using IndexWriter.getReader(). I guess I was assuming that was a privately owned object and I had no business dealing with its lifecy</description>
<pubDate>23 Nov  2009 22:44:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89863</link>
</item><item>
<title>Re: did you mean issue</title>
<description>String[] suggestions = spellChecker.suggestSimilar(&amp;quot;hoem&amp;quot;, 3,indexReader, &amp;quot;contents&amp;quot;, true); this is how am retrieving my did you mean words  Grant</description>
<pubDate>23 Nov  2009 22:16:50 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89855</link>
</item><item>
<title>Re: did you mean issue</title>
<description>String[] suggestions = spellChecker.suggestSimilar(&amp;quot;hoem&amp;quot;, 3,indexReader, &amp;quot;contents&amp;quot;, true); this is how am retrieving my did you words  Grant Inge</description>
<pubDate>23 Nov  2009 22:16:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89854</link>
</item><item>
<title>Re: Is Lucene a good choice for PB scale mailbox search?</title>
<description>A sharded architecture (i.e. smaller indexes) used by Google for example and implemented by open source in the Katta project may be best for scaling t</description>
<pubDate>23 Nov  2009 21:41:06 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89853</link>
</item><item>
<title>updating spell index</title>
<description>hello all   is there any way to update the spell index directory ? please any1 help me out of this. -- View this message in context: http://old.na</description>
<pubDate>23 Nov  2009 21:34:59 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89852</link>
</item><item>
<title>Re: Is Lucene a good choice for PB scale mailbox search?</title>
<description>Hi, I have not worked on a petascale (yet!) - mostly on the scale of tens of terabyes - but I do think Lucene would be very helpful for such usecase.</description>
<pubDate>23 Nov  2009 19:59:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89851</link>
</item><item>
<title>Is Lucene a good choice for PB scale mailbox search?</title>
<description>We are going to add full-text search for our mailbox service . The problem is we have more than 1 PB mails there , and obviously we don&amp;#039;t want to add</description>
<pubDate>23 Nov  2009 18:35:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89850</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>This was a really silly idea I had &amp;lt;G&amp;gt;. If your time is being spent in the scoring in the first place, keeping the Filter out of the query and checkin</description>
<pubDate>23 Nov  2009 16:14:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89839</link>
</item><item>
<title>Re: SpanQuery for Terms at same position</title>
<description>Op maandag 23 november 2009 20:07:58 schreef Christopher Tignor: &amp;gt; Also, I noticed that with the above edit to NearSpansOrdered I am getting &amp;gt; erroneo</description>
<pubDate>23 Nov  2009 14:50:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89838</link>
</item><item>
<title>Re: Searching while optimizing</title>
<description>When you say &amp;quot;getting a reader of the writer&amp;quot; do you mean writer.getReader()? Ie the new near real-time API in 2.9? For that API (an in general when</description>
<pubDate>23 Nov  2009 14:17:23 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89823</link>
</item><item>
<title>Searching while optimizing</title>
<description>Hi, I am using lucene 2.9.1 to index a continuous flow of events. My server keeps an index writer open at all time and write events as groups of a few</description>
<pubDate>23 Nov  2009 12:20:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89822</link>
</item><item>
<title>Re: Linking Fields to Documents possible?</title>
<description>Thanks guys, I get the point, it is best to reindex(hope it isnt very expensive). And yes, it&amp;#039;s true that the suppliers dont change often. I -- View</description>
<pubDate>23 Nov  2009 12:14:02 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89807</link>
</item><item>
<title>Re: SpanQuery for Terms at same position</title>
<description>Also, I noticed that with the above edit to NearSpansOrdered I am getting erroneous results fo normal ordered searches using searches like: &amp;quot;_n&amp;quot; foll</description>
<pubDate>23 Nov  2009 11:07:58 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89806</link>
</item><item>
<title>Re: ConcurrentMergeScheduler, Exception and transaction</title>
<description>IndexWriter will try the merge again, the next time it checks merges (eg after flushing a new segment, but not after adding a new document). You&amp;#039;ll o</description>
<pubDate>23 Nov  2009 09:45:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89797</link>
</item><item>
<title>RE: ConcurrentMergeScheduler, Exception and transaction</title>
<description>Thank you, Mike, for explanation. So I understand that all the data is kept even if any of these merging threads fail. Will Lucene keep attempting m</description>
<pubDate>23 Nov  2009 09:40:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89796</link>
</item><item>
<title>Re: SpanQuery for Terms at same position</title>
<description>Thanks so much for this. Using an un-ordered query, the -1 slop indeed returns the correct results, matching tokens at the same position. I tried th</description>
<pubDate>23 Nov  2009 09:26:54 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89795</link>
</item><item>
<title>RE: autosuggest - in the sense of autocomplete</title>
<description>If you just want to autocomplete the current term the user enters, initialize a TermEnum with the current entered term fragment. If you then iterate t</description>
<pubDate>23 Nov  2009 09:14:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89794</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>See: http://issues.apache.org/jira/browse/LUCENE-1427 &amp;lt;http://issues.apache.org/jira/browse/LUCENE-1427&amp;gt;Short form: this is fixed, but not until 2.9.</description>
<pubDate>23 Nov  2009 09:12:29 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89793</link>
</item><item>
<title>Re: autosuggest - in the sense of autocomplete</title>
<description>For auto complete, you could try the following: 1. Run a prefix query. [Could be a fuzzy query] 2. Index using something like ngrams. term : sample i</description>
<pubDate>23 Nov  2009 08:59:39 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89792</link>
</item><item>
<title>Re: SpanQuery for Terms at same position</title>
<description>Your trying -1 with ordered right? Try it with non ordered. Christopher Tignor wrote: &amp;gt; A slop of -1 doesn&amp;#039;t work either. I get no results returned.</description>
<pubDate>23 Nov  2009 08:59:28 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89791</link>
</item><item>
<title>Re: SpanQuery for Terms at same position</title>
<description>Op maandag 23 november 2009 17:27:56 schreef Christopher Tignor: &amp;gt; A slop of -1 doesn&amp;#039;t work either. I get no results returned. I think the problem</description>
<pubDate>23 Nov  2009 08:56:51 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89790</link>
</item><item>
<title>Re: SpanQuery for Terms at same position</title>
<description>A slop of -1 doesn&amp;#039;t work either. I get no results returned. this would be a *really* helpful feature for me if someone might suggest an implementat</description>
<pubDate>23 Nov  2009 08:27:56 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89789</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>I&amp;#039;ve taken TermsFilter from contrib which does exactly that and indeed the speed was reduced to half, which starts to be reasonable for my needs. I&amp;#039;v</description>
<pubDate>23 Nov  2009 08:01:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89783</link>
</item><item>
<title>Re: SpanQuery for Terms at same position</title>
<description>Tested it out. It doesn&amp;#039;t work. A slop of zero indicates no words between the provided terms. E.g. my query of &amp;quot;plan&amp;quot; &amp;quot;_n&amp;quot; returns entries like &amp;quot;co</description>
<pubDate>23 Nov  2009 06:20:47 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89782</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>Oh my goodness yes. No wonder nothing I suggested made any difference &amp;lt;G&amp;gt;. Ignore everything I&amp;#039;ve written.... OK, here&amp;#039;s something to try, and it goe</description>
<pubDate>23 Nov  2009 05:54:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89778</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>Erick, Maybe I didn&amp;#039;t make myself clear enough. I&amp;#039;m talking about high level filters used when searching. I construct a very big BooleanQuery and ad</description>
<pubDate>23 Nov  2009 05:12:54 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89777</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>Now I&amp;#039;m really confused, which usually means I&amp;#039;m making some assumptions that aren&amp;#039;t true. So here they are... 1&amp;gt; You&amp;#039;re talking about Filters that c</description>
<pubDate>23 Nov  2009 04:55:39 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89775</link>
</item><item>
<title>Re: Linking Fields to Documents possible?</title>
<description>There are some tricks you can apply, but they amount to keeping your own lists and manipulating them manually. As Ian says, Lucene isn&amp;#039;t a database, a</description>
<pubDate>23 Nov  2009 04:44:48 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89776</link>
</item><item>
<title>Re: scoring adjacent terms without proximity search</title>
<description>Hi Joel, I encounter the same problem. Could you please elaborate a bit on this? Many thanks, Liat 2009/11/2 Joel Halbert &amp;lt;joel@su3analytics.com&amp;gt;</description>
<pubDate>23 Nov  2009 03:55:41 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89761</link>
</item><item>
<title>autosuggest - in the sense of autocomplete</title>
<description>Sir,   I actually meant auto suggest as such available for google suggest similar to autocomplete. Where, users need not type the entire text and inst</description>
<pubDate>23 Nov  2009 02:58:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89760</link>
</item><item>
<title>Re: How to find the fields that are indexed?</title>
<description>By autosuggest, would you mean similar documents? In that case you could try the lucene &amp;#039;morelikethis&amp;#039; class. -- Anshum Gupta Naukri Labs! http://ai-</description>
<pubDate>23 Nov  2009 02:35:14 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89759</link>
</item><item>
<title>Re: How to find the fields that are indexed?</title>
<description>&amp;gt; That was a good solution to my problem and i found my fields for the document. Good. &amp;gt; Acutally i was trying it to find out how to implement autos</description>
<pubDate>23 Nov  2009 02:32:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89758</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>After commenting out the collector logic, the time is still more or less the same. Anyway, since without the filter collecting the documents is very f</description>
<pubDate>23 Nov  2009 02:18:37 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89744</link>
</item><item>
<title>Re: How to find the fields that are indexed?</title>
<description>That was a good solution to my problem and i found my fields for the document. Acutally i was trying it to find out how to implement autosuggest with</description>
<pubDate>23 Nov  2009 02:12:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89743</link>
</item><item>
<title>Re: How to find the fields that are indexed?</title>
<description>Use this tool to examine the index: http://www.getopt.org/luke/ I would also suggest getting hold of a Lucene book such as Lucene In Action (http://w</description>
<pubDate>23 Nov  2009 02:11:53 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89742</link>
</item><item>
<title>Re: Linking Fields to Documents possible?</title>
<description>Lucene is not a database. You&amp;#039;ll need to flatten the data and yes, that does mean duplication.  -- Ian.  On Mon, Nov 23, 2009 at 9:05 AM, sameerpat</description>
<pubDate>23 Nov  2009 02:09:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89741</link>
</item><item>
<title>Re: How to find the fields that are indexed?</title>
<description>Lucene will index and store the fields that you tell it to when a document is written to the index. In lucene 2.4 doc.getFields() returns a List of a</description>
<pubDate>23 Nov  2009 02:08:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89740</link>
</item><item>
<title>How to find the fields that are indexed?</title>
<description>Sir,   Am using lucene 2.3.2. I would like to know what are the fields that are been indexed?   Ex:   doc.get(&amp;quot;path&amp;quot;);   this statement returns the pa</description>
<pubDate>23 Nov  2009 01:42:51 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89739</link>
</item><item>
<title>Linking Fields to Documents possible?</title>
<description>Hi,  I have a requirement where I have a list of Suppliers(documents for lucene index) and a list of Products(documents again). Each Product has a s</description>
<pubDate>23 Nov  2009 01:06:06 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89738</link>
</item><item>
<title>Linking Fields to Documents possible?</title>
<description>Hi,  I have a requirement where I have a list of Suppliers(documents for lucene index) and a list of Products(documents again). Each Product has a s</description>
<pubDate>23 Nov  2009 01:05:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89737</link>
</item><item>
<title>Re: SpanQuery for Terms at same position</title>
<description>You are right Paul, 0 would not work, probably something less than zero, as Paul suggested. Give it a try and tell us if it worked ; ) On Sun, Nov 22</description>
<pubDate>22 Nov  2009 23:11:53 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89732</link>
</item><item>
<title>Re: did you mean issue</title>
<description>How are you invoking the spell checker?  On Nov 19, 2009, at 1:22 AM, m.harig wrote: &amp;gt; &amp;gt; hello all &amp;gt; &amp;gt;      i&amp;#039;ve a doubt in spell checker , w</description>
<pubDate>22 Nov  2009 17:07:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89723</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>To call clear, you can always downcast to AttributeImpl. But you need to know, that it may clear also other attributes (like if it is a Token). So set</description>
<pubDate>22 Nov  2009 11:30:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89687</link>
</item><item>
<title>Re: How to deal with Token in the new TS API</title>
<description>Ok I see you fixed it at the same time I sent the email :). I think I get it ... so far. So far I had to cache just TermAttribute. I think it&amp;#039;ll get</description>
<pubDate>22 Nov  2009 11:25:49 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89686</link>
</item><item>
<title>Re: How to deal with Token in the new TS API</title>
<description>I assume termAtt is the input&amp;#039;s TermAttribute, right? Therefore it has no copyTo ... What I&amp;#039;ve done so far is create a TermAttribute like you propose</description>
<pubDate>22 Nov  2009 11:23:50 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89685</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>Sorry small error: Class Initializer: private final AttributeSource lastState = cloneAttributes(); private final TermAttribute lastTermAtt = lastStat</description>
<pubDate>22 Nov  2009 11:22:16 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89684</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>The cast to TermAttributeImpl may not work if the factory creates a Token... So declare termBuf as TermAttribute (without impl). To clear, you can al</description>
<pubDate>22 Nov  2009 11:21:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89683</link>
</item><item>
<title>Re: How to deal with Token in the new TS API</title>
<description>Did you mean something like: TermAttributeImpl termBuf = (TermAttributeImpl) input.getAttributeFactory().createAttributeInstance(TermAttribute.class)</description>
<pubDate>22 Nov  2009 11:14:46 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89682</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>Another idea, what you can also do is, create an AttributeSource instance in your TokenStream one time using the AttributeSource.cloneAttributes() cal</description>
<pubDate>22 Nov  2009 11:14:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89681</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>I said, you *could* if it would be exposed. But the State is a holder class without functionality. Because the internals are impl dependent, maybe we</description>
<pubDate>22 Nov  2009 11:03:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89680</link>
</item><item>
<title>Re: How to deal with Token in the new TS API</title>
<description>Yes I can clone the term itself by instantiating a TermAttributeImpl, which is better than storing the String, because the latter always allocates cha</description>
<pubDate>22 Nov  2009 10:52:39 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89679</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>Hmmm, could you show us what you do in your collector? Because one of the gotchas about a collector is loading the documents in the inner loop. Quick</description>
<pubDate>22 Nov  2009 10:36:38 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89678</link>
</item><item>
<title>RE: Top field count scoring across documents</title>
<description>Hi Jake,   Many thanks for your quick reply. I shall check these out.   Thanks! Peter   &amp;gt; Date: Sun, 22 Nov 2009 09:20:24 -0800 &amp;gt; Subject: Re:</description>
<pubDate>22 Nov  2009 09:45:30 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89664</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>Op zondag 22 november 2009 17:23:53 schreef Eran Sevi: &amp;gt; Thanks for the tips. &amp;gt; &amp;gt; I&amp;#039;m still using version 2.4 so I can&amp;#039;t use MultiTermQueryWrapperFil</description>
<pubDate>22 Nov  2009 09:37:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89663</link>
</item><item>
<title>Re: Top field count scoring across documents</title>
<description>Peter,   You want to do a facet query. This kind of functionality is not in Lucene-core (sadly), but both Solr (the fully featured search applicati</description>
<pubDate>22 Nov  2009 09:20:24 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89662</link>
</item><item>
<title>Top field count scoring across documents</title>
<description>Hello Lucene Experts,  I wonder if someone might be able to shed some insight on this interesting scoring question:  The problem: Build a search que</description>
<pubDate>22 Nov  2009 08:42:09 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89661</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>I think it shouldn&amp;#039;t take X5 times longer since the number of results is only about X2 times larger (and much smaller than the number of terms in the</description>
<pubDate>22 Nov  2009 08:41:26 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89660</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>Thanks for the tips. I&amp;#039;m still using version 2.4 so I can&amp;#039;t use MultiTermQueryWrapperFilter but I&amp;#039;ll definitely try to re-group the the terms that ar</description>
<pubDate>22 Nov  2009 08:23:53 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89658</link>
</item><item>
<title>RE: Efficient filtering advise</title>
<description>Maybe this helps you, but read the docs, it will work only with single-value-fields: http://lucene.apache.org/java/2_9_1/api/core/org/apache/lucene/se</description>
<pubDate>22 Nov  2009 08:10:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89627</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>Hmmm, I&amp;#039;m not very clear here. Are you saying that you effectively form 10-50K filters and OR them all together? That would be consistent with the 50K</description>
<pubDate>22 Nov  2009 07:57:02 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89659</link>
</item><item>
<title>Re: Efficient filtering advise</title>
<description>Try a MultiTermQueryWrapperFilter instead of the QueryFilter. I&amp;#039;d expect a modest gain in performance. In case it is possible to form a few groups of</description>
<pubDate>22 Nov  2009 07:51:15 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89626</link>
</item><item>
<title>Efficient filtering advise</title>
<description>Hi, I have a need to filter my queries using a rather large subset of terms (can be 10K or even 50K). All these terms are sure to exist in the index</description>
<pubDate>22 Nov  2009 06:48:39 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89625</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>&amp;gt; Because that&amp;#039;d mean I&amp;#039;ll check for abbreviations for every token. Which is &amp;gt; a &amp;gt; big performance loss. That way, I can just check abbr if I encounte</description>
<pubDate>22 Nov  2009 06:42:50 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89624</link>
</item><item>
<title>Re: How to deal with Token in the new TS API</title>
<description>Because that&amp;#039;d mean I&amp;#039;ll check for abbreviations for every token. Which is a big performance loss. That way, I can just check abbr if I encountered a</description>
<pubDate>22 Nov  2009 06:37:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89623</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>If you just want to lookup if &amp;quot;Mr&amp;quot; is an abbreviation, why not look it up when you handle that token and set a boolean variable in the TS (lastTokenWa</description>
<pubDate>22 Nov  2009 06:34:56 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89622</link>
</item><item>
<title>Re: How to deal with Token in the new TS API</title>
<description>What I&amp;#039;ve done is: State state = in.captureState(); ... // Upon new call to incrementToken(). State tmp = in.captureState(); in.restoreState(state);</description>
<pubDate>22 Nov  2009 06:27:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89621</link>
</item><item>
<title>Re: How to deal with Token in the new TS API</title>
<description>Perhaps I misunderstand something. The current use case I&amp;#039;m trying to solve is - I have an abbreviations TokenFilter which reads a token and stores it</description>
<pubDate>22 Nov  2009 05:57:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89604</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>Use captureState and save the state somewhere. You can restore the state with restoreState to the TokenStream. CachingTokenFilter does this. So the n</description>
<pubDate>22 Nov  2009 05:33:11 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89603</link>
</item><item>
<title>Re: How to deal with Token in the new TS API</title>
<description>ok so from what I understand, I should stop working w/ Token, and move to working w/ the Attributes. addAttribute indeed does not work. Even though i</description>
<pubDate>22 Nov  2009 05:28:56 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89602</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>&amp;gt; But I do use addAttribute(Token.class), so I don&amp;#039;t understand why you say &amp;gt; it&amp;#039;s not possible. And I completely don&amp;#039;t understand why the new API &amp;gt; a</description>
<pubDate>22 Nov  2009 04:58:16 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89601</link>
</item><item>
<title>Re: SpanQuery for Terms at same position</title>
<description>Op zondag 22 november 2009 04:47:50 schreef Adriano Crestani: &amp;gt; Hi, &amp;gt; &amp;gt; I didn&amp;#039;t test, but you might want to try SpanNearQuery and set slop to zero.</description>
<pubDate>22 Nov  2009 04:50:27 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89600</link>
</item><item>
<title>Re: How to deal with Token in the new TS API</title>
<description>But I do use addAttribute(Token.class), so I don&amp;#039;t understand why you say it&amp;#039;s not possible. And I completely don&amp;#039;t understand why the new API allows</description>
<pubDate>22 Nov  2009 04:44:00 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89599</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>&amp;gt; &amp;gt; I want to add Token.class, and then work w/ Token. Not TermAttribute, &amp;gt; PosIncrAttribute, OffsetAttribute, PayloadAttribute and TypeAttribute &amp;gt; (</description>
<pubDate>22 Nov  2009 04:36:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89598</link>
</item><item>
<title>RE: How to deal with Token in the new TS API</title>
<description>&amp;gt; I started to migrate my Analyzers, Tokenizer, TokenStreams and &amp;gt; TokenFilters &amp;gt; to the new API. Since the entire set of classes handled Token before</description>
<pubDate>22 Nov  2009 04:35:58 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/89597</link>
</item>
</channel>
</rss>
