<?xml version="1.0" encoding="iso-8859-1" ?>
<?xml-stylesheet title="XSL_formatting" type="text/xsl" href="/images/lists/rssstyle2.xsl"?>
<rss version="2.0">
<channel>
<title>Lucene | Java-User</title>
<description>Mailing List Archive by Gossamer Threads</description>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/</link>
<language>en-us</language>
<copyright>(c) Gossamer Threads Inc. All rights reserved.</copyright>
<lastBuildDate>12 Feb  2012 08:09:04 -0800</lastBuildDate>
<ttl>120</ttl>
<image>
<title>Gossamer Threads | Lucene | Java-User</title>
<width>75</width>
<height>23</height>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/</link>
<url>http://www.gossamer-threads.com/images/lists/rss_logo.jpg</url>
</image>
<item>
<title>norm for a document in a CustomScoreQuery</title>
<description>I was looking into the possibility that _some_ subqueries might discount (actually remove) field norms. I&amp;#039;m trying out the view that in general while l</description>
<pubDate>10 Feb  2012 14:59:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145638</link>
</item><item>
<title>Nested BlockJoinQuery</title>
<description>I&amp;#039;m trying to learn more about using BlockJoinQuery in our search application and I came across this blog post by Mike McCandless: http://blog.mikemcc</description>
<pubDate>10 Feb  2012 14:31:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145671</link>
</item><item>
<title>Re: Filter and IndexSearcher in Lucene 4.0 (trunk)</title>
<description>Hi, I apologise upfront for the trivial question. I have an IndexSearcher and I am applying a FieldCacheTermsFilter filter on it to only retrieve doc</description>
<pubDate>10 Feb  2012 09:43:29 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145594</link>
</item><item>
<title>Access next token in a stream</title>
<description>Hello, I want to implement my custom filter. My question is quite simple, but I cannot find a solution to it no matter how I try: How can I access the</description>
<pubDate>09 Feb  2012 11:18:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145476</link>
</item><item>
<title>Re: confirm unsubscribe from java-user@lucene.apache.org</title>
<description>Kind regards, Christof Schablinski Devoteam Danet GmbH, Waldburgstrasse 17 - 19, 70563 Stuttgart, Germany Phone: +49 6151 868 8730, Fax: +</description>
<pubDate>09 Feb  2012 07:32:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145439</link>
</item><item>
<title>IndexWriter in 3.5</title>
<description>Hello all, In 3.0.3 the following code works fine but in 3.5, it throws exception &amp;quot;No segments found&amp;quot;. In case of 3.0.3, Just creating writer will cr</description>
<pubDate>09 Feb  2012 04:02:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145396</link>
</item><item>
<title>analyzer per document</title>
<description>Hello All, I have a requirement of using different analyzer per document. How can we do this? My analyzer would be locale specific.  I have a file</description>
<pubDate>09 Feb  2012 04:01:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145395</link>
</item><item>
<title>Fwd: Delete words in a specific increment Position with Lucene</title>
<description>-------- Original Message -------- Subject:    Delete words in a specific increment Position with Lucene Date:  Tue, 07</description>
<pubDate>09 Feb  2012 03:34:41 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145394</link>
</item><item>
<title>Index writing performance of 3.5</title>
<description>Hello, I am currently evaluating Lucene 3.5.0 for upgrading from 3.0.3, and in the context of my usage, the most important parameter is index writing</description>
<pubDate>08 Feb  2012 20:28:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145360</link>
</item><item>
<title>Please explain DisjunctionMaxQuery JavaDoc.</title>
<description>What the heck is the JavaDoc for DisjunctionMaxQuery saying: &amp;quot;A query that generates the union of documents produced by its subqueries, and that</description>
<pubDate>08 Feb  2012 14:42:11 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145347</link>
</item><item>
<title>Working with MemoryIndex results</title>
<description>Hello, I&amp;#039;m using a MemoryIndex in order to search a block of in-memory text using a lucene query. I&amp;#039;m able to search the text, produce a result, and</description>
<pubDate>08 Feb  2012 13:36:44 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145333</link>
</item><item>
<title>slow speed of searching</title>
<description>Hi, I have about 6.5 million documents, which lead to a 1.5G index. Searching for a couple of terms, like &amp;quot;dvd&amp;quot; and &amp;quot;price&amp;quot;, takes about 0.1 second.</description>
<pubDate>08 Feb  2012 04:44:11 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145269</link>
</item><item>
<title>how to create directory on a remote server protected by password</title>
<description>Hi, I want to create a writer on a folder (&amp;quot;fsdir&amp;quot;) in a remote server (&amp;quot;10.161.1.23&amp;quot;), which has user id &amp;quot;xyz&amp;quot; and password &amp;quot;pwd&amp;quot;. How can I do so?</description>
<pubDate>08 Feb  2012 04:12:33 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145248</link>
</item><item>
<title>NRTManager and AlreadyClosedException</title>
<description>Hi, I am using NRTManager and NRTManagerReopenThread. Though I don&amp;#039;t close either writer or the reopen thread, I receive AlreadyClosedException as fo</description>
<pubDate>07 Feb  2012 21:20:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145211</link>
</item><item>
<title>Applying LUCENE-3653 patch to Lucene 3.0.3</title>
<description>Hi, My company is using an older version of Lucene (3.0.3). In my profiling results with 3.0.3, I have found that my app&amp;#039;s threads were blocked due t</description>
<pubDate>07 Feb  2012 13:45:28 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145148</link>
</item><item>
<title>Need to enforce logging of Lucene queries</title>
<description>I have a set of Lucene indexes for which I need to log all accesses and possibly queries. I can use kernel-level auditing to record file accesses, bu</description>
<pubDate>06 Feb  2012 14:45:47 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/145025</link>
</item><item>
<title>How best to handle a reasonable amount of data (25TB+)</title>
<description>Hi, I have a little bit of an unusual set of requirements, and I am looking for advice. I have researched the archives, and seen some relevant posts,</description>
<pubDate>05 Feb  2012 18:50:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144934</link>
</item><item>
<title>Configure writer to write to FSDirectory?</title>
<description>Hi, I build a RAMDirectory on an FSDirectory, and would like the writer associated with the RAMDirectory to periodically write to hard drive. Is thi</description>
<pubDate>04 Feb  2012 22:56:15 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144868</link>
</item><item>
<title>recording a universal ID from DocID in a CustomScoreQuery</title>
<description>My Index does NOT have a simple UID, it uses the file PATH to the file as the unique key. I was implementing a CustomScoreQuery which not only tweaked</description>
<pubDate>03 Feb  2012 16:09:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144829</link>
</item><item>
<title>Performance improvements for fuzzy queries ?</title>
<description>Using Lucene 3.5, I created a query parser based on the dismax parser, but in order to get matches on misspellings etc., I additionally do a fuzzy</description>
<pubDate>03 Feb  2012 07:01:42 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144794</link>
</item><item>
<title>PayloadNearQuery and AveragePayloadFunction</title>
<description>Hi List Apologies for such a long message. I have tried to include everything, that you might need to know to answer my question.  I am having diffic</description>
<pubDate>02 Feb  2012 08:57:00 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144713</link>
</item><item>
<title>When does Query Parser do its analysis ?</title>
<description>So I subclass Query Parser and give it query dug up then debugging shows it calls getFieldQuery(String field, String queryText, boolean quoted) twi</description>
<pubDate>01 Feb  2012 13:32:44 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144661</link>
</item><item>
<title>Join between indexes</title>
<description>Assume we have a Lucene index over which several types of analyses are performed.  Assume that the conclusions of some analysis require that new toke</description>
<pubDate>01 Feb  2012 06:05:11 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144601</link>
</item><item>
<title>lucene-3.0.3</title>
<description>Hi,   lucene-3.0.3 can be used for searching a text from PDF, xlsx, docx, doc, xls, msg, TXT files. For this we have any common function to accompli</description>
<pubDate>01 Feb  2012 05:07:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144595</link>
</item><item>
<title>Lucene 2.9.4 Wildcard Search, Boost and Sorting</title>
<description>Hi, I have an issue with Lucene 2.9.4 and sorting of wildcard queries. If I set a boost to some documents during indexing like this: doc.setBoost(1</description>
<pubDate>01 Feb  2012 02:41:33 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144581</link>
</item><item>
<title>upgrading from 3.0.3 to 3.5.0</title>
<description>Hello all, I am upgrading from 3.0.3 to 3.5.0.  1) NumberTools is deprecated. I am converting long to string and storing it in Index. Now this is de</description>
<pubDate>01 Feb  2012 00:43:38 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144542</link>
</item><item>
<title>too many boolean clauses</title>
<description>Hi all, I have been using lucene with Hibernate to index the data. Each document is indexed with two fields: id and content. Each document correspond</description>
<pubDate>31 Jan  2012 23:50:52 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144538</link>
</item><item>
<title>Apache Lucene file search</title>
<description>Hi, I learnt about Lucene from Google and I thought of implementing it at my company. I don&amp;#039;t want to use Lucene as a web search application. I ha</description>
<pubDate>31 Jan  2012 23:40:22 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144543</link>
</item><item>
<title>Lucene appears to use memory maps after unmapping them</title>
<description>Hi all. I&amp;#039;ve found a rather frustrating issue which I can&amp;#039;t seem to get to the bottom of. Our application will crash with an access violation around</description>
<pubDate>31 Jan  2012 16:16:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144512</link>
</item><item>
<title>Phrase Queries vs. SpanTermQueries exact phrases vs. stop words</title>
<description>In Lucene, 3.4 I recently implemented &amp;quot;Translating PhraseQuery to SpanNearQuery&amp;quot; (see Lucene in Action, page 220) because I wanted _order_ to matter.</description>
<pubDate>31 Jan  2012 12:48:02 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144481</link>
</item><item>
<title>Lucene 3.5 Payloads</title>
<description>Working with Lucene 3.5, I&amp;#039;d like to append a payload to a specific field in the index, at indexing time. To get that, I use the following code to pro</description>
<pubDate>31 Jan  2012 12:25:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144480</link>
</item><item>
<title>using character &amp;#039;%&amp;#039; in queries (Lucene v3.1.0)</title>
<description>Hi,  I&#039;m using lucene on Hebrew MySql tables. I used ngram (1-15 gram sizes) in my name analyzer and the only thing that doesn&#039;t work for me is when</description>
<pubDate>31 Jan  2012 09:32:04 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144459</link>
</item><item>
<title>Lucene Site Feedback (and gift cards)?</title>
<description>Hey Guys, As you might know, we have been working hard on building a site that would help users use and understand Lucene. We have been playing aroun</description>
<pubDate>31 Jan  2012 08:35:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144458</link>
</item><item>
<title>Searching a string using lucene</title>
<description>Hello,  I&#039;m having a bit of trouble Googling this, so I&#039;m hoping someone can point me in the right direction.  We have a system which generates bl</description>
<pubDate>31 Jan  2012 07:50:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144439</link>
</item><item>
<title>Boost term according to phonetic representation</title>
<description>Consider a people index, containing People documents with the following names: Doc 1 [name: &amp;quot;Marcus&amp;quot;] Doc 2 [name: &amp;quot;Markus&amp;quot;] Doc 3 [name: &amp;quot;Mharcus&amp;quot;]</description>
<pubDate>30 Jan  2012 14:35:53 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144373</link>
</item><item>
<title>Custom Payload Analyzer and Query</title>
<description>I&amp;#039;m working on providing advanced searching for annotated Medical Documents (using UIMA). In the context of an annotated document, I identify relev</description>
<pubDate>30 Jan  2012 14:24:39 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144372</link>
</item><item>
<title>Differences between BooleanQuery and QueryParser</title>
<description>Is there any difference, from a performance standpoint (or any other standpoint whatsoever), between instantiating a query using QueryParser and Boole</description>
<pubDate>30 Jan  2012 13:55:00 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144357</link>
</item><item>
<title>Does Fuzzy Search score the same as Exact Match</title>
<description>All things being equal, does a fuzzy match give the same score as an exact match? I.e., if I do a search for farmin and it matches two docs one on term</description>
<pubDate>28 Jan  2012 01:32:41 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144162</link>
</item><item>
<title>How to avoid filtering stop words like &amp;quot;IS&amp;quot; in StandardAnalyzer</title>
<description>Hi, I don&amp;#039;t want to filter certain stop words within the StandardAnalyzer. Can I do so? Ideally, I would like to have a customized StandardAnalyzer.</description>
<pubDate>27 Jan  2012 20:40:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144156</link>
</item><item>
<title>deprecated optimize()!</title>
<description>After reading all about the renaming of optimize() and updating my Lucene libraries to 3.4, I was surprised and confused by what I found. I have a 1</description>
<pubDate>27 Jan  2012 15:18:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144141</link>
</item><item>
<title>Null scorer constructed by TermQuery</title>
<description>Hi! I have a Solr-constructed index, which I read with this code: Directory directory = FSDirectory.open(file); IndexReader reader = IndexReader.ope</description>
<pubDate>27 Jan  2012 07:39:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144094</link>
</item><item>
<title>NGraming document for similar documents matching</title>
<description>Hi All, I am working on a project to find similar documents for the one being processed by a job. These documents talk about the functional issues s</description>
<pubDate>26 Jan  2012 15:41:58 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144025</link>
</item><item>
<title>Distributed index: Infinispan Directory or GlusterFS?</title>
<description>Hi, I am going to face very soon the need of having a big number of small indexes directly accessible for R/W from N machines. I am evaluating Infin</description>
<pubDate>26 Jan  2012 10:21:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/144008</link>
</item><item>
<title>BlockJoinQuery in text queries</title>
<description>Hi all, I am thinking about the best way to use BlockJoinQuery to make joins for child documents. Is there any QueryParser implementation that can ha</description>
<pubDate>26 Jan  2012 10:14:55 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143997</link>
</item><item>
<title>Find similar documents of different types</title>
<description>Hi list, We have two different document types with different fields each. My problem is given one document (Doc) from type1, find similar ones of typ</description>
<pubDate>26 Jan  2012 08:34:20 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143996</link>
</item><item>
<title>Query term counting, again...</title>
<description>Hi all, After much code and forum searching, I&amp;#039;ve hit a frustrating point that should be more obvious. I&amp;#039;ve trolled through a ton of postings and mes</description>
<pubDate>25 Jan  2012 15:36:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143935</link>
</item><item>
<title>Ignore this - just testing - restrict fuzzy search to longer words</title>
<description>--------------------------------- Lance  --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-uns</description>
<pubDate>25 Jan  2012 14:30:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143932</link>
</item><item>
<title>Cleaning up writer after certain idle time?</title>
<description>Hi, I am using multiple writer instances in a web service. Some instances are busy all the time, while some aren&amp;#039;t. I wonder how to configure the wri</description>
<pubDate>25 Jan  2012 14:01:46 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143923</link>
</item><item>
<title>Multiple document types</title>
<description>It seems that it is not possible to have multiple document types defined in a single solr schema.xml file. If, in fact, this is not possible, then, wh</description>
<pubDate>25 Jan  2012 13:49:15 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143922</link>
</item><item>
<title>Lucene 4 getSpans not retrieving spans</title>
<description>Goofing off with my index, I ran across this example http://www.lucidimagination.com/blog/2009/05/26/accessing-words-around-a-positional-match-in-luce</description>
<pubDate>24 Jan  2012 15:37:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143865</link>
</item><item>
<title>Lucene 4.0 Get All Index Terms</title>
<description>Hi all, Looking at some older Lucene examples, I noticed for older versions of lucene that IndexReader came with a handy terms() method that would re</description>
<pubDate>24 Jan  2012 13:10:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143858</link>
</item><item>
<title>weightage of each word according to precedence in document</title>
<description>Hi  how can we assign custom score for each token/word.  For Ex I have document  1  pqrst uvwx abcd 2  abcd pqrst uvwx 3  pqrst uvwx lm</description>
<pubDate>24 Jan  2012 09:08:37 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143842</link>
</item><item>
<title>comparing index fields within a query</title>
<description>Hi Everyone I have a problem where I need to compare two indexed fields as part of a query. For instance: modified_date[1970 to 2012] AND NOT delet</description>
<pubDate>23 Jan  2012 02:33:26 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143755</link>
</item><item>
<title>null pointer exception in indexwriter.close (using ramdirectory in google app engine)</title>
<description>Hi, I am working on getting lucene indexing working on *Google App Engine*. I am using a *ramdirectory* . I am facing a null pointer exception when I</description>
<pubDate>22 Jan  2012 16:24:17 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143732</link>
</item><item>
<title>Finding city fuzzily but most accurate is most relevant</title>
<description>Hi, I&amp;#039;m trying to select city names in a way that goes easy on the spelling mistakes with the most accurate match first. My index for the city name f</description>
<pubDate>21 Jan  2012 07:38:54 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143652</link>
</item><item>
<title>[Lucene Spatial] Issues with CartesianPolyFilterBuilder.getShapeLoop</title>
<description>Hi, I&amp;#039;m currently working on integrating lucene spatial into the search engine of my customer but I&amp;#039;m facing a problem: If I ask to CartesianPoly</description>
<pubDate>20 Jan  2012 06:51:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143614</link>
</item><item>
<title>restrict fuzzy search to longer words</title>
<description>Hi, Could you please help me with a quick question - Is there a way to restrict lucene/solr fuzzy search to only analyze words that have more than 5</description>
<pubDate>19 Jan  2012 12:08:46 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143578</link>
</item><item>
<title>any tips for upgrading Lucene 3.0.3 -&amp;gt; 3.5.0?</title>
<description>I&amp;#039;m hoping to upgrade Lucene on a local code base from 3.0.3 to 3.5.0; is there a good guide out there for particular pitfalls that I should worry abo</description>
<pubDate>19 Jan  2012 11:01:39 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143577</link>
</item><item>
<title>NRTManager, NRTManagerReopenThread and ExecutorServices example</title>
<description>Hi, can any of you provide a working code example that utilizes the NRTManager, NRTManagerReopenThread and ExecutorServices instances? The limited av</description>
<pubDate>18 Jan  2012 09:45:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143476</link>
</item><item>
<title>Error when opening a lucene index: Map failed</title>
<description>Hello, I am having problems opening a lucene index. The index has been created on the same machine. The size of the index is 44G. It&amp;#039;s a 64-bit machine ru</description>
<pubDate>17 Jan  2012 03:05:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143371</link>
</item><item>
<title>Creating an IndexReader for a subset from original IndexReader object</title>
<description>Hi! I am trying to extend &amp;quot;mahout lucene.vector&amp;quot; driver, so that it can be fed with arbitrary key-value constraints on solr schema fields (and gen</description>
<pubDate>16 Jan  2012 08:06:29 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143298</link>
</item><item>
<title>Query building performance</title>
<description>I have a situation where there are users that create n keywords. I&amp;#039;m storing them as individual DB fields for aggregating scores and then building the</description>
<pubDate>16 Jan  2012 07:59:58 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143297</link>
</item><item>
<title>LUCENE_35 index keyword analyzer only doesn&amp;#039;t like indexed sentences</title>
<description>Dear Lucene-developers, I switched to using Lucene 3.5 a few weeks ago and suddenly sentences are not correctly indexed anymore. Basically, fields ca</description>
<pubDate>16 Jan  2012 07:30:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143295</link>
</item><item>
<title>Import jar to lucene ant build script</title>
<description>I&amp;#039;m having trouble including the guava (http://code.google.com/p/guava-libraries/) library in my ant build script for lucene (lucene-3.5.0/build.xml).</description>
<pubDate>15 Jan  2012 18:46:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143235</link>
</item><item>
<title>Lucene on HPC podcast</title>
<description>I host a high performance computing podcast at www.rce-cast.com. We would like to have a developer or two from Lucene to chat with us on the show. I</description>
<pubDate>15 Jan  2012 18:25:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143228</link>
</item><item>
<title>ArrayIndexOutOfBoundsException: -65536</title>
<description>Hi friends, Has anyone met the ArrayIndexOutOfBoundsException: -65536 described in https://issues.apache.org/jira/browse/LUCENE-1995 after it was declared being</description>
<pubDate>15 Jan  2012 16:21:45 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143223</link>
</item><item>
<title>How NRTManagerReopenThread works with Java Executor framework?</title>
<description>I saw the link, https://builds.apache.org/job/Lucene-3.x/javadoc/contrib-misc/org/apache/lucene/index/NRTManagerReopenThread.html, which talks about h</description>
<pubDate>15 Jan  2012 10:18:18 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143202</link>
</item><item>
<title>best query for one-box search string over multiple types &amp;amp; fields</title>
<description>hi all, short of it: i want &amp;quot;queen bohemian rhapsody&amp;quot; to return that song named &amp;quot;Bohemian Rhapsody&amp;quot; by the artist named &amp;quot;Queen&amp;quot;, rather than songs wi</description>
<pubDate>14 Jan  2012 22:19:16 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143189</link>
</item><item>
<title>custom scoring</title>
<description>the following message comes from Explanation explain  0.09375 = (MATCH) fieldWeight(name:85 in 8687), product of   1.0 = tf(termFreq(name:85)=1)</description>
<pubDate>13 Jan  2012 21:20:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143113</link>
</item><item>
<title>Retrieving offsets</title>
<description>I&amp;#039;m having a set of issues in trying to use Lucene that are all connected to the difficulty of retrieving offsets. I need some advice on how best to</description>
<pubDate>13 Jan  2012 18:33:17 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143109</link>
</item><item>
<title>how to preserve whitespaces etc when tokenizing stream?</title>
<description>I am trying to perform a &amp;quot;translation&amp;quot; of sorts of a stream of text. More specifically, I need to tokenize the input stream, look up every term in a s</description>
<pubDate>13 Jan  2012 08:44:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143051</link>
</item><item>
<title>Is Lucene a good candidate for a Google-like search engine?</title>
<description>Just curious about that. Any thoughts? Thanks</description>
<pubDate>12 Jan  2012 16:49:39 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143016</link>
</item><item>
<title>10 million entities and 100 million related information</title>
<description>I have 10MM entities, for each of which I will index 10-20 fields. Also, I will have to index 100MM related information of the entities, and each piec</description>
<pubDate>12 Jan  2012 16:48:08 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143015</link>
</item><item>
<title>extractterms Output</title>
<description>Hi all - thanks in advance for any help... I have an app that aggregates keyword performance through incoming messages. A message comes in, I index i</description>
<pubDate>12 Jan  2012 12:36:39 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/143009</link>
</item><item>
<title>Is it necessary to create a new searcher?</title>
<description>I am currently using the following statement at the end of each index writing, although I don&amp;#039;t know if the writing modifies the indexes or not: is =</description>
<pubDate>11 Jan  2012 14:51:55 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142936</link>
</item><item>
<title>is it possible to index wiki markup files?</title>
<description>Hi, my name is Reyna Melara, I&amp;#039;m a PhD student from Mexico, and I have a set of 11,051,447 files with txt extension but the content of each file is in</description>
<pubDate>11 Jan  2012 11:13:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142907</link>
</item><item>
<title>Unsubscribe failure</title>
<description>I tried to unsubscribe from this list, without success. I sent an email to &amp;#039;java-user-unsubscribe@lucene.apache.org&amp;#039;, I received the &amp;quot;please confirm&amp;quot;</description>
<pubDate>11 Jan  2012 09:44:01 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142885</link>
</item><item>
<title>Seems contradictory -- IndexWriter in handling multiple threads</title>
<description>I have read a lot about IndexWriter and multi-threading over the Internet. It seems to me that the normal practice is: 1) use the same indexwriter inst</description>
<pubDate>11 Jan  2012 09:19:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142884</link>
</item><item>
<title>Call for Submission Berlin Buzzwords 2012 - http://berlinbuzzwords.de</title>
<description>Call for Submission Berlin Buzzwords 2012 - Search, Store, Scale  -- June 4 / 5. 2012 The event will comprise presentations on scalable data process</description>
<pubDate>11 Jan  2012 05:15:48 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142860</link>
</item><item>
<title>Large data set or data corpus</title>
<description>Hello all, Recently I saw a couple of discussions in a LinkedIn group about generating large data set or data corpus. I have compiled the same into an a</description>
<pubDate>11 Jan  2012 03:21:16 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142853</link>
</item><item>
<title>SIGSEGV when indexing documents.</title>
<description>I have a collection of 50 million documents and I hit the SIGSEGV error. For every 10000 documents I perform a commit. The logs and the question have bee</description>
<pubDate>11 Jan  2012 00:28:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142834</link>
</item><item>
<title>shared instance of IndexWriter doesn&amp;#039;t improve performance</title>
<description>Hi, I use the same writer instance for multiple threads. It turns out that the time to finish jobs is longer than when creating a new writer instance in e</description>
<pubDate>10 Jan  2012 17:32:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142824</link>
</item><item>
<title>Score exact matches higher than matches that match analysed text but not original text</title>
<description>My analyser strips out accents as often these are not entered correctly, so assume there are two documents in the database with default field contai</description>
<pubDate>10 Jan  2012 01:12:50 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142757</link>
</item><item>
<title>Remoting Lucene</title>
<description>Hi all. I want to access a Lucene index remotely. I&amp;#039;m aware of a couple of options for it which seem to operate more or less at the IndexSearcher le</description>
<pubDate>09 Jan  2012 18:59:52 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142748</link>
</item><item>
<title>Tamper resistant index</title>
<description>Hi, I&amp;#039;m investigating storing syslog data using Lucene (via Solr or Elasticsearch, undecided at present). The syslogs belong to systems under the sco</description>
<pubDate>09 Jan  2012 07:27:11 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142704</link>
</item><item>
<title>3.5.0 javadocs link missing?</title>
<description>Hi, The &amp;quot;Documentation&amp;quot; link on http://lucene.apache.org/java/docs/index.html expands to list Release 3.4.0, 3.3.0, etc. but not 3.5.0. http://lucen</description>
<pubDate>09 Jan  2012 02:54:47 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142696</link>
</item><item>
<title>How to merge indices in ram</title>
<description>Hi, How can I merge multiple indices in RAM while not impacting search? Thanks</description>
<pubDate>08 Jan  2012 23:06:15 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142686</link>
</item><item>
<title>How to load only part of index files on hard drive to ram</title>
<description>Hi, I have a folder containing a few industry categories. I would like to load only some of the categories into RAMDirectory. Can I use some queries t</description>
<pubDate>08 Jan  2012 20:56:37 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142684</link>
</item><item>
<title>Build RAMDirectory on FSDirectory, and then synchronizing the two</title>
<description>Hi, I create a new RAMDirectory based upon an FSDirectory. After a few modifications, I would like to synchronize the two. Some on the mailing list provided</description>
<pubDate>08 Jan  2012 20:04:48 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142682</link>
</item><item>
<title>Problem using custom-separator in UpdateCSV ( in solr )</title>
<description>I am trying to add a document to a Solr index via: $&amp;gt; curl &amp;quot;http://localhost:8983/solr/update/csv?commit=true&amp;amp;fieldnames=id,title_s&amp;amp;separator=%09&amp;quot; --</description>
<pubDate>08 Jan  2012 01:57:23 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142641</link>
</item><item>
<title>Strategy for large index files</title>
<description>Hi, my servlet application is running a large index of 20G. I don&amp;#039;t think it can be loaded to RAM at one time. What are the general strategies to imp</description>
<pubDate>07 Jan  2012 21:32:33 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142639</link>
</item><item>
<title>question about SearcherManager in version 3.5.0</title>
<description>Hi, I&amp;#039;m writing a normal web-search application with Lucene 3.5.0. In version 3.5.0 Lucene provides SearcherManager to manage multithreaded searching.</description>
<pubDate>06 Jan  2012 19:44:23 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142614</link>
</item><item>
<title>Shared IndexWriter does not increase speed</title>
<description>Hi, I am trying to use a shared IndexWriter instance for a multi-threaded application. Surprisingly, this underperforms compared with creating a writer instance</description>
<pubDate>06 Jan  2012 19:43:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142589</link>
</item><item>
<title>Re: Why read past EOF</title>
<description>and my IndexWriter create is: IndexWriterConfig indexWriterConfig = new IndexWriterConfig(Version.LUCENE_34, getAnalyzer()); indexWriterConfig.setOpe</description>
<pubDate>06 Jan  2012 18:55:54 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142615</link>
</item><item>
<title>Why read past EOF</title>
<description>Hi, I use Lucene 3.4.0 in a search project, but have encountered a problem and I don&amp;#039;t know how to resolve it. I index and it runs well, but one week or two (it</description>
<pubDate>06 Jan  2012 18:28:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142617</link>
</item><item>
<title>Using dismax features in Lucene</title>
<description>Just reading Apache Solr Enterprise Search Server and was interested in pages 152, 153 dismax and DisjunctionMaxQuery and automatic Phrase Boosting.</description>
<pubDate>06 Jan  2012 14:52:43 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142586</link>
</item><item>
<title>Heads Up - Index File Format Change on Trunk</title>
<description>Folks, I just committed LUCENE-3628 [1] which cuts over Norms to DocValues. This is an index file format change and if you are using trunk you need to</description>
<pubDate>05 Jan  2012 10:36:59 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142531</link>
</item><item>
<title>Re: IndexDocValues and storing Stats</title>
<description>Hi, I am experimenting with the Lucene trunk (aka 4.0), especially with the new IndexDocValues feature. I am trying to store some query-independent s</description>
<pubDate>04 Jan  2012 04:15:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142409</link>
</item><item>
<title>Inheritance hierarchy in the contrib-queryparser package</title>
<description>Hi folks, I was recommended to use PrecedenceQueryParser if I want boolean precedence in my queries. While examining this class, I have noticed that</description>
<pubDate>04 Jan  2012 02:40:09 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142408</link>
</item><item>
<title>frequent keyword computation within a search (and time interval)</title>
<description>I have a requirement where reads and writes are quite high ( @ 100-500 per-sec ). A document has the following fields : timestamp, unique-docid, cont</description>
<pubDate>03 Jan  2012 21:17:30 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/java-user/142394</link>
</item>
</channel>
</rss>

