<?xml version="1.0" encoding="iso-8859-1" ?>
<?xml-stylesheet title="XSL_formatting" type="text/xsl" href="/images/lists/rssstyle2.xsl"?>
<rss version="2.0">
<channel>
<title>Lucene | General</title>
<description>Mailing List Archive by Gossamer Threads</description>
<link>http://www.gossamer-threads.com/lists/lucene/general/</link>
<language>en-us</language>
<copyright>(c) Gossamer Threads Inc. All rights reserved.</copyright>
<lastBuildDate>22 Nov  2008 13:48:25 -0800</lastBuildDate>
<ttl>120</ttl>
<image>
<title>Gossamer Threads | Lucene | General</title>
<width>75</width>
<height>23</height>
<link>http://www.gossamer-threads.com/lists/lucene/general/</link>
<url>http://www.gossamer-threads.com/images/lists/rss_logo.jpg</url>
</image>
<item>
<title>parse url addresses and boost by field</title>
<description>Hello all, I have two questions: 1. Lucene doesn&amp;#039;t parse url addresses well for me. It stores it in (almost) full format: www.address.net or www.add</description>
<pubDate>09 Nov  2008 02:48:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/67043</link>
</item><item>
<title>How to get the keywords in articles which hitted by search?</title>
<description>For instance, I use search term &amp;quot;lucene OR java OR keyword&amp;quot; to create a query, and the result comes back with a bunch of articles, my question is how</description>
<pubDate>05 Nov  2008 08:03:35 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66929</link>
</item><item>
<title>[VOTE] [RESULT] Graduate Tika to a Lucene subproject (Graduation Approval Vote)</title>
<description>Hi, On Fri, Oct 24, 2008 at 3:07 PM, Jukka Zitting &amp;lt;jukka.zitting@gmail.com&amp;gt; wrote: &amp;gt; Please vote on approving the graduation of Tika. This vote is o</description>
<pubDate>27 Oct  2008 23:38:28 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66695</link>
</item><item>
<title>Simplest way to check for an exact match on an tokenized/stored field?</title>
<description>Hi group. I have a Lucene index that contains a bunch of text documents, which are both tokenized (using the standard analyzer, not KeywordAnalyzer) a</description>
<pubDate>26 Oct  2008 15:09:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66659</link>
</item><item>
<title>What happend hits</title>
<description>Hello, Â  I&amp;#039;m sure this question has been asked before but I can&amp;#039;t find the answer. I&amp;#039;ve just update to the latest version of lucene and it has left m</description>
<pubDate>25 Oct  2008 08:23:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66645</link>
</item><item>
<title>[VOTE] Graduate Tika to a Lucene subproject (Graduation Approval Vote)</title>
<description>Hi, The Tika community has voted [1] to request and the Lucene PMC has accepted [2] graduating Apache Tika to a Lucene subproject. As described in in</description>
<pubDate>24 Oct  2008 07:07:48 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66604</link>
</item><item>
<title>search starts with</title>
<description>Hi Firends, I am new to this forum.I really appriciate your help. I wanted to form a &amp;quot;start with query&amp;quot; in lucene. Lets say for instance i want to get</description>
<pubDate>22 Oct  2008 14:05:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66553</link>
</item><item>
<title>[VOTE] Graduate Tika to a Lucene subproject (Subproject Acceptance Vote)</title>
<description>Hi, As summarized below, the incubating Tika project has voted to indicate their willingness to graduate into a Lucene subproject. We feel that Tika</description>
<pubDate>20 Oct  2008 17:17:47 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66499</link>
</item><item>
<title>FYI: Tika is voting to graduate to a Lucene subproject</title>
<description>Dear Lucene and Incubator PMCs, Based on previous discussions about Tika&amp;#039;s future and current status, I have now started the graduation process for T</description>
<pubDate>16 Oct  2008 16:31:37 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66408</link>
</item><item>
<title>documentation for V1.0.2</title>
<description>Where can I get doc for 1.0.2? The lucene site apparently only goes back to 1.4.3</description>
<pubDate>16 Oct  2008 12:49:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66402</link>
</item><item>
<title>question about wildcard like search</title>
<description>I need to do a query where i&amp;#039;m looking for strings that are embedded into a single word in one of the fields. In other words, a field my have a phras</description>
<pubDate>16 Oct  2008 11:55:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66395</link>
</item><item>
<title>lucene 1.4 booleanquery</title>
<description>I&amp;#039;m working with an older version of lucene. In that version the BooleanQuery.add() takes three arguments. The query and two booleans -- &amp;#039;required&amp;#039;</description>
<pubDate>09 Oct  2008 10:28:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66179</link>
</item><item>
<title>[ANNOUNCE] Apache Solr Logo Contests</title>
<description>By popular demand (and after a few false starts) Solr is holding a contest to pick a new Solr logo. Full details about the contest, and how to subm</description>
<pubDate>03 Oct  2008 11:22:15 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/66030</link>
</item><item>
<title>Search Mediawiki and Intranet?</title>
<description>Hi, Currently I&amp;#039;m researching our documentation needs. Our documentations are split over several servers, including Sharepoint, our Fileserver, and a</description>
<pubDate>02 Oct  2008 02:46:51 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/65991</link>
</item><item>
<title>Updation of field/metadata value in a document</title>
<description>Hi All, I have a query regarding document updation in Lucene Index. Is there a way to &amp;quot;update&amp;quot; a Document i.e. retrieve the existing Document from t</description>
<pubDate>30 Sep  2008 03:34:14 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/65909</link>
</item><item>
<title>Lucene Index file vs. database</title>
<description>Hi, First I want to apologize if I&amp;#039;m asking something that was asked already. I tried search, but couldn&amp;#039;t find what I was looking for (or I simply</description>
<pubDate>29 Sep  2008 06:46:59 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/65881</link>
</item><item>
<title>Subjects DB Matching</title>
<description>I am studying the possibility to use Lucene in order to build a matching system for a database of subjects. The subjects are stored in records of data</description>
<pubDate>29 Sep  2008 06:12:41 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/65884</link>
</item><item>
<title>JIRA Forwarding</title>
<description>Greets, How do I get Lucy&amp;#039;s JIRA set up so that it forwards discussion to the lucy-dev list, as happens with Java Lucene&amp;#039;s JIRA and java-dev? If I r</description>
<pubDate>28 Sep  2008 21:54:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/65871</link>
</item><item>
<title>ApacheCon US promo</title>
<description>Cross-posting... Just wanted to let everyone know that there will be a number of Lucene/ Solr/Mahout/Tika related talks, training sessions, and Bird</description>
<pubDate>26 Sep  2008 11:53:59 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/65844</link>
</item><item>
<title>ANNOUNCE: Application Period Opens for Travel Assistance to ApacheCon US 2008</title>
<description>NOTE: This is a cross posted announcement to all Lucene sub-projects, please confine any replies to general@lucene. ------------- The Travel Assist</description>
<pubDate>26 Sep  2008 10:25:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/65843</link>
</item><item>
<title>Updating an index??</title>
<description>Hi all. I&amp;#039;m new to Lucene, reading Lucene in Action, and using Lucene.NET, but my question is not platform specific. I&amp;#039;m baffled about the &amp;quot;create&amp;quot; p</description>
<pubDate>17 Sep  2008 19:07:41 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/65454</link>
</item><item>
<title>[ANN] katta-0.1.0 release - distribute lucene indexes in a grid</title>
<description>After 5 month work we are happy to announce the first developer  preview release of katta. This release contains all functionality to serve a large,</description>
<pubDate>17 Sep  2008 17:06:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/65451</link>
</item><item>
<title>Preliminary, fundamental question about the demo</title>
<description>Hi, I just started with Lucene today, and the first thing I did was try out the small demo. I followed the instructions in &amp;quot;Getting started - Buildin</description>
<pubDate>08 Sep  2008 01:16:37 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/64877</link>
</item><item>
<title>Escaped boolean queries with wildcards</title>
<description>In order to support autocomplete in a location search, I&amp;#039;m taking the query string and adding a wildcard to the end. This works fine in general, but I</description>
<pubDate>04 Sep  2008 09:38:20 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/64749</link>
</item><item>
<title>GermanAnalyzer</title>
<description>Hello, i use the GermanAnalyzer. But i believe that this analyzer isnÂ´t working correct or i make an error in my code. For indexing and searching i</description>
<pubDate>03 Sep  2008 01:49:37 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/64628</link>
</item><item>
<title>Replicating Lucene Index with out SOLR</title>
<description>I have the following requirement Right now we have multiple indexes serving our web application. Our indexes are around 30 GB size. We want to repl</description>
<pubDate>27 Aug  2008 16:34:41 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/64426</link>
</item><item>
<title>We need Java Developer with Lucene experience in SSFO, CA</title>
<description>Contractor position in SSF (South San Francisco)  _____   We are looking for excellent Java developer who loves solving difficult problems. The Pr</description>
<pubDate>27 Aug  2008 15:28:55 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/64418</link>
</item><item>
<title>Local Lucene and Local Solr</title>
<description>The creators of Local Lucene and Local Solr (http://www.nsshutdown.com/projects/lucene/whitepaper/locallucene.htm ) have generously agreed to donate</description>
<pubDate>25 Aug  2008 08:41:10 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/64310</link>
</item><item>
<title>Multi Lingual indexing</title>
<description>Hi,  I am new to Lucene and got stuck while trying to accomplish a multi lingual indexing. Suppose i have two indexes, one English and one French and</description>
<pubDate>22 Aug  2008 00:03:15 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/64189</link>
</item><item>
<title>Lucene Performance and usage alternatives</title>
<description>I just made a program using the java api of Lucene. Its is working fine for my actually index size. But i am worried about performance with an biger i</description>
<pubDate>05 Aug  2008 07:21:17 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63620</link>
</item><item>
<title>[IMPORTANT] Fieldable and LUCENE-1349</title>
<description>Per https://issues.apache.org/jira/browse/LUCENE-1349, we have made an  exception to Lucene&amp;#039;s backward compatibility rules and marked  Fieldable as</description>
<pubDate>05 Aug  2008 05:34:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63616</link>
</item><item>
<title>RFO- Indexing &amp;#039;meaningfull&amp;#039; xml</title>
<description>Hello! This is a Request for Opinion targeted for the Lucene experts out there :-) I&amp;#039;m trying to get to know Lucene a bit better: After having playe</description>
<pubDate>02 Aug  2008 04:37:09 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63535</link>
</item><item>
<title>issues with wildcard search and snowball english analyzer</title>
<description>I am using SnowballAnalayzer(English). I just created one document with one field with content as &amp;quot;elephant is a big animal&amp;quot;. I searched for e*t using</description>
<pubDate>24 Jul  2008 15:39:23 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63278</link>
</item><item>
<title>How to use lucene for high search performance ?</title>
<description>Hi,   If I use lucene to execute many search requests at one time, the io operation will be the bottleneck of the performance.   So I use RAMDir</description>
<pubDate>24 Jul  2008 03:00:28 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63254</link>
</item><item>
<title>Remove from list</title>
<description>Remove from list. I want to be removed from this mailing list.</description>
<pubDate>22 Jul  2008 08:22:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63177</link>
</item><item>
<title>Matching Search Terms?</title>
<description>We have a requirement for our project where the user would like to be able to paste a bunch of terms into a search box. They want to basically &amp;quot;or&amp;quot; t</description>
<pubDate>22 Jul  2008 07:53:51 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63163</link>
</item><item>
<title>FW: lucene query parser syntax &amp;quot;escape =&amp;quot;</title>
<description>Please remove my email from the general lucene group Thanks, Tom</description>
<pubDate>21 Jul  2008 13:07:17 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63150</link>
</item><item>
<title>Basic genral information is needed.</title>
<description>Freinds, I am familar with searh engine algoritms/concepts. However, I am new to lucene. My question: How long it takes to learn/control lucence and</description>
<pubDate>21 Jul  2008 03:22:37 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63134</link>
</item><item>
<title>lucene query parser syntax &amp;quot;escape =&amp;quot;</title>
<description>i have a field that indexed called &amp;lt;summary&amp;gt;. i can see the field and it&amp;#039;s data using &amp;quot;luke&amp;quot;  .   the content is like below:::   sadf &amp;lt;body /&amp;gt; &amp;lt;body</description>
<pubDate>20 Jul  2008 09:59:39 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63126</link>
</item><item>
<title>Anyone have experince with using PDFTron and Lucene together?</title>
<description>Hello all,         I am not sure if this is the correct place to ask this question, but hopefully I don&amp;#039;t offend anyone. The company I work fo</description>
<pubDate>17 Jul  2008 09:44:29 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/63041</link>
</item><item>
<title>Retrieving term positions without storing the term vectors</title>
<description>Dear all,   Am I correct to believe that a quoted (phrase) search, like &amp;quot;red dog&amp;quot;, returns documents containing the consecutive words &amp;quot;red&amp;quot; and &amp;quot;dog</description>
<pubDate>09 Jul  2008 04:46:38 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/62764</link>
</item><item>
<title>Alternatives for opening Index</title>
<description>Hi,  I am currently working on retrieving url and contentLength of each document  found during the search. I want to retrieve it during the calcula</description>
<pubDate>07 Jul  2008 18:01:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/62713</link>
</item><item>
<title>New Versions of Lucene.net</title>
<description>Hi, A new version of Lucne.net has not been posted since april 2007 While java versions are posted rapidly. When a new version of Lucne.net is going</description>
<pubDate>02 Jul  2008 10:37:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/62552</link>
</item><item>
<title>ranking</title>
<description>I wanted to know how the ranking work in Lucene and if it is only according to the frequency or there is any other criteria   Mahy Khairy faculty o</description>
<pubDate>27 Jun  2008 17:16:29 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/62405</link>
</item><item>
<title>boosting keywords</title>
<description>Hi, I&amp;#039;m new to Lucene and this forum, so my apologies if this has been asked before, or I&amp;#039;m asking something obvious. My client has some specific re</description>
<pubDate>27 Jun  2008 09:57:48 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/62370</link>
</item><item>
<title>Ad Server using Lucene</title>
<description>Anyone aware of an F/OSS or commercial ad server engine that uses Lucene as a backend for contextual ad serving? Any pointers are very welcome. Thanks</description>
<pubDate>20 Jun  2008 21:41:06 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/62101</link>
</item><item>
<title>Should this mailing list exist?</title>
<description>Hi List, I see subscribing users constantly redirected to java-user, because there is much greater traffic over there. Does it make sense to operate</description>
<pubDate>18 Jun  2008 07:09:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/62031</link>
</item><item>
<title>Lucene is not able to index certain words of txt file converted form pdf</title>
<description>Hi I am using Lucene for indexing and searching the documents. I have an PDF (Lucene_in_action.pdf) file which i converted to txt file using PDFBox.</description>
<pubDate>18 Jun  2008 05:24:53 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/62027</link>
</item><item>
<title>Deleting Documents</title>
<description>Hello,    I&amp;#039;m having difficulty deleting documents from an index. I am using lucene 2.3.1    The program that I have created recursively searche</description>
<pubDate>17 Jun  2008 03:33:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61996</link>
</item><item>
<title>Getting irrelevant results using fuzzy query</title>
<description>Hi guys, I try to provide relevant results for the users of a lyrics site, even in the case of misspellings by indexing artist and songs with Lucene.</description>
<pubDate>17 Jun  2008 02:52:56 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61995</link>
</item><item>
<title>Result Count</title>
<description>Hi, Can i get 700 result at a time from lucene? I have tried for 100 result but it s getting delay... is there any possibilities to get 700 or more</description>
<pubDate>05 Jun  2008 06:08:40 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61757</link>
</item><item>
<title>Documentation for migration between v1.4.3 &amp;amp; V.2.3.x</title>
<description>Hello,     I am to migrate our application that uses Lucene v1.4.3 to the latest version of Lucene , as part of this effort I need to document the</description>
<pubDate>30 May  2008 15:17:12 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61682</link>
</item><item>
<title>Problem after combining queries</title>
<description>Hi all, I have to implement searching for the following criteria: &amp;quot;search all documents with &amp;#039;status&amp;#039; approved and &amp;#039;createddate&amp;#039; within the given ran</description>
<pubDate>25 May  2008 22:44:57 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61546</link>
</item><item>
<title>Boolen operators</title>
<description>Is multiple boolean operators valid in lucene query. Example str1 OR OR str2       str1 OR AND str2 are these queries valid? -- View this me</description>
<pubDate>22 May  2008 02:45:14 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61455</link>
</item><item>
<title>Multi-Processor Indexing</title>
<description>I am going to be indexing a large volume of documents (1TB worth) on a server with 8 processors. By default Lucene only seems to use one processor bu</description>
<pubDate>20 May  2008 13:26:54 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61401</link>
</item><item>
<title>Min Merge Docs</title>
<description>Hello, I am trying to improve indexing time and looking through tutorials I found that the three main variables to improve indexing time are minMerge</description>
<pubDate>20 May  2008 10:21:54 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61394</link>
</item><item>
<title>Relevence Feedback</title>
<description>Hello, I was under the impression Lucene did not come with any relevance feedback implementation, and that you needed to add it yourself. Someone to</description>
<pubDate>17 May  2008 07:35:27 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61315</link>
</item><item>
<title>Search by first term in field.</title>
<description>I have tokenized index. I need to do search to search only by prefix. So if I enter &amp;quot;java&amp;quot;, then I need to get &amp;quot;java developer&amp;quot;, but NOT &amp;quot;developed in</description>
<pubDate>16 May  2008 02:32:30 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61252</link>
</item><item>
<title>Online Question Answering demo using Lucene</title>
<description>[Apologies if you consider this as spam] Hello Lucene users and developers, I wanted to point people on this list to a Question Answering System, de</description>
<pubDate>14 May  2008 07:30:52 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61172</link>
</item><item>
<title>words close together - like google</title>
<description>hi, i am a newbie to text search, but need to evaluate lucene. my question is this: in a google query such as &amp;quot;prune scotch broom&amp;quot; it has always see</description>
<pubDate>12 May  2008 09:39:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61119</link>
</item><item>
<title>Wildcard Search over multiple fields</title>
<description>Hello, What is the best method of performing a leading and trailing wildcard search over multiple fields? Currently I performing a wildcard search on</description>
<pubDate>07 May  2008 04:14:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/61022</link>
</item><item>
<title>Welcome two new PMC members: Mike Klaas and Ryan McKinley</title>
<description>The Lucene Project Management Committee is happy to announce that two new members have been voted onto the PMC: Mike Klaas and Ryan McKinley. -Hoss</description>
<pubDate>01 May  2008 18:17:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60888</link>
</item><item>
<title>DateField Replacement?</title>
<description>I am using Lucene version 2.3 and i noticed it does&amp;#039;nt contain the class DateFilter , what class replaces dateFilter -- View this message in context:</description>
<pubDate>25 Apr  2008 10:04:58 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60724</link>
</item><item>
<title>A problem about additonal info(after some modification for lucene)</title>
<description>I modified some lucene&amp;#039;s code to make lucene have the new use like:    doc=new Document();   byte[] additionalInfo=new byte[]{&amp;#039;x&amp;#039;,&amp;#039;x&amp;#039;,&amp;#039;x&amp;#039;};</description>
<pubDate>23 Apr  2008 04:29:26 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60652</link>
</item><item>
<title>how to modify the lucene demo to make it search for the pdf format files?</title>
<description></description>
<pubDate>20 Apr  2008 23:56:02 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60599</link>
</item><item>
<title>Need addtional info for Field</title>
<description>I want to use lucene with some additional info,like: 1.index   Document additionalDoc=ew Document()   additionalDoc.add(new Field(&amp;quot;field&amp;quot;,&amp;quot;AA BB</description>
<pubDate>20 Apr  2008 23:15:29 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60598</link>
</item><item>
<title>A question about ParalellMultiSearcher and RMI</title>
<description>I want to use RAMDirectory to raise the peformance of lucene. So I cut the index dir to 3 smaller index dirs(1G one index dir). Then I use RAMDirector</description>
<pubDate>19 Apr  2008 01:37:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60570</link>
</item><item>
<title>a question about MultiSearcher</title>
<description>If I search in multiple dirs,I can use MultiSearcher. The idf about a term is log(numDocs/(docFreq+1)+1). In the two kinds of condition:   1.only 1</description>
<pubDate>18 Apr  2008 04:42:30 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60527</link>
</item><item>
<title>Looking for duplicate names</title>
<description>I&amp;#039;m new to Lucene, and would like to use it to find duplicate names in a contact list. Is Lucene a good fit? We have a form where a user enters a com</description>
<pubDate>15 Apr  2008 12:04:31 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60445</link>
</item><item>
<title>the question about the example of lucene in action</title>
<description>lucene in action has a instance about explaining indexwriter&amp;#039;s MergeFactor,MaxMergeDocs parameter on 2.7.1. Because of lucene&amp;#039;s edition, imodified it</description>
<pubDate>14 Apr  2008 00:19:36 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60406</link>
</item><item>
<title>Lucene on Hadoop</title>
<description>Hi, From this URL http://www.mail-archive.com/hadoop-user@lucene.apache.org/msg00998.html I see that Hadoop is not suitable for incremental updates if</description>
<pubDate>09 Apr  2008 02:34:19 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60288</link>
</item><item>
<title>a problem about the deprecated method</title>
<description>there is difference between lucene edition 1.4.3 and 2.0.0. e.g class FSDirectory&amp;#039;s method getDirectory(File indexDir,boolean XX). i am studying luce</description>
<pubDate>08 Apr  2008 22:12:25 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60281</link>
</item><item>
<title>Problem with Russian Language in Lucene 2.0.0.4</title>
<description>Hi all! I am indexing Russian text with that code. I have a problem, when I try to search for words in different cases. Search is productive, only if</description>
<pubDate>04 Apr  2008 06:00:23 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/60174</link>
</item><item>
<title>Improving indexing and some questions</title>
<description>Dear, I have ideas for improving indexing for web search. I have written the tutorial for IPSI conference in Opatija about ranking in search engines:</description>
<pubDate>24 Mar  2008 17:17:40 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/59856</link>
</item><item>
<title>how to control the disk size of the indices</title>
<description>Hi all, I wanted to ask the list whether there is an easy and efficient way to manage the size (in bytes) of a lucene index stored on disk. Basicall</description>
<pubDate>24 Mar  2008 16:33:26 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/59855</link>
</item><item>
<title>Google Summer of Code</title>
<description>Dear, I have idea to implement distributed version of Lucene for Google Summer of Code. Distributed version would improve speed of ranking. I have al</description>
<pubDate>19 Mar  2008 17:02:29 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/59764</link>
</item><item>
<title>Lucene: Searching through multiple records.</title>
<description>Hi there! I am not actually a programmer (more like statistician). I have a text mining problem where I need to search for certain key-words in a par</description>
<pubDate>19 Mar  2008 04:11:38 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/59707</link>
</item><item>
<title>Similarity Search</title>
<description>Hello, we are using lucene in one of our applications for fulltext search, which works very vell. I&amp;#039;am now interested in some similarity search for</description>
<pubDate>14 Mar  2008 01:14:02 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/59488</link>
</item><item>
<title>Similarity Class - A couple of questions</title>
<description>Hello all! As of lately, I&amp;#039;ve been interested in understanding how Lucene scores my documents, and so I&amp;#039;ve asked a couple of questions in the mailing</description>
<pubDate>13 Mar  2008 13:40:18 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/59459</link>
</item><item>
<title>question</title>
<description>Hello All: I want to rewrite the Lucene with cocoa. What should I do firstly???  Stone</description>
<pubDate>11 Mar  2008 01:28:18 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/59254</link>
</item><item>
<title>Lucene&amp;#039;s Scoring &amp;amp; Regular TF-IDF</title>
<description>Hello all! I&amp;#039;ve asked here a few days ago if I could get a &amp;quot;raw&amp;quot; tf-idf score out of lucene&amp;#039;s methods. I was kindly advised to hack my way through th</description>
<pubDate>10 Mar  2008 15:05:14 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/59236</link>
</item><item>
<title>Searching chomps my terms..</title>
<description>Hello all! I&amp;#039;m trying to do a search based on alphanumeric terms such as: cit2, mit12, hiv17, etc. However, when submitting them to Lucene, it only se</description>
<pubDate>10 Mar  2008 14:57:03 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/59235</link>
</item><item>
<title>Getting TF-IDF from a match</title>
<description>Hello all! I&amp;#039;m interested in getting the TF-IDF values of a given search for a given document. I can &amp;quot;see&amp;quot; the parts of the scoring formula through t</description>
<pubDate>08 Mar  2008 11:55:26 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/59165</link>
</item><item>
<title>Lucene - Search Optimization Problem</title>
<description>Hello all! I&amp;#039;ve finally got round to setup Lucene 2.3.0 in my two production boxes (Ubuntu 7.10 and Windows XP), after quite a trouble with the JCC c</description>
<pubDate>24 Feb  2008 07:12:30 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/58760</link>
</item><item>
<title>Lucene or Nutch???</title>
<description>Hi all,  I am new to lucene and nutch. I am doing a project on an archiving web portal which allow individual user to index document (from file syste</description>
<pubDate>17 Feb  2008 12:14:05 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/58562</link>
</item><item>
<title>Retrieving a document from a &amp;quot;keyword&amp;quot; field</title>
<description>Hello all, The documents of my (mysql) database are indexed by Lucene, and I save (as a &amp;quot;Keyword&amp;quot;) the database id in the &amp;quot;internal_id&amp;quot; field of the</description>
<pubDate>15 Feb  2008 02:57:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/58489</link>
</item><item>
<title>Boosting documents individually for each user.</title>
<description>I have web-site with a bunch of users. There are documents, that can be searched by users. I want to have result order to be slightly different for ev</description>
<pubDate>14 Feb  2008 11:32:52 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/58470</link>
</item><item>
<title>Indexing the fields</title>
<description>Hi , I am using PyLucene in my program. I want to search for all the words with the &amp;quot;sim&amp;quot; in it. So, I typed &amp;quot;sim*&amp;quot; in my query. It is also giving me</description>
<pubDate>12 Feb  2008 03:40:24 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/58355</link>
</item><item>
<title>Lucene-based Distributed Index Leveraging Hadoop</title>
<description>There have been several proposals for a Lucene-based distributed index architecture. 1) Doug Cutting&amp;#039;s &amp;quot;Index Server Project Proposal&amp;quot; at   http://</description>
<pubDate>06 Feb  2008 10:57:22 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/58087</link>
</item><item>
<title>Multiple Indexes - Merge?</title>
<description>Hello all! I am new to Lucene and I&amp;#039;m using PyLucene&amp;#039;s extension in my project. I&amp;#039;m using it to index a large volume of data (index size estimated in</description>
<pubDate>04 Feb  2008 06:24:07 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57959</link>
</item><item>
<title>multiple instances of fields or attributes</title>
<description>Hi. I am totally new to Lucene, and currently investigating the usage of Lucene for a new development project. In fact, for evaluation I am using t</description>
<pubDate>02 Feb  2008 09:27:13 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57918</link>
</item><item>
<title>Checking if a given document is indexed</title>
<description>Hello all, I&amp;#039;m using pylucene to index documents and I&amp;#039;m interested in checking if a given document from the list A (that is going to be indexed) is</description>
<pubDate>01 Feb  2008 08:41:48 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57897</link>
</item><item>
<title>Luke with latest version of Java...</title>
<description>Hi all, I&amp;#039;m failing to get the latest version of Luke (0.7.1) working with the latest version of Java (1.6 update 4). It just consumes CPU without ge</description>
<pubDate>30 Jan  2008 02:15:46 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57917</link>
</item><item>
<title>Retrieve all documents sorted by date</title>
<description>Hi, How can i do that ? I know how to sort the search results but i dont know which query to use in order to retrieve all docs. Moran -- View this</description>
<pubDate>28 Jan  2008 09:36:44 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57726</link>
</item><item>
<title>Searching sub string</title>
<description>Hi, I&amp;#039;d like to perform a substring search (query like *foo*). As you better know, it is not possible to use * as the first character. Is there any o</description>
<pubDate>25 Jan  2008 06:59:49 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57627</link>
</item><item>
<title>Mahout Machine Learning Project Launches</title>
<description>(Apologies for cross-posting) The Lucene PMC is pleased to announce the creation of the Mahout  Machine Learning project, located at http://lucene.a</description>
<pubDate>25 Jan  2008 04:25:24 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57599</link>
</item><item>
<title>Full-Text Search in a Relational Model</title>
<description>Hi, (Warning, not for the weak-hearted) I&amp;#039;m currently working on a project where we have a large and complex data model, related to Genomics. We are</description>
<pubDate>24 Jan  2008 04:29:34 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57520</link>
</item><item>
<title>Apache Lucene Liminitation and disadvanages?</title>
<description>Hi everybody, These days i am reasearching on &amp;quot;Apache Lucene&amp;quot; and thier features, As a newbee to this technology, i wanted to know what kind of limi</description>
<pubDate>21 Jan  2008 02:41:32 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57341</link>
</item><item>
<title>Exact match syntax</title>
<description>Lucene has been working well for us until the last few days when we hit a snag.  We&amp;#039;re trying to build a query to search a multiple value property to</description>
<pubDate>17 Jan  2008 11:47:27 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57180</link>
</item><item>
<title>Announcing sixearch.org</title>
<description>Some time back we announced the first public prototype of 6S, a peer application for social, distributed, adaptive Web search. Thanks to the feedback</description>
<pubDate>16 Jan  2008 20:46:21 -0800</pubDate>
<link>http://www.gossamer-threads.com/lists/lucene/general/57136</link>
</item>
</channel>
</rss>
