
Mailing List Archive: Lucene: General

Whole unfiltered content in response document field

 

 



a2701511 at jnxjn

May 6, 2011, 8:33 AM


Hi, I have a question about the content of document fields. My configuration
works so far: I index a database with DIH and have configured an index
analyzer as follows:

<analyzer type="index">
  <tokenizer class="solr.WhitespaceTokenizerFactory"/>
  <filter class="solr.StopFilterFactory"
          ignoreCase="true"
          words="stopwords.txt"
          enablePositionIncrements="true"/>
  <filter class="solr.WordDelimiterFilterFactory"
          generateWordParts="1" generateNumberParts="1" catenateWords="1"
          catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"/>
  <filter class="solr.LowerCaseFilterFactory"/>
</analyzer>

...

<fields>
  <field name="id" type="int" indexed="true" stored="true" required="true"/>
  <field name="text" type="text" indexed="true" stored="true"/>
</fields>

In the analysis view my filters work properly: at the end of the filter
chain only the tokens I am interested in remain. But when I search with
Solr, the response contains the whole content of the indexed database field.
The field contains stopwords, whitespace, uppercase letters and so on. When I
search for stopwords, I can find them. I would have expected the field in the
response document to contain only the filtered content, not the original raw
content that I wanted to index.
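
The situation described above can be sketched in a few lines of Python. This is a toy model, not Solr code: the ToyIndex class, the analyze function and the STOPWORDS set are hypothetical names, the stopword list stands in for stopwords.txt, and the word-delimiter step is a rough approximation that ignores the catenate options. It assumes that a field with stored="true" keeps the original value verbatim, while the analyzer chain only shapes the terms that are indexed for searching.

```python
import re

# Hypothetical stand-in for the contents of stopwords.txt.
STOPWORDS = {"the", "a", "and"}


def analyze(text):
    """Rough approximation of the index-time chain from the config."""
    # WhitespaceTokenizer: split on whitespace only.
    tokens = text.split()
    # StopFilter with ignoreCase="true".
    tokens = [t for t in tokens if t.lower() not in STOPWORDS]
    # Simplified WordDelimiterFilter: split on non-alphanumerics,
    # case changes and letter/digit boundaries (no catenation).
    parts = []
    for t in tokens:
        parts.extend(re.findall(r"[A-Z]?[a-z]+|[A-Z]+|[0-9]+", t))
    # LowerCaseFilter.
    return [p.lower() for p in parts]


class ToyIndex:
    """Keeps analyzed terms (indexed) and raw text (stored) separately."""

    def __init__(self):
        self.postings = {}  # term -> set of document ids
        self.stored = {}    # document id -> original, unanalyzed text

    def add(self, doc_id, text):
        self.stored[doc_id] = text  # stored="true": verbatim copy
        for term in analyze(text):  # indexed="true": analyzed terms
            self.postings.setdefault(term, set()).add(doc_id)

    def search(self, term):
        # Matching uses the analyzed terms; the response returns stored text.
        return [self.stored[d] for d in sorted(self.postings.get(term, ()))]


index = ToyIndex()
index.add(1, "The PowerShot and a Wi-Fi Camera")
print(analyze("The PowerShot and a Wi-Fi Camera"))
print(index.search("shot"))
```

In this sketch the analyzed terms are only used for matching; the text returned for a hit is the stored original, including stopwords and uppercase letters.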

Is this normal behaviour? Do I understand Solr correctly?

Many thanks!


--
View this message in context: http://lucene.472066.n3.nabble.com/Whole-unfiltered-content-in-response-document-field-tp2908685p2908685.html
Sent from the Lucene - General mailing list archive at Nabble.com.
