Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Lucene: Java-Dev

more field advocacy

 

 

Lucene java-dev RSS feed   Index | Next | Previous | View Threaded


kalle at snigel

May 7, 2006, 3:00 AM

Post #1 of 2 (647 views)
Permalink
more field advocacy

The TermFreqVector.getTermFrequencies always return the same value as
TermDocs.freq, even if a field was set not to be added to the term
frequency vector.

Is this really the way it should be? It makes the fields even more
confusing than my prior post on the subject.


doc.add(new Field("foo", "bar", Store.NO, Index.TOKENIZED,
TermVector.YES));
doc.add(new Field("foo", "bar", Store.NO, Index.TOKENIZED,
TermVector.NO));

Vector frequency of [foo, bar] is 2. I would expect it to be 1 or a
field setting inconsistency warning.

Perhaps this is true also with Store and Index? I'll have to take a look
at that.

I'm OK with the settings being by field name, but the current design
really don't imply that. It implies multidimensional settings, one per
instance of Field. And that could be a nice thing to have.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe [at] lucene
For additional commands, e-mail: java-dev-help [at] lucene


kalle at snigel

May 8, 2006, 4:13 PM

Post #2 of 2 (575 views)
Permalink
Re: more field advocacy [In reply to]

On Sun, 2006-05-07 at 12:00 +0200, karl wettin wrote:
>
> doc.add(new Field("foo", "bar", Store.NO, Index.TOKENIZED,
> TermVector.YES));
> doc.add(new Field("foo", "bar", Store.NO, Index.TOKENIZED,
> TermVector.NO));
>
> Vector frequency of [foo, bar] is 2. I would expect it to be 1 or a
> field setting inconsistency warning.
>
> Perhaps this is true also with Store and Index? I'll have to take a
> look at that.

"Once setting, always setting" goes for Index too.

If someone can tell me what settings are global and what settings are
not global I'll patch and depricatate:

[Document]<#>-- {0..*} ->[FieldValue]--->[Field +index +vector]


What is per value and what is per field?

private boolean storeTermVector = false;
private boolean storeOffsetWithTermVector = false;
private boolean storePositionWithTermVector = false;
private boolean omitNorms = false;
private boolean isStored = false;
private boolean isIndexed = true;
private boolean isTokenized = true;
private boolean isBinary = false;
private boolean isCompressed = false;


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe [at] lucene
For additional commands, e-mail: java-dev-help [at] lucene

Lucene java-dev RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.