Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Lucene: Java-User

Sorting on tokenized fields

 

 

Lucene java-user RSS feed   Index | Next | Previous | View Threaded


michael_prichard at mac

Jan 8, 2008, 10:22 AM

Post #1 of 6 (1137 views)
Permalink
Sorting on tokenized fields

Is it possible to sort on a tokenized field? For example, I break
email address into pieces, i.e.

michael.prichard [at] email

becomes

michael.prichard [at] email
michael.prichard
michael
prichard
email.com
email


so when sorting on this field I get some strange results. Do I need
to create another field just for sorting? Or are there other ways to
do it?

Thanks!
Michael

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
For additional commands, e-mail: java-user-help [at] lucene


michael_prichard at mac

Jan 8, 2008, 12:26 PM

Post #2 of 6 (1050 views)
Permalink
Re: Sorting on tokenized fields [In reply to]

ok... i should read the manual more often.

i went ahead and just added untokenized, unstored sort fields....

question, if I put a field in to sort of but say I have not indexed
any as of yet...will the Sort fail? For example, say I have a BCC
field and nothing has been indexed with that yet....I run a sort on
BCC.....I think it fails now...anyway around that?

On Jan 8, 2008, at 1:22 PM, Michael Prichard wrote:

> Is it possible to sort on a tokenized field? For example, I break
> email address into pieces, i.e.
>
> michael.prichard [at] email
>
> becomes
>
> michael.prichard [at] email
> michael.prichard
> michael
> prichard
> email.com
> email
>
>
> so when sorting on this field I get some strange results. Do I
> need to create another field just for sorting? Or are there other
> ways to do it?
>
> Thanks!
> Michael
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
> For additional commands, e-mail: java-user-help [at] lucene
>


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
For additional commands, e-mail: java-user-help [at] lucene


ryantxu at gmail

Jan 8, 2008, 1:13 PM

Post #3 of 6 (1054 views)
Permalink
Re: Sorting on tokenized fields [In reply to]

what do you mean by "fail"? -- there is the sortMissingLast attribute


Michael Prichard wrote:
> ok... i should read the manual more often.
>
> i went ahead and just added untokenized, unstored sort fields....
>
> question, if I put a field in to sort of but say I have not indexed any
> as of yet...will the Sort fail? For example, say I have a BCC field and
> nothing has been indexed with that yet....I run a sort on BCC.....I
> think it fails now...anyway around that?
>
> On Jan 8, 2008, at 1:22 PM, Michael Prichard wrote:
>
>> Is it possible to sort on a tokenized field? For example, I break
>> email address into pieces, i.e.
>>
>> michael.prichard [at] email
>>
>> becomes
>>
>> michael.prichard [at] email
>> michael.prichard
>> michael
>> prichard
>> email.com
>> email
>>
>>
>> so when sorting on this field I get some strange results. Do I need
>> to create another field just for sorting? Or are there other ways to
>> do it?
>>
>> Thanks!
>> Michael
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
>> For additional commands, e-mail: java-user-help [at] lucene
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
> For additional commands, e-mail: java-user-help [at] lucene
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
For additional commands, e-mail: java-user-help [at] lucene


michael_prichard at mac

Jan 8, 2008, 1:21 PM

Post #4 of 6 (1044 views)
Permalink
Re: Sorting on tokenized fields [In reply to]

Meaning that it says "field is not indexed". Where is
sortMissingLastAttribute?

thanks.

On Jan 8, 2008, at 4:13 PM, Ryan McKinley wrote:

> what do you mean by "fail"? -- there is the sortMissingLast attribute
>
>
> Michael Prichard wrote:
>> ok... i should read the manual more often.
>> i went ahead and just added untokenized, unstored sort fields....
>> question, if I put a field in to sort of but say I have not
>> indexed any as of yet...will the Sort fail? For example, say I
>> have a BCC field and nothing has been indexed with that yet....I
>> run a sort on BCC.....I think it fails now...anyway around that?
>> On Jan 8, 2008, at 1:22 PM, Michael Prichard wrote:
>>> Is it possible to sort on a tokenized field? For example, I
>>> break email address into pieces, i.e.
>>>
>>> michael.prichard [at] email
>>>
>>> becomes
>>>
>>> michael.prichard [at] email
>>> michael.prichard
>>> michael
>>> prichard
>>> email.com
>>> email
>>>
>>>
>>> so when sorting on this field I get some strange results. Do I
>>> need to create another field just for sorting? Or are there
>>> other ways to do it?
>>>
>>> Thanks!
>>> Michael
>>>
>>> --------------------------------------------------------------------
>>> -
>>> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
>>> For additional commands, e-mail: java-user-help [at] lucene
>>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
>> For additional commands, e-mail: java-user-help [at] lucene
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
> For additional commands, e-mail: java-user-help [at] lucene
>


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
For additional commands, e-mail: java-user-help [at] lucene


ryantxu at gmail

Jan 8, 2008, 1:26 PM

Post #5 of 6 (1051 views)
Permalink
Re: Sorting on tokenized fields [In reply to]

my mistake, I thought I was looking at the solr mailing list ;)

If you change your analyzer, it does not change the tokens that are
already in the index -- you will need to re-index for any changes to
take effect.

ryan


Michael Prichard wrote:
> Meaning that it says "field is not indexed". Where is
> sortMissingLastAttribute?
>
> thanks.
>
> On Jan 8, 2008, at 4:13 PM, Ryan McKinley wrote:
>
>> what do you mean by "fail"? -- there is the sortMissingLast attribute
>>
>>
>> Michael Prichard wrote:
>>> ok... i should read the manual more often.
>>> i went ahead and just added untokenized, unstored sort fields....
>>> question, if I put a field in to sort of but say I have not indexed
>>> any as of yet...will the Sort fail? For example, say I have a BCC
>>> field and nothing has been indexed with that yet....I run a sort on
>>> BCC.....I think it fails now...anyway around that?
>>> On Jan 8, 2008, at 1:22 PM, Michael Prichard wrote:
>>>> Is it possible to sort on a tokenized field? For example, I break
>>>> email address into pieces, i.e.
>>>>
>>>> michael.prichard [at] email
>>>>
>>>> becomes
>>>>
>>>> michael.prichard [at] email
>>>> michael.prichard
>>>> michael
>>>> prichard
>>>> email.com
>>>> email
>>>>
>>>>
>>>> so when sorting on this field I get some strange results. Do I need
>>>> to create another field just for sorting? Or are there other ways
>>>> to do it?
>>>>
>>>> Thanks!
>>>> Michael
>>>>
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
>>>> For additional commands, e-mail: java-user-help [at] lucene
>>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
>>> For additional commands, e-mail: java-user-help [at] lucene
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
>> For additional commands, e-mail: java-user-help [at] lucene
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
> For additional commands, e-mail: java-user-help [at] lucene
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
For additional commands, e-mail: java-user-help [at] lucene


cdoronc at gmail

Jan 8, 2008, 1:37 PM

Post #6 of 6 (1041 views)
Permalink
Re: Sorting on tokenized fields [In reply to]

Hi Michael, I think you mean the exception thrown when you
search and sort with a field that was not yet indexed:

RuntimeException: field "BBC" does not appear to be indexed

I think the current behavior is correct, otherwise an application
might (by a bug) attempt to sort by a wrong field, that not only
does not exist yet but will never exist. To prevent hidden bugs
cases like this it is important to throw an exception in this case.
Do you agree?

On Jan 8, 2008 11:21 PM, Michael Prichard <michael_prichard [at] mac> wrote:

> Meaning that it says "field is not indexed". Where is
> sortMissingLastAttribute?
>
> thanks.
>
> On Jan 8, 2008, at 4:13 PM, Ryan McKinley wrote:
>
> > what do you mean by "fail"? -- there is the sortMissingLast attribute
> >
> >
> > Michael Prichard wrote:
> >> ok... i should read the manual more often.
> >> i went ahead and just added untokenized, unstored sort fields....
> >> question, if I put a field in to sort of but say I have not
> >> indexed any as of yet...will the Sort fail? For example, say I
> >> have a BCC field and nothing has been indexed with that yet....I
> >> run a sort on BCC.....I think it fails now...anyway around that?
> >> On Jan 8, 2008, at 1:22 PM, Michael Prichard wrote:
> >>> Is it possible to sort on a tokenized field? For example, I
> >>> break email address into pieces, i.e.
> >>>
> >>> michael.prichard [at] email
> >>>
> >>> becomes
> >>>
> >>> michael.prichard [at] email
> >>> michael.prichard
> >>> michael
> >>> prichard
> >>> email.com
> >>> email
> >>>
> >>>
> >>> so when sorting on this field I get some strange results. Do I
> >>> need to create another field just for sorting? Or are there
> >>> other ways to do it?
> >>>
> >>> Thanks!
> >>> Michael
> >>>
> >>> --------------------------------------------------------------------
> >>> -
> >>> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
> >>> For additional commands, e-mail: java-user-help [at] lucene
> >>>
> >> ---------------------------------------------------------------------
> >> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
> >> For additional commands, e-mail: java-user-help [at] lucene
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
> > For additional commands, e-mail: java-user-help [at] lucene
> >
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
> For additional commands, e-mail: java-user-help [at] lucene
>
>

Lucene java-user RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.