
Mailing List Archive: Lucene: Java-User

nonalphanumeric characters

 

 



soshima at business

Nov 8, 2001, 4:32 PM

Post #1 of 4
nonalphanumeric characters

My search works for At&t, with the ampersand in the middle. However, it
doesn't work for e-commerce, with the dash in the middle. Is there anything
I have to do with the analyzers/filters to fix this? Thanks.

-scott

--
To unsubscribe, e-mail: <mailto:lucene-user-unsubscribe [at] jakarta>
For additional commands, e-mail: <mailto:lucene-user-help [at] jakarta>


lebowitz at finaltouch

Nov 8, 2001, 5:24 PM

Post #2 of 4
Re: nonalphanumeric characters [In reply to]

Yes... you should extend one of the standard analyzers and provide a
mechanism to include the dash (-) within words. Use this analyzer to index
and search.
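
A minimal sketch of that idea in plain Java, without any Lucene classes. The
class name DashAwareTokenizer and its methods are hypothetical, and the rule
used here (a dash or ampersand counts as part of a word only when it sits
between two letters or digits) is an assumption for illustration, not what
any standard analyzer does.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of Lucene.
class DashAwareTokenizer {

    // Characters that may join two halves of a word (assumed set).
    static boolean isJoiner(char c) {
        return c == '-' || c == '&';
    }

    // A character belongs to a token if it is a letter or digit, or if it is
    // a joiner with a letter or digit directly on both sides.
    static boolean isTokenChar(String text, int i) {
        char c = text.charAt(i);
        if (Character.isLetterOrDigit(c)) {
            return true;
        }
        return isJoiner(c)
                && i > 0 && Character.isLetterOrDigit(text.charAt(i - 1))
                && i + 1 < text.length() && Character.isLetterOrDigit(text.charAt(i + 1));
    }

    // Splits text into lowercased tokens using the test above.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            if (isTokenChar(text, i)) {
                current.append(Character.toLowerCase(text.charAt(i)));
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // "e-commerce" and "At&t" stay whole; the free-standing dash is dropped.
        System.out.println(tokenize("At&t sells e-commerce - and more"));
    }
}
```

In a real analyzer the same test would live in the tokenizer's character
check, and the same analyzer would be used for both indexing and searching.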


Rob


soshima at business

Nov 8, 2001, 5:40 PM

Post #3 of 4
RE: nonalphanumeric characters [In reply to]

Thanks. Do you know whether nonalphanumeric characters are excluded by default?
Since & works but - doesn't, I am assuming they are not. I am not sure how
to explicitly include a nonalphanumeric character. If you have any
examples of this, that would be great. Many thanks again.



lebowitz at finaltouch

Nov 8, 2001, 8:39 PM

Post #4 of 4
Re: nonalphanumeric characters [In reply to]

It all depends on the analyzer and filters that you are using to index your
text. Search the archive for this list for my last name (lebowitz). I
posted the source code for a slightly modified analyzer and filter that
should give you an example you can adapt for your own purposes. It
was for an AlphaNumericAnalyzer, I believe. You'd have to add
tests for the characters you have in mind, not just the standard
digits and letters.
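
As a rough sketch of what such additional character tests might look like,
here is a small self-contained Java example. It is not the analyzer from the
archive and does not use Lucene; the class ExtraCharIndex and its methods are
hypothetical. It accepts a configurable string of extra characters and runs
the same tokenization on documents and queries, which is the reason the same
analyzer has to be used to index and to search.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration only; not part of Lucene.
class ExtraCharIndex {
    private final String extraChars; // extra characters kept inside tokens, e.g. "-&"
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    ExtraCharIndex(String extraChars) {
        this.extraChars = extraChars;
    }

    // The character test: letters and digits, plus the configured extras.
    // Note that with this simple test a free-standing "-" becomes its own token.
    boolean isTokenChar(char c) {
        return Character.isLetterOrDigit(c) || extraChars.indexOf(c) >= 0;
    }

    // Splits text into lowercased tokens using the character test.
    List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (char c : text.toCharArray()) {
            if (isTokenChar(c)) {
                current.append(Character.toLowerCase(c));
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Indexing: record which documents contain each token.
    void add(int docId, String text) {
        for (String token : tokenize(text)) {
            postings.computeIfAbsent(token, k -> new TreeSet<>()).add(docId);
        }
    }

    // Searching: tokenize the query the same way, then keep the documents
    // that contain every query token.
    Set<Integer> search(String query) {
        Set<Integer> result = null;
        for (String token : tokenize(query)) {
            Set<Integer> docs = postings.getOrDefault(token, new TreeSet<Integer>());
            if (result == null) {
                result = new TreeSet<>(docs);
            } else {
                result.retainAll(docs);
            }
        }
        return result == null ? new TreeSet<Integer>() : result;
    }

    public static void main(String[] args) {
        ExtraCharIndex index = new ExtraCharIndex("-&");
        index.add(1, "We do e-commerce");
        index.add(2, "Commerce law");
        System.out.println(index.search("E-Commerce"));
        System.out.println(index.search("commerce"));
    }
}
```

With "-" among the extra characters, "e-commerce" is indexed as one term, so
a query for plain "commerce" no longer matches that document. Whether that
trade-off is acceptable depends on the application.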

Rob

