
fancyerii at gmail
Apr 27, 2012, 8:06 PM
Post #2 of 3
(238 views)
Permalink
|
stemmer semantic is a "large" word, care to use it. On Sat, Apr 28, 2012 at 11:02 AM, Kasun Perera <kasunp [at] opensource> wrote: > I'm using Lucene's Term Freq vector to calculate cosine similarity between > documents, Say my docments has these 3 terms, "owe" "owed" "owing". Lucene > takes this as 3 separate terms, but 3 of them means same "owe". Is there > any functionality in Lucene that can be used to index by semantics? so that > it indexes "owe" "owed" "owing" as one word "owe" with term frequency =3 ? > > If not I'd welcome any suggestions achieving this task? > > -- > Regards > > Kasun Perera --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe [at] lucene For additional commands, e-mail: java-user-help [at] lucene
|