Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Lucene: Java-User

How to extract highest TF-IDF terms from Lucene index?

 

 

Lucene java-user RSS feed   Index | Next | Previous | View Threaded


michael.berkovsky at gmail

May 9, 2012, 1:18 PM

Post #1 of 3 (244 views)
Permalink
How to extract highest TF-IDF terms from Lucene index?

Hi,

Assuming that there is a large lucene collection, and I want to extract top
N terms with highest TF/IDF scores from some field.
The collection does not have term vectors stored. Does Lucene have some
utility to do this?

Thanks!
Michael


lucene at mikemccandless

May 9, 2012, 2:01 PM

Post #2 of 3 (246 views)
Permalink
Re: How to extract highest TF-IDF terms from Lucene index? [In reply to]

There is a tool named HighFregTerms, in contrib/misc that does this...

Mike

Sent from my iPad

On May 9, 2012, at 4:18 PM, Michael Berkovsky <michael.berkovsky [at] gmail> wrote:

> Hi,
>
> Assuming that there is a large lucene collection, and I want to extract top
> N terms with highest TF/IDF scores from some field.
> The collection does not have term vectors stored. Does Lucene have some
> utility to do this?
>
> Thanks!
> Michael

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
For additional commands, e-mail: java-user-help [at] lucene


michael.berkovsky at gmail

May 9, 2012, 2:40 PM

Post #3 of 3 (233 views)
Permalink
Re: How to extract highest TF-IDF terms from Lucene index? [In reply to]

Thanks!

On Wed, May 9, 2012 at 2:01 PM, Mike McCandless
<lucene [at] mikemccandless>wrote:

> There is a tool named HighFregTerms, in contrib/misc that does this...
>
> Mike
>
> Sent from my iPad
>
> On May 9, 2012, at 4:18 PM, Michael Berkovsky <michael.berkovsky [at] gmail>
> wrote:
>
> > Hi,
> >
> > Assuming that there is a large lucene collection, and I want to extract
> top
> > N terms with highest TF/IDF scores from some field.
> > The collection does not have term vectors stored. Does Lucene have some
> > utility to do this?
> >
> > Thanks!
> > Michael
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
> For additional commands, e-mail: java-user-help [at] lucene
>
>

Lucene java-user RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.