Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Lucene: Java-User
A simple Vector Space Model and TFIDF usage
 

Index | Next | Previous | View Flat


amir.jadidi at yahoo

Jun 29, 2009, 12:10 PM


Views: 691
Permalink
A simple Vector Space Model and TFIDF usage

Hi,
It's my first experiment with Lucene. Please help me.
I'm going to index a set of documents and create a feature vector for each of them. This vector contains all terms belong to the document that weight using TFIDF.
After that I want to compute the cosine similarity between all documents and produce a doc-doc similarity matrix. My document set is large and it's important to have a scalable implementation.
Would you please provide me a guideline or to-do list?
Thank you and kind regards.

Subject User Time
A simple Vector Space Model and TFIDF usage amir.jadidi at yahoo Jun 29, 2009, 12:10 PM
    Re: A simple Vector Space Model and TFIDF usage gsingers at apache Jun 30, 2009, 9:13 AM
    Re: A simple Vector Space Model and TFIDF usage kamal.najib at mytum Jul 2, 2009, 1:49 AM

  Index | Next | Previous | View Flat
 
 


Interested in having your list archived? Contact lists@gossamer-threads.com
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.