
Mailing List Archive: kinosearch: discuss
Re: Queries with large number of hits.
 



marvin at rectangular

Sep 19, 2008, 5:28 PM



On Sep 19, 2008, at 11:25 AM, Nathan Kurz wrote:

> The third thing (tiny, but perhaps easy to fix) is that
> Scorepost_read_record is spending 40% of its time in REALLOC. Is the
> enlarged position buffer not getting reused for some reason?

Oi, good catch! With one line of code, we see a 10-20% search-time
speed improvement:

Index: ../c_src/KinoSearch/Posting/ScorePosting.c
===================================================================
--- ../c_src/KinoSearch/Posting/ScorePosting.c (revision 3882)
+++ ../c_src/KinoSearch/Posting/ScorePosting.c (working copy)
@@ -145,6 +145,7 @@
     num_prox = self->freq;
     if (num_prox > self->prox_cap) {
         self->prox = REALLOCATE(self->prox, num_prox, u32_t);
+        self->prox_cap = num_prox;
     }
     positions = self->prox;
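
To illustrate why the missing assignment was so costly, here is a minimal
standalone sketch. It is not the KinoSearch source: DemoPosting, the
reallocs counter, demo_read_record, demo_count_reallocs, and the sample
frequencies are all hypothetical, and plain realloc() stands in for the
REALLOCATE macro. Only the prox, prox_cap, and freq field names come from
the patch. Without the capacity update, prox_cap never changes, so every
record with at least one position triggers a reallocation.

```c
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical stand-in for the posting object. Only the field names
 * prox, prox_cap and freq are taken from the patch. */
typedef struct {
    uint32_t *prox;      /* reusable position buffer */
    uint32_t  prox_cap;  /* number of elements prox can hold */
    uint32_t  freq;      /* number of positions in the current record */
    unsigned  reallocs;  /* instrumentation only, not in the original */
} DemoPosting;

/* Grow the position buffer if the current record needs more room.
 * update_cap == 0 mimics the pre-patch behavior (capacity never recorded);
 * update_cap != 0 mimics the patched behavior. */
static void
demo_read_record(DemoPosting *self, uint32_t freq, int update_cap)
{
    self->freq = freq;
    uint32_t num_prox = self->freq;
    if (num_prox > self->prox_cap) {
        uint32_t *grown = realloc(self->prox, num_prox * sizeof(uint32_t));
        if (grown == NULL) {
            free(self->prox);
            abort();
        }
        self->prox = grown;
        self->reallocs++;
        if (update_cap) {
            self->prox_cap = num_prox;  /* the one-line fix */
        }
    }
}

/* Feed a fixed, made-up sequence of term frequencies through the reader
 * and report how many times the buffer was reallocated. */
static unsigned
demo_count_reallocs(int update_cap)
{
    DemoPosting posting = { NULL, 0, 0, 0 };
    static const uint32_t freqs[] = { 4, 2, 4, 3, 8, 1, 8 };
    for (size_t i = 0; i < sizeof(freqs) / sizeof(freqs[0]); i++) {
        demo_read_record(&posting, freqs[i], update_cap);
    }
    free(posting.prox);
    return posting.reallocs;
}
```

With the seven sample frequencies above, demo_count_reallocs(0) performs
7 reallocations (one per record), while demo_count_reallocs(1) performs
only 2 (when the frequency first reaches 4, then 8).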

> ps. The directions for building the Reuters benchmark index seem out
> of date. '-Mblib' no longer finds the uninstalled KinoSearch.so in
> the parent hierarchy.

I'll try to get updates committed later this evening.

Incidentally, although there are c. 19,000 unique documents in the
Reuters corpus, the indexing benchmarker will loop back over them if you
specify a larger number, e.g. --docs=1000000.

Marvin Humphrey
Rectangular Research
http://www.rectangular.com/


_______________________________________________
KinoSearch mailing list
KinoSearch [at] rectangular
http://www.rectangular.com/mailman/listinfo/kinosearch
