Oct 1, 1999, 5:44 PM
Veteran / Moderator (6956 posts)
Oct 1, 1999, 5:44 PM
Post #6 of 8
Views: 2875
You will find that if the only think you find at fault with LinkSQL is that it can't search like the mulit-million dollar software behind Yahoo, you will be very disappointed with the performance of PHP as your driving force.
It has a place, in scripting dynamic pages on the server (rather than client-side javascript), but for managing a site, it has limitations that are not found with other cgi programs.
One of the problems I have found is because the code is embeded in the HTML, it's processed each time the page/frame is accessed, and that creates some odd behaviour with the <back> and <forward> buttons on the browser. Page caching doesn't work quite right, and can give the users some very odd -- or problematic -- results (especially where information is being sent TO the server).
Phrase searching is partially enabled in this release, and it's on the list of things Alex has been working on for awhile.
If that's what you mean by 'yahoo type' searches, then when the foreign character support for searches is finalized, this type of search will be too.
In thinking about my own use of the net, the number of times I search for "word1 word2" as a phrase, vs "word1 word2" as an AND or OR is very, very small -- usually where I want to exclude a term and would use a 'NOT' if the option was offered.
It took 7 seconds to index my 425 link test database. (Some of the entries are 15k of text in size.) I imagine it would take roughly as long (maybe a bit less) to just SEARCH and RETURN a full-text search on that database. Multiply this over a larger database, more simultaneous searches, etc, and the processing load becomes extreme, and puts you back in the same (perhaps worse) position of resource consumption that Links 2.0 had. (the other benefits on database size, etc, are still greater with SQL).
Yahoo gains much of its speed (forgetting its top-end investment in hardware and software) in that many searches are already processed, cached, and just regurgitated. Diskspace (and even ram) is cheaper than CPU).
It defeats most of the benefits gained by SQL and Indexed access.
Weighted searches, with the ability to weight certain fields higher than others, to create more 'relevance' in the searches returned, is a much better, and more desirable feature -- and it exists. This puts the entries that more closely fit the criteria at the top, while still allowing the searches to include fields that often return useless 'hits' such as email address and URL.
(BTW... the changes I posted for the last version apply to the 1.1beta as well.)
I _would_ like to see more ability to use SQL-type commands in the search dialogs, and the admin interface fixed up to look more like the user search -- so that AND/OR and EXACT/SUBSTRING are options, and an SQL pass-through for returning links that match a generic SQL query for maintennance reasons.
Full-text searching ("like yahoo") is coming, I just don't see it being a benefit, and do see it as defeating much of the benefits of an Indexed RDBMS.