You are not logged in Log in Join
You are here: Home » Members » TheJester » Zope Quick and Dirty Search Engine and Indexer

Log in
Name

Password

 
 

Folder icon Zope Quick and Dirty Search Engine and Indexer

ZQUADSEX: Zope QUick And Dirty Search Engine and indeXer

What's it for?

Well it's for making a search engine, based on content you provide, either by

  • Submitting URLs to be indexed (which are not stored), o Uploading files to be indexed (which are not stored), o By using the block tag to submit content for indexing.

Using it:

Install a Text Indexer product in your tree somewhere. Then you can submit URLs, or Files to be indexed into it (via dtml if you want).

You can also use to wrap content to be indexed.

URL/Tag based indexing uses the absolute_url() to determine an ID for it, so if you use variables to alter the display of your page, then you'll need to do some work to get it to work.

** Warning: the stats page has some code commented out, uncomment it for an interesting view of your index, but, it'll take a long* time to render.

------------------------------------------------------------------------ You can specify the default search mode:

  • And Mode: all words in a query must match o Or Mode: any words in a query can match

You can override the search method dynamically if you want to.

You can specify an amount of text to keep as a summary. You can specify a threshold (as a percentage of the total) for words to ignore in searching. Ignored words do not abort an and search.

You can specify a minimum period of time before reindexing occurs (useful for .

  • Indexing does not occur if content has not changed since last indexing.
  • URL indexing removes html tags before indexing so font and td don't end up being the most frequent words in your index.

Indexing is quite slow, but searching is quite quick, queries on 40Mb of data (that is data submitted to be indexed), takes on average less than .5 second.

Can I do advanced searches? Short Answer: No. Long Answer: Yes.

Can it be made faster? Yes, it's not particularly optimised right now, there's quite a few things that can be done to make things run quicker, smoother, nicer, pick your superlative.

Will you eventually do that? Probably not, unless someone wants to hire me to work on it, its sufficient for my needs right at the moment (although I do have a version thats tweaked for my dataset).

 Title   Type   Size   Modified   Status 
 Initial Release Edit object Software Release   2001-04-29 published
 Small fix for 'and' searches Edit object Software Release   2001-11-23 published