Bjorn Stabell

This tool lets you specify redirections inside of your CMF portal, similar to Apache's Rewrite rules. The match and replacement patterns are all regular expressions, and you can select which HTTP status code to return.

 Title   Type   Size   Last Modified   Description 
temp Folder   2004-02-19  
ZCTextIndex splitter that works with Chinese, Japanese, and Korean text Software Package   2004-01-18

CJKSplitter - Chinese, Japanese, Korean word splitter for ZCTextIndex

CJKSplitter is a ZCTextIndex splitter for CJK (Chinese-Japanese-Korean) text stored as Unicode. It uses a simple but workable "hack" instead of trying to do real word splitting from dictionaries. Compared to a dictionary-based word splitter, this results in a bigger index and more matches than necessary, but that is a small price to pay for the reduced complexity.
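To illustrate the kind of dictionary-free splitting described above, here is a minimal sketch in Python 3. It is not the CJKSplitter source: it assumes the "hack" is overlapping two-character tokens (bigrams), which this page does not state, and the names `char_class`, `split_cjk`, and `CJK_NAME_PREFIXES` are made up for the example. Only the use of the standard `unicodedata` module to recognise CJK characters follows the description on this page.

```python
from itertools import groupby
import unicodedata

# Assumption: a character counts as CJK if its Unicode name starts with one
# of these prefixes. This is a rough heuristic, not an exhaustive test.
CJK_NAME_PREFIXES = ("CJK UNIFIED IDEOGRAPH", "HIRAGANA", "KATAKANA", "HANGUL")


def char_class(ch):
    """Classify a character as 'cjk', 'word' (other alphanumerics), or 'sep'."""
    if unicodedata.name(ch, "").startswith(CJK_NAME_PREFIXES):
        return "cjk"
    if ch.isalnum():
        return "word"
    return "sep"


def split_cjk(text):
    """Split text into tokens; runs of CJK characters become overlapping bigrams."""
    tokens = []
    for kind, group in groupby(text, key=char_class):
        run = "".join(group)
        if kind == "word":
            # Non-CJK words are kept whole and lowercased.
            tokens.append(run.lower())
        elif kind == "cjk":
            if len(run) == 1:
                tokens.append(run)
            else:
                # "ABCD" -> "AB", "BC", "CD"
                tokens.extend(run[i:i + 2] for i in range(len(run) - 1))
        # Separators (spaces, punctuation) produce no tokens.
    return tokens


print(split_cjk("Zope 中文搜索"))
```

Under this assumption, every adjacent pair of characters is indexed whether or not it is a real word, which would explain both the larger index and the extra matches mentioned above.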

Changes Summary

  • Version 0.2 [email protected] improves on the previous version in a number of ways: it uses Unicode internally (not UTF-8), replaces the configuration file with lookups in the unicodedata module to identify CJK characters and symbols, and adds unit tests and detailed English instructions for installation etc.
  • Version 0.1 [email protected] original version.

Known Problems

  • Text must (well, should) be stored as Unicode.
  • Cannot search single characters.
  • Could do a better job of identifying CJK characters.
  • May match more than is strictly necessary due to algorithm used. (See source code for details.)

Please join the zopeasia project on SourceForge to participate in the development.

CMF URL Redirection Tool Software Package   2001-09-07

Overview

After installing this tool you can specify redirections inside your CMF portal, similar to the way Apache Rewrite rules work.

The match and replacement patterns are all regular expressions, and you can select which HTTP status code to return.
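As a rough illustration of how such rules behave, here is a minimal sketch in plain Python. It does not use the tool's actual API or storage format: the rule table `RULES`, the function `resolve_redirect`, and the example paths are all hypothetical. It only shows the general idea of pairing a regular-expression match pattern with a replacement pattern and an HTTP status code, applied to a path relative to the portal root.

```python
import re

# Hypothetical rule table: (match pattern, replacement pattern, HTTP status).
# Paths are taken to be relative to the portal root.
RULES = [
    (r"^old-news/(\d+)$", r"news/item-\1", 301),  # permanent redirect
    (r"^docs/(.*)$", r"documentation/\1", 302),   # temporary redirect
]


def resolve_redirect(path, rules=RULES):
    """Return (new_path, status) for the first matching rule, or None."""
    for pattern, replacement, status in rules:
        match = re.match(pattern, path)
        if match:
            # Substitute captured groups (\1, \2, ...) into the replacement.
            return match.expand(replacement), status
    return None


print(resolve_redirect("old-news/42"))
print(resolve_redirect("about"))
```

Rules are tried in order and the first match wins, so in a scheme like this more specific patterns would go before more general ones.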

TODO

I couldn't for the life of me figure out how to make it work with virtual hosting. The URL rewriting engine takes a URL relative to the parent of the tool (usually the portal root), but I don't know how to get that relative URL in a virtual hosting environment; I tried quite a few ways.

Help welcome.

CJKSplitter v0.2 News Item 1 K 2003-03-10  
Proposals Folder   2004-02-19