CHANGES

Rewrote the processing algorithm to work directly on Unicode; there was no need to work in UTF-8, as the old program did. All text must now be stored as Unicode, so people need to start using ...:utf8:utext etc. in their form fields.
Removed the configuration file handling. Replaced the definitions of symbols and CJK characters with calls to unicodedata.category().
Made structure and interface conform more closely to another splitter: HTMLSplitter.
Removed Chinese comments and added English comments.
Described how the algorithm works.
Made it refreshable by ignoring ValueError from registerFactory. (There is probably a better way of doing this.)
Added to the installation instructions how to enable GB2312 and Unicode support in Python. Hopefully we can have Chinese support in the Python distributed with Zope in the future.
Added a simple set of unit tests.
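
As an illustration of the unicodedata.category() change noted above, here is a minimal sketch of how characters might be classified without a configuration file. It is not the splitter's actual code: the function names are hypothetical, it is written for Python 3, and it assumes that category "Lo" is a good enough stand-in for CJK characters (it also matches letters from other caseless scripts).

```python
import unicodedata


def is_separator(ch):
    # Assumption: punctuation (P*), symbols (S*), separators (Z*) and
    # control/format characters (C*) all act as word delimiters.
    return unicodedata.category(ch)[0] in "PSZC"


def is_ideograph_like(ch):
    # "Lo" (Letter, other) covers CJK ideographs, but also letters from
    # other caseless scripts; this is a simplification for the sketch.
    return unicodedata.category(ch) == "Lo"


def split(text):
    """Split text into words; each "Lo" character becomes its own token."""
    tokens = []
    word = []
    for ch in text:
        if is_separator(ch) or is_ideograph_like(ch):
            # A delimiter or ideograph ends the word collected so far.
            if word:
                tokens.append("".join(word))
                word = []
            if is_ideograph_like(ch):
                tokens.append(ch)
        else:
            word.append(ch)
    if word:
        tokens.append("".join(word))
    return tokens
```

For example, split("Zope 中文, test") would yield the tokens "Zope", "中", "文" and "test" under these assumptions.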