Dublin Core Elements

The Dublin Core metadata element set is a standard for cross-domain information resource description.
Element Description Value
Identifier resource ID http://old.zope.org/Members/shh/UnicodeLexicon
Title resource name Unicode aware lexicon type for ZCTextIndex
Description resource summary <h2>Why?</h2> <p> The stock lexicon handles 8-bit strings well if you get the 'locale' setting in zope.conf right, but it does not work with Unicode or UTF-8. UnicodeLexicon fills this gap. </p> <h2>What?</h2> <p> This product adds a ZCTextIndex Unicode Lexicon type to Zope. The lexicon comes with word splitters, stop word removers, and a case normalizer. </p> <p> If you have GenericSetup installed, you can use the provided extension profile to create Unicode lexicons in your portal_catalog and update the 'Title', 'Description', and 'SearchableText' ZCTextIndexes. </p> <h2>Limitations</h2> <p> The lexicon assumes its input is either Unicode strings or UTF-8 encoded byte strings. If your site uses a different encoding, e.g. UTF-16 or UTF-32, you will have to change the 'enc' constant in UnicodeSplitter.py accordingly. </p> <p> The extension profile installs lexicons without stop word removers. This is because only English stop words are supported, and it is safe to assume you are using Unicode precisely because you need to handle non-English text. </p>
Creator resource creator shh
Date default date 2006-08-14 04:36:41
Format resource format text/html
Type resource type Software Package
Subject resource keywords Search/Catalog
Contributors resource collaborators
Language resource language
Publisher resource publisher No publisher
Rights resource copyright
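
The Description above mentions word splitters and a case normalizer that accept either Unicode strings or UTF-8 encoded bytes, with an 'enc' constant controlling the assumed encoding. The following is a minimal sketch of that idea in plain Python, not the product's actual code: the names ENC, to_text, split_words, and normalize_case are hypothetical, and the real UnicodeSplitter.py may split words differently.

```python
import re

# Assumption: plays the role of the 'enc' constant described above.
# Change it (or pass enc=...) if byte strings arrive in another encoding.
ENC = "utf-8"

# Unicode-aware word pattern: \w matches letters and digits in any script.
_WORD = re.compile(r"\w+", re.UNICODE)


def to_text(value, enc=ENC):
    """Return value as a Unicode string, decoding bytes with enc."""
    if isinstance(value, bytes):
        return value.decode(enc)
    return value


def split_words(value, enc=ENC):
    """Split Unicode text, or bytes encoded as enc, into a list of words."""
    return _WORD.findall(to_text(value, enc))


def normalize_case(words):
    """Lowercase each word so that lookups are case-insensitive."""
    return [word.lower() for word in words]


if __name__ == "__main__":
    raw = "Größe und Maß".encode("utf-8")
    print(normalize_case(split_words(raw)))
```

Passing enc="utf-16" to split_words corresponds to the change the Limitations paragraph describes for sites that do not use UTF-8. No stop word remover is included, matching the extension profile's default.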


Additional Zope Elements

Element Description Value
CreationDate date resource created 2006-08-14 04:17:19
ModificationDate date resource last modified 2006-08-14 04:36:41
EffectiveDate date resource becomes effective 2006-08-14 04:36:41
ExpirationDate date resource expires None
