summaryrefslogtreecommitdiffstats
path: root/Tools/unicode/makeunicodedata.py
Commit message (Collapse)AuthorAgeFilesLines
* Unicode nits: Don't include unicodedatabase.h no more. And make sureFredrik Lundh2001-01-211-2/+2
| | | | to build *all* tables in makeunicodedata.py.
* compress unicode decomposition tables (this saves another 55k)Fredrik Lundh2001-01-211-41/+76
|
* forgot to check in the new makeunicodedata.py scriptFredrik Lundh2001-01-211-17/+271
|
* Added 38,642 missing characters to the Unicode database (first-lastFredrik Lundh2000-11-031-11/+39
| | | | | | | ranges) -- but thanks to the 2.0 compression scheme, this doesn't add a single byte to the resulting binaries (!) Closes bug #117524
* Remove bogus stdout redirection and use of sys.__stdout__; useFred Drake2000-10-261-46/+42
| | | | augmented print statement instead.
* - don't set the titlecase flag for uppercase letters (sorry, tim)Fredrik Lundh2000-09-251-2/+2
|
* unicode database compression, step 3:Fredrik Lundh2000-09-251-4/+19
| | | | - added decimal digit and digit properties to the unidb tables
* unicode database compression, step 3:Fredrik Lundh2000-09-251-9/+97
| | | | | | | - use unidb compression for the unicodectype module. smaller, faster, and slightly more portable... - also mention the unicode directory in Tools/README
* unicode database compression, step 2:Fredrik Lundh2000-09-251-15/+47
| | | | | | | | | | - fixed attributions - moved decomposition data to a separate table, in preparation for step 3 (which won't happen before 2.0 final, promise!) - use relative paths in the generator script I have a lot more stuff in the works for 2.1, but let's leave that for another day...
* Fiddled w/ /F's cool new splitbins function: documented it, generalized itTim Peters2000-09-251-26/+54
| | | | | | | | | | a bit, sped it a lot primarily by removing the unused assumption that None was a legit bin entry (the function doesn't really need to assume that there's anything special about 0), added an optional "trace" argument, and in __debug__ mode added exhaustive verification that the decomposition is both correct and doesn't overstep any array bounds (which wasn't obvious to me from staring at the generated C code -- now I feel safe!). Did not commit a new unicodedata_db.h, as the one produced by this version is identical to the one already checked in.
* unicode database compression, step 1:Fredrik Lundh2000-09-241-0/+202
- use unidb compression for the unicodedata module. on Windows, the new unidatabase module is 120k, down from nearly 600k.