path: root/Modules/unicodedata.c
Commit message (Author, Date, Files, Lines -/+)
* Patch 1494554: Update numeric properties to Unicode 4.1. (Martin v. Löwis, 2006-05-27, 1 file, -2/+2)
|
* No reason to export get_decomp_record, make static (Neal Norwitz, 2006-04-17, 1 file, -1/+1)
|
* Support NFD of very long strings. (Martin v. Löwis, 2006-04-13, 1 file, -3/+3)
|
* Get rid of warnings about using chars as subscripts on Alpha (and possibly other platforms) by using Py_CHARMASK(). (Neal Norwitz, 2006-04-10, 1 file, -2/+2)
|
* Adjust CJK Ideograph range to Unicode 4.1. (Martin v. Löwis, 2006-03-11, 1 file, -13/+12)
|
* Fix refcounting bug. (Martin v. Löwis, 2006-03-10, 1 file, -0/+1)
|
* Avoid forward-declaring the methods array. (Martin v. Löwis, 2006-03-10, 1 file, -52/+53)
|   Rename unicodedata.db* to unicodedata.ucd*
* Update Unicode database to Unicode 4.1. (Martin v. Löwis, 2006-03-09, 1 file, -28/+213)
|
* Remove gcc (4.0.x) warning about uninitialized value by explicitly setting the sentinel value in the main function, rather than the helper. (Thomas Wouters, 2006-03-01, 1 file, -2/+1)
|   This function could possibly do with an early-out if any of the helper calls ends up with a len of 0, but I doubt it really matters (how common are malformed hangul syllables, really?)
* Patch #1213831: Fix typo in unicodedata._getcode. (Martin v. Löwis, 2005-09-18, 1 file, -1/+1)
|   Will backport to Python 2.4.
* Correct URL to the official UnicodeData 3.2.0 resource. (Reported by Darek Suchojad) (Hye-Shik Chang, 2005-06-04, 1 file, -1/+1)
|
* Fill docstrings for module and functions, extracted from the tex documentation. (Patch #1173245, Contributed by Jeremy Yallop) (Hye-Shik Chang, 2005-04-04, 1 file, -13/+108)
|
* SF #989185: Drop unicode.iswide() and unicode.width() and add unicodedata.east_asian_width(). (Hye-Shik Chang, 2004-08-04, 1 file, -0/+21)
|   You can still implement your own simple width() function using it like this:
|
|       def width(u):
|           w = 0
|           for c in unicodedata.normalize('NFC', u):
|               cwidth = unicodedata.east_asian_width(c)
|               if cwidth in ('W', 'F'):
|                   w += 2
|               else:
|                   w += 1
|           return w
|
* Fix typo. (Hye-Shik Chang, 2004-07-15, 1 file, -1/+1)
|
* Special case normalization of empty strings. Fixes #924361. (Martin v. Löwis, 2004-04-17, 1 file, -0/+7)
|   Backported to 2.3.
* Overallocate target buffer for normalization more early. Fixes #834676. (Martin v. Löwis, 2003-11-06, 1 file, -5/+7)
|   Backported to 2.3.
* Fix SF bug #694816, remove comparison of unsigned value < 0 (Neal Norwitz, 2003-02-28, 1 file, -2/+2)
|
* Remove C++ comment. (Martin v. Löwis, 2002-12-07, 1 file, -1/+1)
|
* Add unidata_version. Bump generator version number. (Martin v. Löwis, 2002-11-25, 1 file, -0/+2)
|
* Verify that the code in CJK UNIFIED IDEOGRAPH- actually denotes an ideograph. (Martin v. Löwis, 2002-11-23, 1 file, -3/+12)
|
* Patch #626485: Support Unicode normalization. (Martin v. Löwis, 2002-11-23, 1 file, -15/+279)
|
* Implement names for CJK unified ideographs. Add name to KeyError output. (Martin v. Löwis, 2002-11-23, 1 file, -1/+39)
|   Verify that the lookup for an existing name succeeds.
* Fix off-by-one error. (Martin v. Löwis, 2002-11-23, 1 file, -1/+1)
|
* Patch #626548: Support Hangul syllable names. (Martin v. Löwis, 2002-11-23, 1 file, -2/+109)
|
* Update to Unicode 3.2 database. (Martin v. Löwis, 2002-10-18, 1 file, -3/+3)
|
* Replace DL_IMPORT with PyMODINIT_FUNC and remove "/export:init..." link command line for Windows builds. (Mark Hammond, 2002-07-23, 1 file, -1/+1)
|   This should allow MSVC to import and build the Python MSVC6 project files without error.
* Patch #568124: Add doc string macros. (Martin v. Löwis, 2002-06-13, 1 file, -1/+1)
|
* _Py prefix is verboten for static entry points (Andrew MacIntyre, 2002-06-13, 1 file, -4/+4)
|
* Remove direct manipulation of the module dict. (Fred Drake, 2002-04-03, 1 file, -9/+3)
|
* OS/2 EMX port changes (Modules part of patch #450267): (Andrew MacIntyre, 2002-03-03, 1 file, -4/+4)
|   Modules/ _hotshot.c dbmmodule.c fcntlmodule.c main.c pwdmodule.c readline.c selectmodule.c signalmodule.c termios.c timemodule.c unicodedata.c
* unicodedata_decomposition(): sprintf -> PyOS_snprintf. (Tim Peters, 2001-11-30, 1 file, -2/+4)
|
* Kill a warning on the SGI compiler. (Fred Drake, 2001-07-19, 1 file, -1/+1)
|   This is part of SF patch #434992.
* Be a bit more strict in setting up the export of the C API for this module; do not attempt to insert the API object into the module dict if there was an error creating it. (Fred Drake, 2001-03-03, 1 file, -6/+6)
|
* renamed internal functions to avoid name clashes under OpenVMS (Fredrik Lundh, 2001-02-18, 1 file, -17/+18)
|   (fixes bug #132815)
* stupid typo (for some reason, this only caused problems on OpenVMS). (Fredrik Lundh, 2001-02-18, 1 file, -1/+1)
|
* Move uchhash functionality into unicodedata (after the recent crop of changes, the files are small enough to do this). (Fredrik Lundh, 2001-01-24, 1 file, -19/+226)
|   Also adds "name" and "lookup" functions to unicodedata.
* Unicode nits: Don't include unicodedatabase.h no more. And make sure to build *all* tables in makeunicodedata.py. (Fredrik Lundh, 2001-01-21, 1 file, -1/+0)
|
* compress unicode decomposition tables (this saves another 55k) (Fredrik Lundh, 2001-01-21, 1 file, -79/+94)
|
* unicode database compression, step 2: (Fredrik Lundh, 2000-09-25, 1 file, -28/+18)
|   - fixed attributions
|   - moved decomposition data to a separate table, in preparation for step 3 (which won't happen before 2.0 final, promise!)
|   - use relative paths in the generator script
|   I have a lot more stuff in the works for 2.1, but let's leave that for another day...
* unicode database compression, step 1: (Fredrik Lundh, 2000-09-24, 1 file, -11/+1)
|   - use unidb compression for the unicodedata module. on Windows, the new unidatabase module is 120k, down from nearly 600k.
* Changing the CNRI copyright notice according to CNRI's instructions. (Guido van Rossum, 2000-08-03, 1 file, -1/+1)
|   This is a notice without a date, which apparently is not a claim to copyright but only advice to the reader. IANAL. :-)
* Bunch of minor ANSIfications: 'void initfunc()' -> 'void initfunc(void)', and a couple of functions that were missed in the previous batches. (Thomas Wouters, 2000-07-21, 1 file, -1/+1)
|   Not terribly tested, but very carefully scrutinized, three times.
|   All these were found by the little findkrc.py that I posted to python-dev, which means there might be more lurking. Cases such as this:
|
|       long func(a, b)
|               long a;
|               long b; /* flagword */
|       {
|
|   and other cases where the last ; in the argument list isn't followed by a newline and an opening curly bracket. Regexps to catch all are welcome, of course ;)
* Marc-Andre Lemburg: The large unicode database table is broken in pages of 4k entries each. This should fix compiler problems on some platforms. (Guido van Rossum, 2000-03-31, 1 file, -7/+17)
|
* Module unicodedata -- Provides access to the Unicode 3.0 data base. (Guido van Rossum, 2000-03-10, 1 file, -0/+269)
|   Written by Marc-Andre Lemburg.
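
Several entries in this log add behaviour that is visible from Python: names for Hangul syllables and CJK unified ideographs, normalization, and the check that a CJK UNIFIED IDEOGRAPH- name actually denotes an ideograph. The sketch below is only an illustration of that behaviour, assuming a current Python 3 interpreter and its standard unicodedata module. The helper describe() is a hypothetical name invented for this example and is not part of the module.

```python
import unicodedata


def describe(ch):
    """Return (name, NFD code points) for a single character.

    Hypothetical helper for illustration; not part of unicodedata.
    """
    name = unicodedata.name(ch)
    nfd = ["U+%04X" % ord(c) for c in unicodedata.normalize("NFD", ch)]
    return name, nfd


# Hangul syllable: the name is derived and NFD splits it into jamo.
print(describe("\uac00"))

# CJK unified ideograph: the name is built from the code point.
print(describe("\u4e00"))

# A CJK name whose code point is not an ideograph is rejected by lookup().
try:
    unicodedata.lookup("CJK UNIFIED IDEOGRAPH-0041")
except KeyError as exc:
    print("rejected:", exc)

# Empty strings are normalized to empty strings.
print(repr(unicodedata.normalize("NFD", "")))
```

The exact KeyError text may differ between Python versions, so the example prints it rather than relying on a fixed message.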