path: root/Modules/unicodedata.c
Commit message  Author  Age  Files  Lines
* Issue #3811: The Unicode database was updated to 5.1.Martin v. Löwis2008-09-101-5/+8
| | | | Reviewed by Fredrik Lundh and Marc-Andre Lemburg.
* This reverts r63675 based on the discussion in this thread:Gregory P. Smith2008-06-091-6/+6
| | | | http://mail.python.org/pipermail/python-dev/2008-June/079988.html
| | | | Python 2.6 should stick with PyString_* in its codebase. The PyBytes_* names in the spirit of 3.0 are available via a #define only. See the email thread.
* Change all functions that expect one unicode character to accept a pair ofWalter Dörwald2008-06-021-73/+74
| | | | surrogates in narrow builds. Fixes issue #1706460.
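For context, a narrow build stores a character above U+FFFF as two UTF-16 code units, so a function that expects "one character" has to accept the pair. The arithmetic for turning such a pair back into a single code point can be sketched in Python as below. The helper name join_surrogates is hypothetical and this is not the C code from the commit, only the standard UTF-16 formula.

```python
def join_surrogates(high, low):
    """Combine a UTF-16 surrogate pair into one code point.

    Hypothetical helper for illustration; not part of unicodedata.
    """
    if not (0xD800 <= high <= 0xDBFF):
        raise ValueError("first code unit is not a high surrogate")
    if not (0xDC00 <= low <= 0xDFFF):
        raise ValueError("second code unit is not a low surrogate")
    # Each surrogate carries 10 bits; the result is offset by 0x10000.
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


# U+1D11E (MUSICAL SYMBOL G CLEF) is encoded as D834 DD1E in UTF-16.
code_point = join_surrogates(0xD834, 0xDD1E)
```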
* Renamed PyString to PyBytesChristian Heimes2008-05-261-6/+6
|
* #1629: Renamed Py_Size, Py_Type and Py_Refcnt to Py_SIZE, Py_TYPE and Py_REFCNT.Christian Heimes2007-12-191-1/+1
| | | | Macros for b/w compatibility are available.
* Bug #1704793: Return UTF-16 pair if unicodedata.lookup cannotMartin v. Löwis2007-07-281-16/+11
| | | | represent the result in a single character.
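The "UTF-16 pair" mentioned here is the two-code-unit encoding of a character above U+FFFF. A minimal sketch of how such a pair is derived from a code point, assuming only the standard UTF-16 formula, might look like this. The function name split_surrogates is hypothetical and does not appear in unicodedata.c.

```python
def split_surrogates(code_point):
    """Split a supplementary-plane code point into a UTF-16 surrogate pair.

    Hypothetical helper for illustration; not part of unicodedata.
    """
    if not (0x10000 <= code_point <= 0x10FFFF):
        raise ValueError("code point is not in a supplementary plane")
    offset = code_point - 0x10000
    high = 0xD800 + (offset >> 10)    # upper 10 bits
    low = 0xDC00 + (offset & 0x3FF)   # lower 10 bits
    return high, low


# U+1D11E (MUSICAL SYMBOL G CLEF) as a surrogate pair.
pair = split_surrogates(0x1D11E)
```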
* PEP 3123: Provide forward compatibility with Python 3.0, while keepingMartin v. Löwis2007-07-211-3/+2
| | | | | backwards compatibility. Add Py_Refcnt, Py_Type, Py_Size, and PyVarObject_HEAD_INIT.
* Replace C++ comment with C comment (fixes SF bug #1593525).Walter Dörwald2006-11-091-1/+1
|
* I'm not sure why this code allocates this string for the error message.Neal Norwitz2006-08-121-2/+11
| | | | | | | I think it would be better to always use snprintf and have the format limit the size of the name appropriately (like %.200s). Klocwork #340
* Update dangling references to the 3.2 database toMartin v. Löwis2006-08-101-5/+5
| | | | mention that this is UCD 4.1 now.
* No functional change. Add comment and assert to describe why there cannot be overflow which was reported by Klocwork.Neal Norwitz2006-07-271-2/+9
| | | | Discussed on python-dev
* Patch 1494554: Update numeric properties to Unicode 4.1.Martin v. Löwis2006-05-271-2/+2
|
* No reason to export get_decomp_record, make staticNeal Norwitz2006-04-171-1/+1
|
* Support NFD of very long strings.Martin v. Löwis2006-04-131-3/+3
|
* Get rid of warnings about using chars as subscriptsNeal Norwitz2006-04-101-2/+2
| | | | on Alpha (and possibly other platforms) by using Py_CHARMASK().
* Adjust CJK Ideograph range to Unicode 4.1.Martin v. Löwis2006-03-111-13/+12
|
* Fix refcounting bug.Martin v. Löwis2006-03-101-0/+1
|
* Avoid forward-declaring the methods array.Martin v. Löwis2006-03-101-52/+53
| | | | Rename unicodedata.db* to unicodedata.ucd*
* Update Unicode database to Unicode 4.1.Martin v. Löwis2006-03-091-28/+213
|
* Remove gcc (4.0.x) warning about uninitialized value by explicitly settingThomas Wouters2006-03-011-2/+1
| | | | | | | the sentinel value in the main function, rather than the helper. This function could possibly do with an early-out if any of the helper calls ends up with a len of 0, but I doubt it really matters (how common are malformed hangul syllables, really?)
* Patch #1213831: Fix typo in unicodedata._getcode.Martin v. Löwis2005-09-181-1/+1
| | | | Will backport to Python 2.4.
* Correct URL to the official UnicodeData 3.2.0 resource. (ReportedHye-Shik Chang2005-06-041-1/+1
| | | | by Darek Suchojad)
* Fill docstrings for module and functions, extracted from the texHye-Shik Chang2005-04-041-13/+108
| | | | documentation. (Patch #1173245, Contributed by Jeremy Yallop)
* SF #989185: Drop unicode.iswide() and unicode.width() and addHye-Shik Chang2004-08-041-0/+21
| | | | unicodedata.east_asian_width(). You can still implement your own
| | | | simple width() function using it like this:
| | | |     def width(u):
| | | |         w = 0
| | | |         for c in unicodedata.normalize('NFC', u):
| | | |             cwidth = unicodedata.east_asian_width(c)
| | | |             if cwidth in ('W', 'F'):
| | | |                 w += 2
| | | |             else:
| | | |                 w += 1
| | | |         return w
* Fix typo.Hye-Shik Chang2004-07-151-1/+1
|
* Special case normalization of empty strings. Fixes #924361.Martin v. Löwis2004-04-171-0/+7
| | | | Backported to 2.3.
* Overallocate target buffer for normalization more early. Fixes #834676.Martin v. Löwis2003-11-061-5/+7
| | | | Backported to 2.3.
* Fix SF bug #694816, remove comparison of unsigned value < 0Neal Norwitz2003-02-281-2/+2
|
* Remove C++ comment.Martin v. Löwis2002-12-071-1/+1
|
* Add unidata_version. Bump generator version number.Martin v. Löwis2002-11-251-0/+2
|
* Verify that the code in CJK UNIFIED IDEOGRAPH- actually denotes an ideograph.Martin v. Löwis2002-11-231-3/+12
|
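From the Python side, the names of CJK unified ideographs are derived from the code point rather than stored individually. The sketch below uses only unicodedata.name and unicodedata.lookup. The wrapper lookup_or_none is a hypothetical convenience, and the rejection of the second name is what the check described in this entry is expected to produce, not something this example verifies.

```python
import unicodedata


def lookup_or_none(name):
    """Return the character for a Unicode name, or None if lookup fails.

    Hypothetical wrapper for illustration; not part of unicodedata.
    """
    try:
        return unicodedata.lookup(name)
    except KeyError:
        return None


# U+4E00 is a CJK unified ideograph, so the derived name resolves.
ideograph = lookup_or_none("CJK UNIFIED IDEOGRAPH-4E00")

# U+0041 is LATIN CAPITAL LETTER A, not an ideograph; with the check
# in place this name is expected to be rejected (assumption).
not_ideograph = lookup_or_none("CJK UNIFIED IDEOGRAPH-0041")
```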
* Patch #626485: Support Unicode normalization.Martin v. Löwis2002-11-231-15/+279
|
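As a brief illustration of what normalization support provides at the Python level, the following sketch calls unicodedata.normalize with the standard form names. The variable names are illustrative only.

```python
import unicodedata

composed = "\u00e9"  # LATIN SMALL LETTER E WITH ACUTE

# NFD splits the character into a base letter plus a combining accent.
decomposed = unicodedata.normalize("NFD", composed)

# NFC recombines the sequence into the precomposed character.
recomposed = unicodedata.normalize("NFC", decomposed)

# NFKC also applies compatibility mappings, e.g. the "fi" ligature.
compat = unicodedata.normalize("NFKC", "\ufb01")
```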
* Implement names for CJK unified ideographs. Add name to KeyError output.Martin v. Löwis2002-11-231-1/+39
| | | | Verify that the lookup for an existing name succeeds.
* Fix off-by-one error.Martin v. Löwis2002-11-231-1/+1
|
* Patch #626548: Support Hangul syllable names.Martin v. Löwis2002-11-231-2/+109
|
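Hangul syllable names are not stored one by one; the Unicode Standard derives them arithmetically from the code point. The sketch below follows that published algorithm in Python. It is not the C code from this patch, and the names hangul_name, LEAD, VOWEL and TRAIL are hypothetical.

```python
# Jamo short names as given in the Unicode Standard (Jamo.txt).
LEAD = ["G", "GG", "N", "D", "DD", "R", "M", "B", "BB", "S",
        "SS", "", "J", "JJ", "C", "K", "T", "P", "H"]
VOWEL = ["A", "AE", "YA", "YAE", "EO", "E", "YEO", "YE", "O", "WA",
         "WAE", "OE", "YO", "U", "WEO", "WE", "WI", "YU", "EU", "YI", "I"]
TRAIL = ["", "G", "GG", "GS", "N", "NJ", "NH", "D", "L", "LG",
         "LM", "LB", "LS", "LT", "LP", "LH", "M", "B", "BS", "S",
         "SS", "NG", "J", "C", "K", "T", "P", "H"]

S_BASE = 0xAC00                              # first precomposed syllable
S_COUNT = len(LEAD) * len(VOWEL) * len(TRAIL)  # 11172 syllables


def hangul_name(code_point):
    """Derive the name of a precomposed Hangul syllable (illustrative)."""
    index = code_point - S_BASE
    if not (0 <= index < S_COUNT):
        raise ValueError("not a precomposed Hangul syllable")
    # Decompose the index into lead, vowel and trail positions.
    lead, rest = divmod(index, len(VOWEL) * len(TRAIL))
    vowel, trail = divmod(rest, len(TRAIL))
    return "HANGUL SYLLABLE " + LEAD[lead] + VOWEL[vowel] + TRAIL[trail]


first = hangul_name(0xAC00)
han = hangul_name(0xD55C)
```

A quick sanity check is to compare the result against unicodedata.name() for the same character.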
* Update to Unicode 3.2 database.Martin v. Löwis2002-10-181-3/+3
|
* Replace DL_IMPORT with PyMODINIT_FUNC and remove "/export:init..." linkMark Hammond2002-07-231-1/+1
| | | | | command line for Windows builds. This should allow MSVC to import and build the Python MSVC6 project files without error.
* Patch #568124: Add doc string macros.Martin v. Löwis2002-06-131-1/+1
|
* _Py prefix is verboten for static entry pointsAndrew MacIntyre2002-06-131-4/+4
|
* Remove direct manipulation of the module dict.Fred Drake2002-04-031-9/+3
|
* OS/2 EMX port changes (Modules part of patch #450267):Andrew MacIntyre2002-03-031-4/+4
| | | | Modules/
| | | |   _hotshot.c
| | | |   dbmmodule.c
| | | |   fcntlmodule.c
| | | |   main.c
| | | |   pwdmodule.c
| | | |   readline.c
| | | |   selectmodule.c
| | | |   signalmodule.c
| | | |   termios.c
| | | |   timemodule.c
| | | |   unicodedata.c
* unicodedata_decomposition(): sprintf -> PyOS_snprintf.Tim Peters2001-11-301-2/+4
|
* Kill a warning on the SGI compiler.Fred Drake2001-07-191-1/+1
| | | | This is part of SF patch #434992.
* Be a bit more strict in setting up the export of the C API for thisFred Drake2001-03-031-6/+6
| | | | | module; do not attempt to insert the API object into the module dict if there was an error creating it.
* renamed internal functions to avoid name clashes under OpenVMSFredrik Lundh2001-02-181-17/+18
| | | | (fixes bug #132815)
* stupid typo (for some reason, this only caused problems on OpenVMS).Fredrik Lundh2001-02-181-1/+1
|
* Move uchhash functionality into unicodedata (after the recentFredrik Lundh2001-01-241-19/+226
| | | | | crop of changes, the files are small enough to do this). Also adds "name" and "lookup" functions to unicodedata.
* Unicode nits: Don't include unicodedatabase.h no more. And make sureFredrik Lundh2001-01-211-1/+0
| | | | to build *all* tables in makeunicodedata.py.
* compress unicode decomposition tables (this saves another 55k)Fredrik Lundh2001-01-211-79/+94
|
* unicode database compression, step 2:Fredrik Lundh2000-09-251-28/+18
| | | | - fixed attributions
| | | | - moved decomposition data to a separate table, in preparation for step 3 (which won't happen before 2.0 final, promise!)
| | | | - use relative paths in the generator script
| | | | I have a lot more stuff in the works for 2.1, but let's leave that for another day...