Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | #7643: Unicode codepoints VT (0x0B) and FF (0x0C) are linebreaks according ↵ | Florent Xicluna | 2010-03-30 | 1 | -1/+12 |
| | | | | to Unicode Standard Annex #14. | ||||
* | Issue #8024: Update the Unicode database to 5.2 | Florent Xicluna | 2010-03-18 | 1 | -2/+2 |
| | |||||
* | Issue #1054943: Fix unicodedata.normalize('NFC', text) for the Public Review | Victor Stinner | 2010-03-04 | 1 | -0/+5 |
| | | | | | | Issue #29. PR #29 was released in february 2004! | ||||
* | use assert[Not]In where appropriate | Ezio Melotti | 2010-01-23 | 1 | -1/+1 |
| | |||||
* | #1571184: makeunicodedata.py now generates the functions _PyUnicode_ToNumeric, | Amaury Forgeot d'Arc | 2009-10-06 | 1 | -2/+3 |
| | | | | | | _PyUnicode_IsLinebreak and _PyUnicode_IsWhitespace. It now also parses the Unihan.txt for numeric values. | ||||
* | convert usage of fail* to assert* | Benjamin Peterson | 2009-06-30 | 1 | -7/+7 |
| | |||||
* | Fix typo. | Walter Dörwald | 2009-04-26 | 1 | -1/+1 |
| | |||||
* | Issue #4971: Fix titlecase for characters that are their own | Martin v. Löwis | 2009-04-26 | 1 | -1/+6 |
| | | | | titlecase, but not their own uppercase. | ||||
* | Issue #5828 (Invalid behavior of unicode.lower): Fixed bogus logic in | Walter Dörwald | 2009-04-25 | 1 | -1/+14 |
| | | | | | makeunicodedata.py and regenerated the Unicode database (This fixes u'\u1d79'.lower() == '\x00'). | ||||
* | don't segfault when \N escapes are used and unicodedata fails to load | Benjamin Peterson | 2008-11-21 | 1 | -2/+25 |
| | | | | Fixes #4367 | ||||
* | Issue #3811: The Unicode database was updated to 5.1. | Martin v. Löwis | 2008-09-10 | 1 | -2/+12 |
| | | | | Reviewed by Fredrik Lundh and Marc-Andre Lemburg. | ||||
* | Change all functions that expect one unicode character to accept a pair of | Walter Dörwald | 2008-06-02 | 1 | -0/+8 |
| | | | | surrogates in narrow builds. Fixes issue #1706460. | ||||
* | Bug #1704793: Return UTF-16 pair if unicodedata.lookup cannot | Martin v. Löwis | 2007-07-28 | 1 | -0/+3 |
| | | | | represent the result in a single character. | ||||
* | Patch 1494554: Update numeric properties to Unicode 4.1. | Martin v. Löwis | 2006-05-27 | 1 | -2/+2 |
| | |||||
* | In stdlib, use hashlib instead of deprecated md5 and sha modules. | Georg Brandl | 2006-04-30 | 1 | -3/+3 |
| | |||||
* | Update Unicode database to Unicode 4.1. | Martin v. Löwis | 2006-03-09 | 1 | -2/+2 |
| | |||||
* | Fix all wrong instances of "it's". | Georg Brandl | 2005-07-22 | 1 | -2/+2 |
| | |||||
* | SF #989185: Drop unicode.iswide() and unicode.width() and add | Hye-Shik Chang | 2004-08-04 | 1 | -0/+11 |
| | | | | | | | | | | | | unicodedata.east_asian_width(). You can still implement your own simple width() function using it like this: def width(u): w = 0 for c in unicodedata.normalize('NFC', u): cwidth = unicodedata.east_asian_width(c) if cwidth in ('W', 'F'): w += 2 else: w += 1 return w | ||||
* | Special case normalization of empty strings. Fixes #924361. | Martin v. Löwis | 2004-04-17 | 1 | -0/+1 |
| | | | | Backported to 2.3. | ||||
* | Combine the functionality of test_support.run_unittest() | Walter Dörwald | 2003-05-01 | 1 | -5/+5 |
| | | | | | | | | | | and test_support.run_classtests() into run_unittest() and use it wherever possible. Also don't use "from test.test_support import ...", but "from test import test_support" in a few spots. From SF patch #662807. | ||||
* | Whitespace normalization. | Tim Peters | 2003-03-07 | 1 | -1/+1 |
| | |||||
* | Port test_ucn and test_unicodedata to PyUnit. Add a few tests for error | Walter Dörwald | 2003-02-26 | 1 | -111/+199 |
| | | | | | cases increasing coverage in unicodedata.c from 87% to 95% (when the normalization tests are run). From SF patch #662807. | ||||
* | Get rid of relative imports in all unittests. Now anything that | Barry Warsaw | 2002-07-23 | 1 | -1/+1 |
| | | | | | | | | | | | imports e.g. test_support must do so using an absolute package name such as "import test.test_support" or "from test import test_support". This also updates the README in Lib/test, and gets rid of the duplicate data dirctory in Lib/test/data (replaced by Lib/email/test/data). Now Tim and Jack can have at it. :) | ||||
* | This patch removes all uses of "assert" in the regression test suite | Marc-André Lemburg | 2001-01-17 | 1 | -26/+27 |
| | | | | | | | and replaces them with a new API verify(). As a result the regression suite will also perform its tests in optimization mode. Written by Marc-Andre Lemburg. Copyright assigned to Guido van Rossum. | ||||
* | Make reindent.py happy (convert everything to 4-space indents!). | Fred Drake | 2000-10-23 | 1 | -5/+5 |
| | |||||
* | Fixed encoding to use an endianness independent format. | Marc-André Lemburg | 2000-09-27 | 1 | -1/+3 |
| | |||||
* | Added test suite for the complete Unicode database. The test previously | Marc-André Lemburg | 2000-09-26 | 1 | -7/+79 |
| | | | | only tested a few cases. | ||||
* | Marc-Andre Lemburg: | Guido van Rossum | 2000-03-28 | 1 | -0/+50 |
The attached patch set includes a workaround to get Python with Unicode compile on BSDI 4.x (courtesy Thomas Wouters; the cause is a bug in the BSDI wchar.h header file) and Python interfaces for the MBCS codec donated by Mark Hammond. Also included are some minor corrections w/r to the docs of the new "es" and "es#" parser markers (use PyMem_Free() instead of free(); thanks to Mark Hammond for finding these). The unicodedata tests are now in a separate file (test_unicodedata.py) to avoid problems if the module cannot be found. |