summaryrefslogtreecommitdiffstats
path: root/Lib/test/test_unicodedata.py
Commit message (Collapse)AuthorAgeFilesLines
* #1571184: makeunicodedata.py now generates the functions _PyUnicode_ToNumeric,Amaury Forgeot d'Arc2009-10-061-2/+3
| | | | | | _PyUnicode_IsLinebreak and _PyUnicode_IsWhitespace. It now also parses the Unihan.txt for numeric values.
* convert usage of fail* to assert*Benjamin Peterson2009-06-301-7/+7
|
* Fix typo.Walter Dörwald2009-04-261-1/+1
|
* Issue #4971: Fix titlecase for characters that are their ownMartin v. Löwis2009-04-261-1/+6
| | | | titlecase, but not their own uppercase.
* Issue #5828 (Invalid behavior of unicode.lower): Fixed bogus logic inWalter Dörwald2009-04-251-1/+14
| | | | | makeunicodedata.py and regenerated the Unicode database (This fixes u'\u1d79'.lower() == '\x00').
* don't segfault when \N escapes are used and unicodedata fails to loadBenjamin Peterson2008-11-211-2/+25
| | | | Fixes #4367
* Issue #3811: The Unicode database was updated to 5.1.Martin v. Löwis2008-09-101-2/+12
| | | | Reviewed by Fredrik Lundh and Marc-Andre Lemburg.
* Change all functions that expect one unicode character to accept a pair ofWalter Dörwald2008-06-021-0/+8
| | | | surrogates in narrow builds. Fixes issue #1706460.
* Bug #1704793: Return UTF-16 pair if unicodedata.lookup cannotMartin v. Löwis2007-07-281-0/+3
| | | | represent the result in a single character.
* Patch 1494554: Update numeric properties to Unicode 4.1.Martin v. Löwis2006-05-271-2/+2
|
* In stdlib, use hashlib instead of deprecated md5 and sha modules.Georg Brandl2006-04-301-3/+3
|
* Update Unicode database to Unicode 4.1.Martin v. Löwis2006-03-091-2/+2
|
* Fix all wrong instances of "it's".Georg Brandl2005-07-221-2/+2
|
* SF #989185: Drop unicode.iswide() and unicode.width() and addHye-Shik Chang2004-08-041-0/+11
| | | | | | | | | | | | unicodedata.east_asian_width(). You can still implement your own simple width() function using it like this: def width(u): w = 0 for c in unicodedata.normalize('NFC', u): cwidth = unicodedata.east_asian_width(c) if cwidth in ('W', 'F'): w += 2 else: w += 1 return w
* Special case normalization of empty strings. Fixes #924361.Martin v. Löwis2004-04-171-0/+1
| | | | Backported to 2.3.
* Combine the functionality of test_support.run_unittest()Walter Dörwald2003-05-011-5/+5
| | | | | | | | | | and test_support.run_classtests() into run_unittest() and use it wherever possible. Also don't use "from test.test_support import ...", but "from test import test_support" in a few spots. From SF patch #662807.
* Whitespace normalization.Tim Peters2003-03-071-1/+1
|
* Port test_ucn and test_unicodedata to PyUnit. Add a few tests for errorWalter Dörwald2003-02-261-111/+199
| | | | | cases increasing coverage in unicodedata.c from 87% to 95% (when the normalization tests are run). From SF patch #662807.
* Get rid of relative imports in all unittests. Now anything thatBarry Warsaw2002-07-231-1/+1
| | | | | | | | | | | imports e.g. test_support must do so using an absolute package name such as "import test.test_support" or "from test import test_support". This also updates the README in Lib/test, and gets rid of the duplicate data dirctory in Lib/test/data (replaced by Lib/email/test/data). Now Tim and Jack can have at it. :)
* This patch removes all uses of "assert" in the regression test suiteMarc-André Lemburg2001-01-171-26/+27
| | | | | | | and replaces them with a new API verify(). As a result the regression suite will also perform its tests in optimization mode. Written by Marc-Andre Lemburg. Copyright assigned to Guido van Rossum.
* Make reindent.py happy (convert everything to 4-space indents!).Fred Drake2000-10-231-5/+5
|
* Fixed encoding to use an endianness independent format.Marc-André Lemburg2000-09-271-1/+3
|
* Added test suite for the complete Unicode database. The test previouslyMarc-André Lemburg2000-09-261-7/+79
| | | | only tested a few cases.
* Marc-Andre Lemburg:Guido van Rossum2000-03-281-0/+50
The attached patch set includes a workaround to get Python with Unicode compile on BSDI 4.x (courtesy Thomas Wouters; the cause is a bug in the BSDI wchar.h header file) and Python interfaces for the MBCS codec donated by Mark Hammond. Also included are some minor corrections w/r to the docs of the new "es" and "es#" parser markers (use PyMem_Free() instead of free(); thanks to Mark Hammond for finding these). The unicodedata tests are now in a separate file (test_unicodedata.py) to avoid problems if the module cannot be found.