cpython.git - https://github.com/python/cpython.git

	Commit message	Author	Age	Files	Lines
*	upgrade to Unicode 8.0.0	Benjamin Peterson	2015-06-27	1	-791/+1641
*	upgrade to unicode 7.0.0	Benjamin Peterson	2014-07-06	1	-1121/+1591
*	upgrade unicode db to 6.3.0 (closes #19221)	Benjamin Peterson	2013-10-10	1	-8/+6
*	Issue #18722: Remove uses of the "register" keyword in C code.	Antoine Pitrou	2013-08-13	1	-2/+2
*	upgrade to UCD 6.2	Benjamin Peterson	2012-09-29	1	-3/+10
*	update to Unicode 6.1	Benjamin Peterson	2012-02-21	1	-1166/+1273
*	delta encoding of upper/lower/title makes a glorious return (#12736)	Benjamin Peterson	2012-01-16	1	-3443/+1336
*	add str.casefold() (closes #13752)	Benjamin Peterson	2012-01-14	1	-117/+380
*	use full unicode mappings for upper/lower/title case (#12736)	Benjamin Peterson	2012-01-11	1	-1513/+4174
	Also broaden the category of characters that count as lowercase/uppercase.
*	Upgrade to Unicode 6.0.0.	Martin v. Löwis	2010-10-11	1	-702/+804
	makeunicodedata.py: download all data files from unicode.org, switch to extracting Unihan data from zip file. Read linebreakprops and derivednormalizationprops even for old versions, even though they are not used in delta records.
	test_unicode.py: U+11000 is now assigned, use U+14000 instead.
*	#9210: remove --with-wctype-functions configure option.	Amaury Forgeot d'Arc	2010-09-12	1	-4/+0
	The internal unicode database is now always used. (after 5 years: see http://mail.python.org/pipermail/python-dev/2004-December/050193.html)
*	#5127: Even on narrow unicode builds, the C functions that access the Unicode	Amaury Forgeot d'Arc	2010-08-18	1	-111/+3
	Database (Py_UNICODE_TOLOWER, Py_UNICODE_ISDECIMAL, and others) now accept and return characters from the full Unicode range (Py_UCS4).
	The differences from Python code are few:
	- unicodedata.numeric(), unicodedata.decimal() and unicodedata.digit() now return the correct value for large code points
	- repr() may consider more characters as printable.
*	Merged revisions 79494,79496 via svnmerge from	Florent Xicluna	2010-03-30	1	-3/+6
	svn+ssh://pythondev@svn.python.org/python/trunk
	r79494 | florent.xicluna | 2010-03-30 10:24:06 +0200 (Tue, 30 Mar 2010) | 2 lines
	#7643: Unicode codepoints VT (0x0B) and FF (0x0C) are linebreaks according to Unicode Standard Annex #14.
	r79496 | florent.xicluna | 2010-03-30 18:29:03 +0200 (Tue, 30 Mar 2010) | 2 lines
	Highlight the change of behavior related to r79494. Now VT and FF are linebreaks.
*	Fixed a failure in test_bigmem.	Florent Xicluna	2010-03-19	1	-961/+1192
	Merged revision 79059 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk
	r79059 | florent.xicluna | 2010-03-18 22:50:06 +0100 (Thu, 18 Mar 2010) | 2 lines
	Issue #8024: Update the Unicode database to 5.2
*	Revert Unicode UCD 5.2 upgrade in 3.x. It broke repr() for unicode objects, and gave failures in test_bigmem.	Florent Xicluna	2010-03-19	1	-1192/+961
	Revert 79062, 79065 and 79083.
*	Merged revisions 79059 via svnmerge from	Florent Xicluna	2010-03-18	1	-961/+1192
	svn+ssh://pythondev@svn.python.org/python/trunk
	r79059 | florent.xicluna | 2010-03-18 22:50:06 +0100 (Thu, 18 Mar 2010) | 2 lines
	Issue #8024: Update the Unicode database to 5.2
*	Merged revisions 75396 via svnmerge from	Amaury Forgeot d'Arc	2009-10-13	1	-109/+109
	svn+ssh://pythondev@svn.python.org/python/trunk
	r75396 | amaury.forgeotdarc | 2009-10-13 23:29:34 +0200 (Tue, 13 Oct 2009) | 3 lines
	#7112: Fix compilation warning in unicodetype_db.h
	makeunicodedata now generates double literals
*	Merged revisions 75272-75273 via svnmerge from	Amaury Forgeot d'Arc	2009-10-06	1	-1217/+2914
	svn+ssh://pythondev@svn.python.org/python/trunk
	r75272 | amaury.forgeotdarc | 2009-10-06 21:56:32 +0200 (Tue, 06 Oct 2009) | 5 lines
	#1571184: makeunicodedata.py now generates the functions _PyUnicode_ToNumeric, _PyUnicode_IsLinebreak and _PyUnicode_IsWhitespace. It now also parses the Unihan.txt for numeric values.
	r75273 | amaury.forgeotdarc | 2009-10-06 22:02:09 +0200 (Tue, 06 Oct 2009) | 2 lines
	Add Anders Chrigstrom to Misc/ACKS for his work on unicodedata.
*	Merged revisions 71894 via svnmerge from	Walter Dörwald	2009-04-25	1	-2/+2
	svn+ssh://pythondev@svn.python.org/python/trunk
	r71894 | walter.doerwald | 2009-04-25 16:03:16 +0200 (Sat, 25 Apr 2009) | 4 lines
	Issue #5828 (Invalid behavior of unicode.lower): Fixed bogus logic in makeunicodedata.py and regenerated the Unicode database (This fixes u'\u1d79'.lower() == '\x00').
*	fix isprintable() on space characters #5126	Benjamin Peterson	2009-03-26	1	-1027/+1029
*	Merged revisions 66362 via svnmerge from	Martin v. Löwis	2008-09-10	1	-777/+938
	svn+ssh://pythondev@svn.python.org/python/trunk
	r66362 | martin.v.loewis | 2008-09-10 15:38:12 +0200 (Wed, 10 Sep 2008) | 3 lines
	Issue #3811: The Unicode database was updated to 5.1. Reviewed by Fredrik Lundh and Marc-Andre Lemburg.
*	Issue #3282: str.isprintable() should return False for undefined Unicode characters.	Georg Brandl	2008-07-04	1	-146/+143
*	#2630: Implement PEP 3138.	Georg Brandl	2008-06-11	1	-946/+961
	The repr() of a string now contains printable Unicode characters unescaped. The new ascii() builtin can be used to get a repr() with only ASCII characters in it. PEP and patch were written by Atsuo Ishimoto.
*	Revert r63934 -- it was mixing two patches.	Georg Brandl	2008-06-04	1	-959/+947
*	Remove meaning of -ttt, but still accept -t option on cmdline for compatibility.	Georg Brandl	2008-06-04	1	-947/+959
*	Add XID_Start and XID_Continue properties to unicodectype.	Martin v. Löwis	2007-08-14	1	-886/+930
*	Update Unicode database to Unicode 4.1.	Martin v. Löwis	2006-03-09	1	-650/+829
*	SF #989185: Drop unicode.iswide() and unicode.width() and add	Hye-Shik Chang	2004-08-04	1	-360/+299
	unicodedata.east_asian_width(). You can still implement your own simple width() function using it like this:
	    def width(u):
	        w = 0
	        for c in unicodedata.normalize('NFC', u):
	            cwidth = unicodedata.east_asian_width(c)
	            if cwidth in ('W', 'F'):
	                w += 2
	            else:
	                w += 1
	        return w
*	- SF #962502: Add two more methods for unicode type; width() and	Hye-Shik Chang	2004-06-02	1	-423/+484
	iswide() for east asian width manipulation. (Inspired by David Goodger, Reviewed by Martin v. Loewis)
	- Move _PyUnicode_TypeRecord.flags to the end of the struct so that no padding is added for UCS-4 builds. (Suggested by Martin v. Loewis)
*	Add unidata_version. Bump generator version number.	Martin v. Löwis	2002-11-25	1	-1/+1
*	Regenerate from Unicode 3.2.0 to include all First/Last ranges.	Martin v. Löwis	2002-11-24	1	-108/+77
*	Update to Unicode 3.2 database.	Martin v. Löwis	2002-10-18	1	-415/+959
*	forgot to check in the new makeunicodedata.py script	Fredrik Lundh	2001-01-21	1	-1/+1
*	Added 38,642 missing characters to the Unicode database (first-last	Fredrik Lundh	2000-11-03	1	-87/+86
	ranges) -- but thanks to the 2.0 compression scheme, this doesn't add a single byte to the resulting binaries (!) Closes bug #117524
*	- don't set the titlecase flag for uppercase letters (sorry, tim)	Fredrik Lundh	2000-09-25	1	-40/+40
*	unicode database compression, step 3:	Fredrik Lundh	2000-09-25	1	-0/+578
	- use unidb compression for the unicodectype module. smaller, faster, and slightly more portable... (note: this commit doesn't include the unicodectype.c file itself; I'm still waiting for the reviewers...)
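
Several entries in this log describe behavior that is visible from Python code: str.casefold(), the ascii() builtin from PEP 3138, str.isprintable(), VT and FF counting as line breaks, and unicodedata.east_asian_width(). The sketch below is not part of the log. It is a minimal illustration, assuming a current Python 3 interpreter, and the sample strings are arbitrary choices. The width() helper follows the one quoted in the 2004-08-04 entry.

```python
import unicodedata


def width(text):
    """Count display columns: wide ('W') and fullwidth ('F') characters take 2."""
    total = 0
    for ch in unicodedata.normalize('NFC', text):
        if unicodedata.east_asian_width(ch) in ('W', 'F'):
            total += 2
        else:
            total += 1
    return total


if __name__ == "__main__":
    # Version of the Unicode database this interpreter was built with.
    print(unicodedata.unidata_version)

    print(width("abc"))                 # 3
    print(width("日本語"))               # 6

    # Full case folding (added alongside str.casefold()).
    print("Straße".casefold())          # strasse

    # PEP 3138: repr() keeps printable non-ASCII, ascii() escapes it.
    print(repr("é"), ascii("é"))        # 'é' '\xe9'

    # ASCII space is printable; an unassigned code point is not.
    print(" ".isprintable(), "\uffff".isprintable())   # True False

    # VT (0x0B) and FF (0x0C) are treated as line breaks.
    print("a\x0bb\x0cc".splitlines())   # ['a', 'b', 'c']
```

The exact results for other characters depend on the Unicode version the interpreter ships, which is what most commits in this log change.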