path: root/Tools/unicode
Commit message (author, date, files changed, lines -removed/+added)
* Issue #17202: Add .bat to .hgeol to force them to CRLF. (Zachary Ware, 2015-04-13, 1 file, -7/+7)
|     Using LF can cause a script to fail if it tries to use a label that is split across 512-byte blocks. Who knows why.
* Issue #23181: More "codepoint" -> "code point". (Serhiy Storchaka, 2015-01-18, 1 file, -1/+1)
|
* #18176: Change generic UCD PropList link to version-specific link. (R David Murray, 2014-10-10, 1 file, -1/+1)
|
* #18176: fix another reference and add it to the makeunicodedata comment. (R David Murray, 2014-10-09, 1 file, -0/+1)
|
* #18176: updated stdtypes UCD link, added reminder to makeunicodedata. (R David Murray, 2014-10-09, 1 file, -0/+4)
|     Patch by Alexander Belopolsky.
* Issue #19936: Added executable bits or shebang lines to Python scripts which require them. (Serhiy Storchaka, 2014-01-16, 1 file, -0/+0)
|\
| |     Disable executable bits and shebang lines in test and benchmark files in order to prevent using a random system python, and in source files of modules which don't provide a command line interface. Fixed shebang lines in the unittestgui and checkpip scripts.
| * Issue #19936: Added executable bits or shebang lines to Python scripts which require them. (Serhiy Storchaka, 2014-01-16, 1 file, -0/+0)
| |     Disable executable bits and shebang lines in test and benchmark files in order to prevent using a random system python, and in source files of modules which don't provide a command line interface. Fixed shebang line to use python3 executable in the unittestgui script.
* | #1097797: add the original mapping file (Andrew Kuchling, 2013-11-11, 1 file, -0/+258)
| |
* | Fix some PEP8-formatting problems in the generated code (Andrew Kuchling, 2013-11-11, 1 file, -9/+9)
| |
* | upgrade unicode db to 6.3.0 (closes #19221) (Benjamin Peterson, 2013-10-10, 1 file, -2/+2)
| |
* | #18803: merge with 3.3. (Ezio Melotti, 2013-08-25, 1 file, -1/+1)
|\ \
| |/
| * #18803: fix more typos. Patch by Févry Thibault. (Ezio Melotti, 2013-08-25, 1 file, -1/+1)
| |
* | Issue #18722: Remove uses of the "register" keyword in C code. (Antoine Pitrou, 2013-08-13, 1 file, -2/+2)
| |
* | Issue #15239: Make mkstringprep.py work again on Python 3. (Serhiy Storchaka, 2013-06-09, 1 file, -17/+23)
|\ \
| |/
| * Issue #15239: Make mkstringprep.py work again on Python 3. (Serhiy Storchaka, 2013-06-09, 1 file, -17/+23)
| |
* | Issue #15378: Fix Tools/unicode/comparecodecs.py. Patch by Serhiy Storchaka. (Antoine Pitrou, 2012-10-17, 1 file, -1/+1)
|\ \
| |/
| * Issue #15378: Fix Tools/unicode/comparecodecs.py. Patch by Serhiy Storchaka. (Antoine Pitrou, 2012-10-17, 1 file, -1/+1)
| |\
| | * Issue #15378: Fix Tools/unicode/comparecodecs.py. Patch by Serhiy Storchaka. (Antoine Pitrou, 2012-10-17, 1 file, -1/+1)
| | |
* | | upgrade to UCD 6.2 (Benjamin Peterson, 2012-09-29, 1 file, -1/+1)
|/ /
* | Some cleanup in the Tools directory. (Florent Xicluna, 2012-07-07, 1 file, -1/+0)
| |
* | Issue #14874: Restore charmap decoding speed to pre-PEP 393 levels. (Antoine Pitrou, 2012-06-16, 1 file, -3/+6)
| |     Patch by Serhiy Storchaka.
* | update to Unicode 6.1 (Benjamin Peterson, 2012-02-21, 1 file, -4/+5)
| |
* | delta encoding of upper/lower/title makes a glorious return (#12736) (Benjamin Peterson, 2012-01-16, 1 file, -0/+7)
| |
* | add str.casefold() (closes #13752) (Benjamin Peterson, 2012-01-14, 1 file, -15/+35)
| |
* | use full unicode mappings for upper/lower/title case (#12736) (Benjamin Peterson, 2012-01-11, 1 file, -32/+69)
| |     Also broaden the category of characters that count as lowercase/uppercase.
* | #12753: Add support for Unicode name aliases and named sequences. (Ezio Melotti, 2011-10-21, 1 file, -2/+100)
| |
* | #13054: fix usage of sys.maxunicode after PEP-393. (Ezio Melotti, 2011-10-04, 1 file, -1/+1)
| |
* | Fix ResourceWarnings in makeunicodedata.py. (Ezio Melotti, 2011-09-30, 1 file, -81/+88)
|/
* #11565: Merge with 3.1. (Ezio Melotti, 2011-03-16, 1 file, -1/+1)
|\
| * #11565: Fix several typos. Patch by Piotr Kasprzyk. (Ezio Melotti, 2011-03-16, 1 file, -1/+1)
| |
* | Add updated .hgeol file and fix newlines in the 3.2 branch. (Georg Brandl, 2011-03-05, 1 file, -7/+7)
| |
* | Issue #10552: Partially fixed a sort error in Tools/unicode/gencodec.py (Alexander Belopolsky, 2010-11-30, 1 file, -7/+10)
| |
* | Issue #10459: Update CJK character names to Unicode 6.0. (Martin v. Löwis, 2010-11-22, 1 file, -2/+19)
| |
* | Upgrade to Unicode 6.0.0. (Martin v. Löwis, 2010-10-11, 1 file, -58/+77)
| |     makeunicodedata.py: download all data files from unicode.org, switch to extracting Unihan data from zip file.
| |     Read linebreakprops and derivednormalizationprops even for old versions, even though they are not used in delta records.
| |     test_unicode.py: U+11000 is now assigned, use U+14000 instead.
* | #9210: remove --with-wctype-functions configure option. (Amaury Forgeot d'Arc, 2010-09-12, 1 file, -4/+0)
| |     The internal unicode database is now always used.
| |     (after 5 years: see http://mail.python.org/pipermail/python-dev/2004-December/050193.html )
* | #5127: Even on narrow unicode builds, the C functions that access the Unicode Database (Py_UNICODE_TOLOWER, Py_UNICODE_ISDECIMAL, and others) now accept and return characters from the full Unicode range (Py_UCS4). (Amaury Forgeot d'Arc, 2010-08-18, 1 file, -37/+4)
| |     The differences from Python code are few:
| |     - unicodedata.numeric(), unicodedata.decimal() and unicodedata.digit() now return the correct value for large code points
| |     - repr() may consider more characters as printable.
* | Merged revisions 79494,79496 via svnmerge from (Florent Xicluna, 2010-03-30, 1 file, -5/+24)
| |     svn+ssh://pythondev@svn.python.org/python/trunk
| |     ........
| |     r79494 | florent.xicluna | 2010-03-30 10:24:06 +0200 (mar, 30 mar 2010) | 2 lines
| |     #7643: Unicode codepoints VT (0x0B) and FF (0x0C) are linebreaks according to Unicode Standard Annex #14.
| |     ........
| |     r79496 | florent.xicluna | 2010-03-30 18:29:03 +0200 (mar, 30 mar 2010) | 2 lines
| |     Highlight the change of behavior related to r79494. Now VT and FF are linebreaks.
| |     ........
* | Merged revisions 78982,78986 via svnmerge from (Florent Xicluna, 2010-03-19, 2 files, -6/+3)
| |     svn+ssh://pythondev@svn.python.org/python/trunk
| |     ........
| |     r78982 | florent.xicluna | 2010-03-15 15:00:58 +0100 (lun, 15 mar 2010) | 2 lines
| |     Remove py3k deprecation warnings from these Unicode tools.
| |     ........
| |     r78986 | florent.xicluna | 2010-03-15 19:08:58 +0100 (lun, 15 mar 2010) | 3 lines
| |     Issue #7783 and #7787: open_urlresource invalidates the outdated files from the local cache. Use this feature to fix test_normalization.
| |     ........
* | Fixed a failure in test_bigmem. (Florent Xicluna, 2010-03-19, 1 file, -1/+1)
| |     Merged revision 79059 via svnmerge from
| |     svn+ssh://pythondev@svn.python.org/python/trunk
| |     ........
| |     r79059 | florent.xicluna | 2010-03-18 22:50:06 +0100 (jeu, 18 mar 2010) | 2 lines
| |     Issue #8024: Update the Unicode database to 5.2
| |     ........
* | Revert Unicode UCD 5.2 upgrade in 3.x. It broke repr() for unicode objects, and gave failures in test_bigmem. (Florent Xicluna, 2010-03-19, 1 file, -1/+1)
| |     Revert 79062, 79065 and 79083.
* | Missing update from previous changeset r79062. (Florent Xicluna, 2010-03-18, 1 file, -1/+1)
| |
* | convert shebang lines: python -> python3 (Benjamin Peterson, 2010-03-11, 1 file, -1/+1)
| |
* | Merged revisions 78806 via svnmerge from (Benjamin Peterson, 2010-03-08, 1 file, -61/+61)
| |     svn+ssh://pythondev@svn.python.org/python/trunk
| |     ........
| |     r78806 | benjamin.peterson | 2010-03-08 16:15:11 -0600 (Mon, 08 Mar 2010) | 1 line
| |     set svn:eol-style on various files
| |     ........
* | Merged revisions 75396 via svnmerge from (Amaury Forgeot d'Arc, 2009-10-13, 1 file, -0/+5)
| |     svn+ssh://pythondev@svn.python.org/python/trunk
| |     ........
| |     r75396 | amaury.forgeotdarc | 2009-10-13 23:29:34 +0200 (mar., 13 oct. 2009) | 3 lines
| |     #7112: Fix compilation warning in unicodetype_db.h
| |     makeunicodedata now generates double literals
| |     ........
* | Merged revisions 75272-75273 via svnmerge from (Amaury Forgeot d'Arc, 2009-10-06, 1 file, -7/+121)
| |     svn+ssh://pythondev@svn.python.org/python/trunk
| |     ........
| |     r75272 | amaury.forgeotdarc | 2009-10-06 21:56:32 +0200 (mar., 06 oct. 2009) | 5 lines
| |     #1571184: makeunicodedata.py now generates the functions _PyUnicode_ToNumeric, _PyUnicode_IsLinebreak and _PyUnicode_IsWhitespace.
| |     It now also parses the Unihan.txt for numeric values.
| |     ........
| |     r75273 | amaury.forgeotdarc | 2009-10-06 22:02:09 +0200 (mar., 06 oct. 2009) | 2 lines
| |     Add Anders Chrigstrom to Misc/ACKS for his work on unicodedata.
| |     ........
* | Oops, really pass a bytes string to the ctypes function. (Amaury Forgeot d'Arc, 2009-07-13, 1 file, -1/+1)
| |
* | Merged revisions 74000-74001 via svnmerge from (Amaury Forgeot d'Arc, 2009-07-13, 3 files, -2/+70)
|/
|     svn+ssh://pythondev@svn.python.org/python/trunk
|     ........
|     r74000 | amaury.forgeotdarc | 2009-07-13 22:01:11 +0200 (lun., 13 juil. 2009) | 4 lines
|     #1616979: Add the cp720 (Arabic DOS) encoding.
|     Since there is no official mapping file from unicode.org, the codec file is generated on Windows with the new genwincodec.py script.
|     ........
|     r74001 | amaury.forgeotdarc | 2009-07-13 22:03:21 +0200 (lun., 13 juil. 2009) | 2 lines
|     NEWS entry for r74000.
|     ........
* Merged revisions 72054 via svnmerge from (Antoine Pitrou, 2009-04-27, 1 file, -5/+32)
|     svn+ssh://pythondev@svn.python.org/python/trunk
|     ........
|     r72054 | antoine.pitrou | 2009-04-27 23:53:26 +0200 (lun., 27 avril 2009) | 5 lines
|     Issue #1734234: Massively speed up `unicodedata.normalize()` when the string is already in normalized form, by performing a quick check beforehand.
|     Original patch by Rauli Ruohonen.
|     ........
* Merged revisions 71894 via svnmerge from (Walter Dörwald, 2009-04-25, 1 file, -22/+21)
|     svn+ssh://pythondev@svn.python.org/python/trunk
|     ........
|     r71894 | walter.doerwald | 2009-04-25 16:03:16 +0200 (Sa, 25 Apr 2009) | 4 lines
|     Issue #5828 (Invalid behavior of unicode.lower): Fixed bogus logic in makeunicodedata.py and regenerated the Unicode database (This fixes u'\u1d79'.lower() == '\x00').
|     ........
* fix isprintable() on space characters #5126 (Benjamin Peterson, 2009-03-26, 1 file, -1/+1)
|