Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | bpo-47243: Duplicate entry in 'Objects/unicodetype_db.h' (GH-32376) | LiarPrincess | 2022-09-28 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | Fix for duplicate 1st entry in 'Objects/unicodetype_db.h': ```c /* a list of unique character type descriptors */ const _PyUnicode_TypeRecord _PyUnicode_TypeRecords[] = { {0, 0, 0, 0, 0, 0}, {0, 0, 0, 0, 0, 0}, <--- HERE {0, 0, 0, 0, 0, 32}, {0, 0, 0, 0, 0, 48}, … ``` https://bugs.python.org/issue47243 Automerge-Triggered-By: GH:isidentical | ||||
* | closes gh-96734: Update to Unicode 15.0.0. (GH-96809) | Benjamin Peterson | 2022-09-13 | 1 | -2/+3 |
| | |||||
* | GH-96172 fix unicodedata.east_asian_width being wrong on unassigned code ↵ | Carl Friedrich Bolz-Tereick | 2022-08-26 | 1 | -7/+25 |
| | | | | points (#96207) | ||||
* | gh-96019: Fix caching of decompositions in makeunicodedata (GH-96020) | Carl Friedrich Bolz-Tereick | 2022-08-19 | 1 | -3/+7 |
| | |||||
* | closes bpo-45190: Update Unicode data to version 14.0.0. (GH-28336) | Benjamin Peterson | 2021-09-14 | 1 | -4/+4 |
| | |||||
* | Update some www.unicode.org URLs to use HTTPS. (GH-18912) | Benjamin Peterson | 2020-03-11 | 1 | -2/+2 |
| | |||||
* | closes bpo-39926: Update Unicode to 13.0.0. (GH-18910) | Benjamin Peterson | 2020-03-11 | 1 | -4/+5 |
| | |||||
* | bpo-37760: Convert from length-18 lists to a dataclass, in makeunicodedata. ↵ | Greg Price | 2019-09-12 | 1 | -62/+88 |
| | | | | | | | | | (GH-15265) Now the fields have names! Much easier to keep straight as a reader than the elements of an 18-tuple. Runs about 10-15% slower: from 10.8s to 12.3s, on my laptop. Fortunately that's perfectly fine for this maintenance script. | ||||
* | bpo-37760: Avoid cluttering work tree with downloaded Unicode files. (GH-15128) | Greg Price | 2019-08-15 | 1 | -2/+5 |
| | |||||
* | bpo-37760: Factor out standard range-expanding logic in makeunicodedata. ↵ | Greg Price | 2019-08-14 | 1 | -33/+35 |
| | | | | | | | | (GH-15248) Much like the lower-level logic in commit ef2af1ad4, we had 4 copies of this logic, written in a couple of different ways. They're all implementing the same standard, so write it just once. | ||||
* | bpo-37760: Constant-fold some old options in makeunicodedata. (GH-15129) | Greg Price | 2019-08-13 | 1 | -24/+20 |
| | | | | | | | | The `expand` option was introduced in 2000 in commit fad27aee1. It appears to have been always set since it was committed, and what it does is tell the code to do something essential. So, just always do that, and cut the option. Also cut the `linebreakprops` option, which isn't consulted anymore. | ||||
* | bpo-37760: Factor out the basic UCD parsing logic of makeunicodedata. (GH-15130) | Greg Price | 2019-08-13 | 1 | -133/+109 |
| | | | | | | There were 10 copies of this, and almost as many distinct versions of exactly how it was written. They're all implementing the same standard. Pull them out to the top, so the more interesting logic that remains becomes easier to read. | ||||
* | Clean up and reduce visual clutter in the makeunicode.py script. (GH-7558) | Stefan Behnel | 2019-06-01 | 1 | -263/+275 |
| | |||||
* | closes bpo-36861: Update Unicode database to 12.1.0. (GH-13214) | Benjamin Peterson | 2019-05-09 | 1 | -1/+1 |
| | | | Adds ㋿. | ||||
* | bpo-36642: make unicodedata const (GH-12855) | Inada Naoki | 2019-04-16 | 1 | -1/+1 |
| | |||||
* | closes bpo-33376: Update to Unicode 12.0.0. (GH-12256) | Benjamin Peterson | 2019-03-10 | 1 | -1/+1 |
| | |||||
* | update to Unicode 11.0.0 (closes bpo-33778) (GH-7439) | Benjamin Peterson | 2018-06-07 | 1 | -20/+19 |
| | | | Also, standardize indentation of generated tables. | ||||
* | bpo-30736: upgrade to Unicode 10.0 (#2344) | Benjamin Peterson | 2017-06-23 | 1 | -4/+5 |
| | | | Straightforward. While we're at it, though, strip trailing whitespace from generated tables. | ||||
* | bpo-30296 Remove unnecessary tuples, lists, sets, and dicts (#1489) | Jon Dufresne | 2017-05-18 | 1 | -1/+1 |
| | | | | | | | | * Replaced list(<generator expression>) with list comprehension * Replaced dict(<generator expression>) with dict comprehension * Replaced set(<list literal>) with set literal * Replaced builtin func(<list comprehension>) with func(<generator expression>) when supported (e.g. any(), all(), tuple(), min(), & max()) | ||||
* | Unicode 9.0.0 | Benjamin Peterson | 2016-09-15 | 1 | -3/+8 |
| | | | | | Not completely mechanical since support for East Asian Width changes—emoji codepoints became Wide—had to be added to unicodedata. | ||||
* | upgrade to Unicode 8.0.0 | Benjamin Peterson | 2015-06-27 | 1 | -3/+4 |
| | |||||
* | Merge: #18176: Change generic UCD PropList link to version specific link. | R David Murray | 2014-10-10 | 1 | -1/+1 |
|\ | |||||
| * | #18176: Change generic UCD PropList link to version specific link. | R David Murray | 2014-10-10 | 1 | -1/+1 |
| | | |||||
* | | Merge: #18176: fix another reference and add it to the makeunicodedata comment. | R David Murray | 2014-10-09 | 1 | -0/+1 |
|\ \ | |/ | |||||
| * | #18176: fix another reference and add it to the makeunicodedata comment. | R David Murray | 2014-10-09 | 1 | -0/+1 |
| | | |||||
* | | Merge: #18176: updated stdtypes UCD link, added reminder to makeunicodedata. | R David Murray | 2014-10-09 | 1 | -0/+4 |
|\ \ | |/ | |||||
| * | #18176: updated stdtypes UCD link, added reminder to makeunicodedata. | R David Murray | 2014-10-09 | 1 | -0/+4 |
| | | | | | | | | Patch by Alexander Belopolsky. | ||||
* | | upgrade to unicode 7.0.0 | Benjamin Peterson | 2014-07-06 | 1 | -1/+1 |
|/ | |||||
* | upgrade unicode db to 6.3.0 (closes #19221) | Benjamin Peterson | 2013-10-10 | 1 | -2/+2 |
| | |||||
* | #18803: merge with 3.3. | Ezio Melotti | 2013-08-25 | 1 | -1/+1 |
|\ | |||||
| * | #18803: fix more typos. Patch by Févry Thibault. | Ezio Melotti | 2013-08-25 | 1 | -1/+1 |
| | | |||||
* | | Issue #18722: Remove uses of the "register" keyword in C code. | Antoine Pitrou | 2013-08-13 | 1 | -2/+2 |
| | | |||||
* | | upgrade to UCD 6.2 | Benjamin Peterson | 2012-09-29 | 1 | -1/+1 |
|/ | |||||
* | Some cleanup in the Tools directory. | Florent Xicluna | 2012-07-07 | 1 | -1/+0 |
| | |||||
* | update to Unicode 6.1 | Benjamin Peterson | 2012-02-21 | 1 | -4/+5 |
| | |||||
* | delta encoding of upper/lower/title makes a glorious return (#12736) | Benjamin Peterson | 2012-01-16 | 1 | -0/+7 |
| | |||||
* | add str.casefold() (closes #13752) | Benjamin Peterson | 2012-01-14 | 1 | -15/+35 |
| | |||||
* | use full unicode mappings for upper/lower/title case (#12736) | Benjamin Peterson | 2012-01-11 | 1 | -32/+69 |
| | | | | Also broaden the category of characters that count as lowercase/uppercase. | ||||
* | #12753: Add support for Unicode name aliases and named sequences. | Ezio Melotti | 2011-10-21 | 1 | -2/+100 |
| | |||||
* | Fix ResourceWarnings in makeunicodedata.py. | Ezio Melotti | 2011-09-30 | 1 | -81/+88 |
| | |||||
* | #11565: Merge with 3.1. | Ezio Melotti | 2011-03-16 | 1 | -1/+1 |
|\ | |||||
| * | #11565: Fix several typos. Patch by Piotr Kasprzyk. | Ezio Melotti | 2011-03-16 | 1 | -1/+1 |
| | | |||||
* | | Issue #10459: Update CJK character names to Unicode 6.0. | Martin v. Löwis | 2010-11-22 | 1 | -2/+19 |
| | | |||||
* | | Upgrade to Unicode 6.0.0. | Martin v. Löwis | 2010-10-11 | 1 | -58/+77 |
| | | | | | | | | | | | | | | | | makeunicodedata.py: download all data files from unicode.org, switch to extracting Unihan data from zip file. Read linebreakprops and derivednormalizationprops even for old versions, even though they are not used in delta records. test:unicode.py: U+11000 is now assigned, use U+14000 instead. | ||||
* | | #9210: remove --with-wctype-functions configure option. | Amaury Forgeot d'Arc | 2010-09-12 | 1 | -4/+0 |
| | | | | | | | | | | | | | | | | The internal unicode database is now always used. (after 5 years: see http://mail.python.org/pipermail/python-dev/2004-December/050193.html ) | ||||
* | | #5127: Even on narrow unicode builds, the C functions that access the Unicode | Amaury Forgeot d'Arc | 2010-08-18 | 1 | -37/+4 |
| | | | | | | | | | | | | | | | | | | | | Database (Py_UNICODE_TOLOWER, Py_UNICODE_ISDECIMAL, and others) now accept and return characters from the full Unicode range (Py_UCS4). The differences from Python code are few: - unicodedata.numeric(), unicodedata.decimal() and unicodedata.digit() now return the correct value for large code points - repr() may consider more characters as printable. | ||||
* | | Merged revisions 79494,79496 via svnmerge from | Florent Xicluna | 2010-03-30 | 1 | -5/+24 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/trunk ........ r79494 | florent.xicluna | 2010-03-30 10:24:06 +0200 (mar, 30 mar 2010) | 2 lines #7643: Unicode codepoints VT (0x0B) and FF (0x0C) are linebreaks according to Unicode Standard Annex #14. ........ r79496 | florent.xicluna | 2010-03-30 18:29:03 +0200 (mar, 30 mar 2010) | 2 lines Highlight the change of behavior related to r79494. Now VT and FF are linebreaks. ........ | ||||
* | | Merged revisions 78982,78986 via svnmerge from | Florent Xicluna | 2010-03-19 | 1 | -4/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/trunk ........ r78982 | florent.xicluna | 2010-03-15 15:00:58 +0100 (lun, 15 mar 2010) | 2 lines Remove py3k deprecation warnings from these Unicode tools. ........ r78986 | florent.xicluna | 2010-03-15 19:08:58 +0100 (lun, 15 mar 2010) | 3 lines Issue #7783 and #7787: open_urlresource invalidates the outdated files from the local cache. Use this feature to fix test_normalization. ........ | ||||
* | | Fixed a failure in test_bigmem. | Florent Xicluna | 2010-03-19 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | Merged revision 79059 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r79059 | florent.xicluna | 2010-03-18 22:50:06 +0100 (jeu, 18 mar 2010) | 2 lines Issue #8024: Update the Unicode database to 5.2 ........ | ||||
* | | Revert Unicode UCD 5.2 upgrade in 3.x. It broke repr() for unicode objects, ↵ | Florent Xicluna | 2010-03-19 | 1 | -1/+1 |
| | | | | | | | | and gave failures in test_bigmem. Revert 79062, 79065 and 79083. |