summaryrefslogtreecommitdiffstats
path: root/Lib/encodings
Commit message (Collapse)AuthorAgeFilesLines
* bpo-29240: PEP 540: Add a new UTF-8 Mode (#855)Victor Stinner2017-12-131-2/+3
| | | | | | | | | | | | | | | | | | | | | | * Add -X utf8 command line option, PYTHONUTF8 environment variable and a new sys.flags.utf8_mode flag. * If the LC_CTYPE locale is "C" at startup: enable automatically the UTF-8 mode. * Add _winapi.GetACP(). encodings._alias_mbcs() now calls _winapi.GetACP() to get the ANSI code page * locale.getpreferredencoding() now returns 'UTF-8' in the UTF-8 mode. As a side effect, open() now uses the UTF-8 encoding by default in this mode. * Py_DecodeLocale() and Py_EncodeLocale() now use the UTF-8 encoding in the UTF-8 Mode. * Update subprocess._args_from_interpreter_flags() to handle -X utf8 * Skip some tests relying on the current locale if the UTF-8 mode is enabled. * Add test_utf8mode.py. * _Py_DecodeUTF8_surrogateescape() gets a new optional parameter to return also the length (number of wide characters). * pymain_get_global_config() and pymain_set_global_config() now always copy flag values, rather than only copying if the new value is greater than the old value.
* Revert #27959: ImportError within an encoding module should also skip the ↵Steve Dower2016-09-091-3/+4
| | | | encoding
* Issue #28005: Allow ImportErrors in encoding implementation to propagate.Steve Dower2016-09-081-2/+3
|
* Issue #27959: Prevent ImportError from escaping codec search functionSteve Dower2016-09-071-4/+8
|
* Issue #27959: Adds oem encoding, alias ansi to mbcs, move aliasmbcs to codec ↵Steve Dower2016-09-073-0/+52
| | | | lookup
* PEP 7 style for if/else in CVictor Stinner2016-09-021-0/+1
| | | | Add also a newline for readability in normalize_encoding().
* Issue #27076: Doc, comment and tests spelling fixesMartin Panter2016-05-262-2/+2
| | | | Most fixes to Doc/ and Lib/ directories by Ville Skyttä.
* Add some "used with permission" mentions where external resources are ↵Brett Cannon2016-01-151-0/+2
| | | | | | referenced. Permission was validated prior to adding these markings.
* Issue #16473: Merge codecs doc and test from 3.4 into 3.5Martin Panter2015-09-121-1/+1
|\
| * Issue #16473: Fix byte transform codec documentation; test quotetabs=TrueMartin Panter2015-09-121-1/+1
| | | | | | | | | | | | This changes the equivalent functions listed for the Base-64, hex and Quoted- Printable codecs to reflect the functions actually used. Also mention and test the "quotetabs" setting for Quoted-Printable encoding.
* | Added forgotten new files for issues #22681 and #22682.Serhiy Storchaka2015-05-122-0/+615
| |
* | Issue #22682: Added support for the kz1048 encoding.Serhiy Storchaka2015-05-121-0/+5
| |
* | Issue #22406: Fixed the uu_codec codec incorrectly ported to 3.x.Serhiy Storchaka2014-11-071-1/+1
|\ \ | |/ | | | | Based on patch by Martin Panter.
| * Issue #22406: Fixed the uu_codec codec incorrectly ported to 3.x.Serhiy Storchaka2014-11-071-1/+1
| | | | | | | | Based on patch by Martin Panter.
* | Issue #21171: Fixed undocumented filter API of the rot13 codec.Serhiy Storchaka2014-04-131-1/+1
|\ \ | |/ | | | | Patch by Berker Peksag.
| * Issue #21171: Fixed undocumented filter API of the rot13 codec.Serhiy Storchaka2014-04-131-1/+1
| | | | | | | | Patch by Berker Peksag.
* | Issue #20574: Implement incremental decoder for cp65001 codeVictor Stinner2014-03-171-3/+6
|/ | | | (Windows code page 65001, Microsoft UTF-8).
* Merge #7475: Remove references to '.transform' from transform codec docstrings.R David Murray2014-03-146-12/+6
|\
| * #7475: Remove references to '.transform' from transform codec docstrings.R David Murray2014-03-146-12/+6
| |
| * Issue #19619: Blacklist non-text codecs in method APISerhiy Storchaka2014-02-247-0/+7
| | | | | | | | | | | | | | | | | | | | | | | | str.encode, bytes.decode and bytearray.decode now use an internal API to throw LookupError for known non-text encodings, rather than attempting the encoding or decoding operation and then throwing a TypeError for an unexpected output type. The latter mechanism remains in place for third party non-text encodings. Backported changeset d68df99d7a57.
* | whatsnew: cp273 codec (#10907797)R David Murray2014-03-081-0/+5
| | | | | | | | | | Also updated the docs and added the aliases mentioned by the references.
* | Fixed incorrectly applying a patch for issue19668.Serhiy Storchaka2013-11-232-26/+724
| |
* | Issue #19668: Added support for the cp1125 encoding.Serhiy Storchaka2013-11-232-26/+32
| |
* | Close #7475: Restore binary & text transform codecsNick Coghlan2013-11-231-18/+18
| | | | | | | | | | | | | | | | | | | | | | | | The codecs themselves were restored in Python 3.2, this completes the restoration by adding back the convenience aliases. These aliases were originally left out due to confusing errors when attempting to use them with the text encoding specific convenience methods. Python 3.4 includes several improvements to those errors, thus permitting the aliases to be restored as well.
* | Issue #19619: Blacklist non-text codecs in method APINick Coghlan2013-11-227-0/+7
| | | | | | | | | | | | | | | | | | | | str.encode, bytes.decode and bytearray.decode now use an internal API to throw LookupError for known non-text encodings, rather than attempting the encoding or decoding operation and then throwing a TypeError for an unexpected output type. The latter mechanism remains in place for third party non-text encodings.
* | #1097797: Add CP273 codec, and exercise it in the test suiteAndrew Kuchling2013-11-101-0/+307
| |
* | Issue #18200: Back out usage of ModuleNotFoundError (8d28d44f3a9a)Brett Cannon2013-07-041-3/+4
| |
* | Issue #18200: Update the stdlib (except tests) to useBrett Cannon2013-06-141-4/+3
| | | | | | | | ModuleNotFoundError.
* | Add fast-path in PyUnicode_DecodeCharmap() for pure 8 bit encodings:Victor Stinner2013-04-093-3/+0
|/ | | | cp037, cp500 and iso8859_1 codecs
* Normalize whitespaceAntoine Pitrou2012-06-162-2/+0
|
* Issue #14874: Restore charmap decoding speed to pre-PEP 393 levels.Antoine Pitrou2012-06-167-422/+1078
| | | | Patch by Serhiy Storchaka.
* Speed up IDNA for the common caseAntoine Pitrou2011-11-101-0/+22
|
* Merge 3.2Florent Xicluna2011-10-281-6/+5
|\
| * Closes #13258: Use callable() built-in in the standard library.Florent Xicluna2011-10-281-6/+5
| |
* | Close #13247: Add cp65001 codec, the Windows UTF-8 (CP_UTF8)Victor Stinner2011-10-261-0/+40
|/
* Issue #10807: Remove base64, bz2, hex, quopri, rot13, uu and zlib codecs fromVictor Stinner2011-01-021-18/+18
| | | | the codec aliases. They are still accessible via codecs.lookup().
* Don't use deprecated aliases.Georg Brandl2010-12-061-4/+4
|
* #7475: add (un)transform method to bytes/bytearray and str, add back codecs ↵Georg Brandl2010-12-028-18/+550
| | | | that can be used with them from Python 2.
* remove pointless coding cookiesFlorent Xicluna2010-08-301-1/+0
|
* Fix a typo in the alias target name for 'macintosh'.Marc-André Lemburg2010-08-211-1/+1
|
* alias macintosh to mac_roman #843590Benjamin Peterson2010-08-211-0/+1
|
* Merged revisions 81499,81506 via svnmerge fromBenjamin Peterson2010-06-272-0/+703
| | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/trunk ........ r81499 | georg.brandl | 2010-05-24 16:29:07 -0500 (Mon, 24 May 2010) | 1 line #8016: add the CP858 codec (approved by Benjamin). (Also add CP720 to the tests, it was missing there.) ........ r81506 | benjamin.peterson | 2010-05-24 17:04:53 -0500 (Mon, 24 May 2010) | 1 line set svn:eol-style ........
* Merged revisions 81471-81472 via svnmerge fromVictor Stinner2010-05-222-14/+26
| | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/trunk ........ r81471 | victor.stinner | 2010-05-22 15:37:56 +0200 (sam., 22 mai 2010) | 7 lines Issue #6268: More bugfixes about BOM, UTF-16 and UTF-32 * Fix seek() method of codecs.open(), don't write the BOM twice after seek(0) * Fix reset() method of codecs, UTF-16, UTF-32 and StreamWriter classes * test_codecs: use "w+" mode instead of "wt+". "t" mode is not supported by Solaris or Windows, but does it really exist? I found it the in the issue. ........ r81472 | victor.stinner | 2010-05-22 15:44:25 +0200 (sam., 22 mai 2010) | 4 lines Fix my last commit (r81471) about codecs Rememder: don't touch the code just before a commit ........
* Merged revisions 78806 via svnmerge fromBenjamin Peterson2010-03-081-311/+309
| | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/trunk ........ r78806 | benjamin.peterson | 2010-03-08 16:15:11 -0600 (Mon, 08 Mar 2010) | 1 line set svn:eol-style on various files ........
* Fix a minor grammatical error.Brett Cannon2009-12-131-1/+1
|
* Merged revisions 76337 via svnmerge fromPhilip Jenvey2009-11-171-1/+1
| | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/trunk ........ r76337 | philip.jenvey | 2009-11-16 18:42:26 -0800 (Mon, 16 Nov 2009) | 2 lines #1757126: fix typo with the cyrillic_asian alias ........
* Oops, really pass a bytes string to the ctypes function.Amaury Forgeot d'Arc2009-07-131-27/+27
|
* Merged revisions 74000-74001 via svnmerge fromAmaury Forgeot d'Arc2009-07-131-0/+311
| | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/trunk ........ r74000 | amaury.forgeotdarc | 2009-07-13 22:01:11 +0200 (lun., 13 juil. 2009) | 4 lines #1616979: Add the cp720 (Arabic DOS) encoding. Since there is no official mapping file from unicode.org, the codec file is generated on Windows with the new genwincodec.py script. ........ r74001 | amaury.forgeotdarc | 2009-07-13 22:03:21 +0200 (lun., 13 juil. 2009) | 2 lines NEWS entry for r74000. ........
* #1276: Add temporary encoding aliases for non-supported Mac CJKHye-Shik Chang2008-08-231-0/+5
| | | | | encodings that are detected as system defaults in MacOS with CJK locales. Will be replaced by properly-implemented codecs in 3.1.
* #2834: Change re module semantics, so that str and bytes mixing is forbidden,Antoine Pitrou2008-08-191-5/+3
| | | | | and str (unicode) patterns get full unicode matching by default. The re.ASCII flag is also introduced to ask for ASCII matching instead.