| | Commit message | Author | Age | Files | Lines |
---|---|---|---|---|---|
* | Issue #6213: Implement getstate() and setstate() methods of utf-8-sig and | Victor Stinner | 2010-07-28 | 2 | -3/+25 |
| | | | | utf-16 incremental encoders. | ||||
* | #8016: add the CP858 codec (approved by Benjamin). (Also add CP720 to the tests, it was missing there.) | Georg Brandl | 2010-05-24 | 2 | -0/+703 |
* | Fix my last commit (r81471) about codecs | Victor Stinner | 2010-05-22 | 2 | -6/+6 |
| | | | | Remember: don't touch the code just before a commit | ||||
* | Issue #6268: More bugfixes about BOM, UTF-16 and UTF-32 | Victor Stinner | 2010-05-22 | 2 | -14/+26 |
| | | | | | | | * Fix seek() method of codecs.open(), don't write the BOM twice after seek(0) * Fix reset() method of codecs, UTF-16, UTF-32 and StreamWriter classes * test_codecs: use "w+" mode instead of "wt+". "t" mode is not supported by Solaris or Windows, but does it really exist? I found it in the issue. | ||||
* | set svn:eol-style on various files | Benjamin Peterson | 2010-03-08 | 1 | -311/+309 |
| | |||||
* | #1757126: fix typo with the cyrillic_asian alias | Philip Jenvey | 2009-11-17 | 1 | -1/+1 |
| | |||||
* | replace has_key with 'in' operator | Benjamin Peterson | 2009-10-09 | 1 | -1/+1 |
| | |||||
* | replace callable() | Benjamin Peterson | 2009-10-09 | 1 | -6/+6 |
| | |||||
* | #1616979: Add the cp720 (Arabic DOS) encoding. | Amaury Forgeot d'Arc | 2009-07-13 | 1 | -0/+311 |
| | | | | | Since there is no official mapping file from unicode.org, the codec file is generated on Windows with the new genwincodec.py script. | ||||
* | Fixed bug #1915: Python compiles with --enable-unicode=no again. However several extension methods and modules do not work without unicode support. | Christian Heimes | 2008-01-23 | 1 | -1/+2 |
* | The incremental decoder for utf-7 must preserve its state between calls. | Amaury Forgeot d'Arc | 2007-11-20 | 1 | -13/+10 |
| | | | | | | | Solves issue1460. Might not be a backport candidate: a new API function was added, and some code may rely on details in utf-7.py. | ||||
* | Fix for #1444: utf_8_sig.StreamReader was (indirectly through decode()) | Walter Dörwald | 2007-11-19 | 1 | -5/+11 |
| | | | | | calling codecs.utf_8_decode() with final==True, which failed with incomplete byte sequences. Fix and test by James G. Sack. | ||||
* | Fix #883466: don't allow Unicode as arguments to quopri and uu codecs. | Georg Brandl | 2007-09-03 | 2 | -4/+6 |
| | |||||
* | Backport r57105 and r57145 from the py3k branch: UTF-32 codecs. | Walter Dörwald | 2007-08-17 | 4 | -0/+228 |
| | |||||
* | Fix utf-8-sig incremental decoder, which didn't recognise a BOM when the | Walter Dörwald | 2007-04-12 | 1 | -7/+12 |
| | | | | first chunk fed to the decoder started with a BOM, but was longer than 3 bytes. | ||||
* | Make the __import__ call in encodings.__init__ absolute with a level 0 call. | Brett Cannon | 2007-02-16 | 1 | -4/+4 |
| | |||||
* | Update the encoding package's search function to use absolute imports when | Brett Cannon | 2007-02-15 | 1 | -2/+4 |
| | | | | | | | | | | | | calling __import__. This helps make the expected search locations for encoding modules be more explicit. One could use an explicit value for __path__ when making the call to __import__ to force the exact location searched for encodings. This would give the most strict search path possible if one is worried about malicious code being imported. The unfortunate side-effect of that is that if __path__ was modified on 'encodings' on purpose in a safe way it would not be picked up in future __import__ calls. | ||||
* | Patch #1634778: add missing encoding aliases for iso8859_15 and | Georg Brandl | 2007-01-27 | 1 | -0/+4 |
| | | | | iso8859_16. | ||||
* | Change decode() so that it works with a buffer (i.e. unicode(..., 'utf-8-sig')) | Walter Dörwald | 2006-11-23 | 1 | -1/+1 |
| | | | | SF bug #1601501. | ||||
* | Bug #1586613: fix zlib and bz2 codecs' incremental en/decoders. | Georg Brandl | 2006-10-29 | 2 | -8/+51 |
| | |||||
* | Bug #1446043: correctly raise a LookupError if an encoding name given | Georg Brandl | 2006-09-30 | 1 | -1/+1 |
| | | | | to encodings.search_function() contains a dot. | ||||
* | importing types is not necessary if we use isinstance | Neal Norwitz | 2006-08-25 | 1 | -2/+2 |
| | |||||
* | Correction of patch #1455898: In the mbcs decoder, set final=False | Martin v. Löwis | 2006-08-02 | 1 | -19/+11 |
| | | | | for stream decoder, but final=True for the decode function. | ||||
* | Make import/lookup of mbcs fail on non-Windows systems. | Martin v. Löwis | 2006-06-15 | 1 | -4/+8 |
| | |||||
* | Patch #1455898: Incremental mode for "mbcs" codec. | Martin v. Löwis | 2006-06-14 | 1 | -3/+4 |
| | |||||
* | errors is an attribute in the incremental decoder | Walter Dörwald | 2006-06-13 | 1 | -3/+3 |
| | | | | not an argument. | ||||
* | Fix passing errors to the encoder and decoder functions. | Walter Dörwald | 2006-06-13 | 2 | -4/+4 |
| | |||||
* | Whitespace normalization. | Tim Peters | 2006-06-04 | 45 | -45/+0 |
| | |||||
* | Patch #1359618: Speed-up charmap encoder. | Martin v. Löwis | 2006-06-04 | 45 | -11441/+180 |
| | |||||
* | Add a BufferedIncrementalEncoder class that can be used for implementing | Walter Dörwald | 2006-04-14 | 1 | -6/+72 |
| | | | | | | | | | an incremental encoder that must retain part of the data between calls to the encode() method. Fix the incremental encoder and decoder for the IDNA encoding. This closes SF patch #1453235. | ||||
* | Make error message less misleading for u"a..b".encode("idna"). | Walter Dörwald | 2006-04-14 | 1 | -3/+3 |
| | |||||
* | Make raise statements PEP 8 compatible. | Walter Dörwald | 2006-04-14 | 1 | -11/+11 |
| | |||||
* | Whitespace. | Walter Dörwald | 2006-03-27 | 24 | -0/+72 |
| | |||||
* | Patch #1443155: Add the incremental codecs support for CJK codecs. | Hye-Shik Chang | 2006-03-26 | 24 | -432/+480 |
| | | | | (reviewed by Walter Dörwald) | ||||
* | Instead of relative imports, use (implicitly) absolute ones. | Guido van Rossum | 2006-03-15 | 1 | -1/+1 |
| | |||||
* | Whitespace normalization. | Tim Peters | 2006-03-15 | 48 | -50/+4 |
| | |||||
* | Fix typo. | Walter Dörwald | 2006-03-15 | 1 | -2/+2 |
| | |||||
* | Patch #1436130: codecs.lookup() now returns a CodecInfo object (a subclass | Walter Dörwald | 2006-03-15 | 90 | -375/+1706 |
| | | | | | | | of tuple) that provides incremental decoders and encoders (a way to use stateful codecs without the stream API). Functions codecs.getincrementaldecoder() and codecs.getincrementalencoder() have been added. | ||||
* | Use relative imports in a few places where I noticed the need. | Guido van Rossum | 2006-03-15 | 1 | -1/+2 |
| | | | | | (Ideally, all packages in Python 2.5 will use the relative import syntax for all their relative import needs.) | ||||
* | Avoid forward-declaring the methods array. | Martin v. Löwis | 2006-03-10 | 1 | -1/+1 |
| | | | | Rename unicodedata.db* to unicodedata.ucd* | ||||
* | Update Unicode database to Unicode 4.1. | Martin v. Löwis | 2006-03-09 | 1 | -1/+2 |
| | |||||
* | Fix the encodings package codec search function to only search | Marc-André Lemburg | 2006-02-19 | 1 | -1/+1 |
| | | | | | | inside its own package. Fixes problem reported in patch #1433198. Add codec search function for codec test codec. | ||||
* | Patch #1177307: UTF-8-Sig codec. | Martin v. Löwis | 2006-01-08 | 1 | -0/+57 |
| | |||||
* | Whitespace normalization. | Tim Peters | 2005-12-25 | 61 | -32998/+32953 |
| | |||||
* | Cosmetic change: make all hex literals use upper case hex so that they | Marc-André Lemburg | 2005-10-24 | 45 | -14236/+14281 |
| | | | | | | look more like the Unicode Consortium files. Add ending new-line to all source files. | ||||
* | Removed the decoding_map from the codecs where this is possible. | Marc-André Lemburg | 2005-10-24 | 45 | -25663/+22734 |
| | | | | | Replaced the tis_620, cp1140 and koi8_u codecs with new ones based on custom mapping files. | ||||
* | Replace the old EBCDIC codecs with new ones using the decoding table. | Marc-André Lemburg | 2005-10-21 | 4 | -981/+3027 |
| | |||||
* | Alias iso8859_1 to latin_1 which is the same encoding, but has | Marc-André Lemburg | 2005-10-21 | 1 | -0/+7 |
| | | | | a much faster codec implementation. | ||||
* | Add a few more Mac OS encodings. The mapping tables for these are | Marc-André Lemburg | 2005-10-21 | 5 | -0/+3414 |
| | | | | available at ftp.unicode.org. | ||||
* | Replace the old charmap codecs with new ones generated from the current | Marc-André Lemburg | 2005-10-21 | 49 | -5129/+29964 |
| | | | | | | | mapping tables available at ftp.unicode.org. These new codecs include and use character decoding tables which speeds up decoding by a few factors. |