Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Fixed bug #1915: Python compiles with --enable-unicode=no again. However ↵ | Christian Heimes | 2008-01-23 | 1 | -1/+2 |
| | | | | several extension methods and modules do not work without unicode support. | ||||
* | The incremental decoder for utf-7 must preserve its state between calls. | Amaury Forgeot d'Arc | 2007-11-20 | 1 | -13/+10 |
| | | | | | | | Solves issue1460. Might not be a backport candidate: a new API function was added, and some code may rely on details in utf-7.py. | ||||
* | Fix for #1444: utf_8_sig.StreamReader was (indirectly through decode()) | Walter Dörwald | 2007-11-19 | 1 | -5/+11 |
| | | | | | calling codecs.utf_8_decode() with final==True, which falled with incomplete byte sequences. Fix and test by James G. Sack. | ||||
* | Fix #883466: don't allow Unicode as arguments to quopri and uu codecs. | Georg Brandl | 2007-09-03 | 2 | -4/+6 |
| | |||||
* | Backport r57105 and r57145 from the py3k branch: UTF-32 codecs. | Walter Dörwald | 2007-08-17 | 4 | -0/+228 |
| | |||||
* | Fix utf-8-sig incremental decoder, which didn't recognise a BOM when the | Walter Dörwald | 2007-04-12 | 1 | -7/+12 |
| | | | | first chunk fed to the decoder started with a BOM, but was longer than 3 bytes. | ||||
* | Make the __import__ call in encodings.__init__ absolute with a level 0 call. | Brett Cannon | 2007-02-16 | 1 | -4/+4 |
| | |||||
* | Update the encoding package's search function to use absolute imports when | Brett Cannon | 2007-02-15 | 1 | -2/+4 |
| | | | | | | | | | | | | calling __import__. This helps make the expected search locations for encoding modules be more explicit. One could use an explicit value for __path__ when making the call to __import__ to force the exact location searched for encodings. This would give the most strict search path possible if one is worried about malicious code being imported. The unfortunate side-effect of that is that if __path__ was modified on 'encodings' on purpose in a safe way it would not be picked up in future __import__ calls. | ||||
* | Patch #1634778: add missing encoding aliases for iso8859_15 and | Georg Brandl | 2007-01-27 | 1 | -0/+4 |
| | | | | iso8859_16. | ||||
* | Change decode() so that it works with a buffer (i.e. unicode(..., 'utf-8-sig')) | Walter Dörwald | 2006-11-23 | 1 | -1/+1 |
| | | | | SF bug #1601501. | ||||
* | Bug #1586613: fix zlib and bz2 codecs' incremental en/decoders. | Georg Brandl | 2006-10-29 | 2 | -8/+51 |
| | |||||
* | Bug #1446043: correctly raise a LookupError if an encoding name given | Georg Brandl | 2006-09-30 | 1 | -1/+1 |
| | | | | to encodings.search_function() contains a dot. | ||||
* | importing types is not necessary if we use isinstance | Neal Norwitz | 2006-08-25 | 1 | -2/+2 |
| | |||||
* | Correction of patch #1455898: In the mbcs decoder, set final=False | Martin v. Löwis | 2006-08-02 | 1 | -19/+11 |
| | | | | for stream decoder, but final=True for the decode function. | ||||
* | Make import/lookup of mbcs fail on non-Windows systems. | Martin v. Löwis | 2006-06-15 | 1 | -4/+8 |
| | |||||
* | Patch #1455898: Incremental mode for "mbcs" codec. | Martin v. Löwis | 2006-06-14 | 1 | -3/+4 |
| | |||||
* | errors is an attribute in the incremental decoder | Walter Dörwald | 2006-06-13 | 1 | -3/+3 |
| | | | | not an argument. | ||||
* | Fix passing errors to the encoder and decoder functions. | Walter Dörwald | 2006-06-13 | 2 | -4/+4 |
| | |||||
* | Whitespace normalization. | Tim Peters | 2006-06-04 | 45 | -45/+0 |
| | |||||
* | Patch #1359618: Speed-up charmap encoder. | Martin v. Löwis | 2006-06-04 | 45 | -11441/+180 |
| | |||||
* | Add a BufferedIncrementalEncoder class that can be used for implementing | Walter Dörwald | 2006-04-14 | 1 | -6/+72 |
| | | | | | | | | | an incremental encoder that must retain part of the data between calls to the encode() method. Fix the incremental encoder and decoder for the IDNA encoding. This closes SF patch #1453235. | ||||
* | Make error message less misleading for u"a..b".encode("idna"). | Walter Dörwald | 2006-04-14 | 1 | -3/+3 |
| | |||||
* | Make raise statements PEP 8 compatible. | Walter Dörwald | 2006-04-14 | 1 | -11/+11 |
| | |||||
* | Whitespace. | Walter Dörwald | 2006-03-27 | 24 | -0/+72 |
| | |||||
* | Patch #1443155: Add the incremental codecs support for CJK codecs. | Hye-Shik Chang | 2006-03-26 | 24 | -432/+480 |
| | | | | (reviewed by Walter Dörwald) | ||||
* | Instead of relative imports, use (implicitly) absolute ones. | Guido van Rossum | 2006-03-15 | 1 | -1/+1 |
| | |||||
* | Whitespace normalization. | Tim Peters | 2006-03-15 | 48 | -50/+4 |
| | |||||
* | Fix typo. | Walter Dörwald | 2006-03-15 | 1 | -2/+2 |
| | |||||
* | Patch #1436130: codecs.lookup() now returns a CodecInfo object (a subclass | Walter Dörwald | 2006-03-15 | 90 | -375/+1706 |
| | | | | | | | of tuple) that provides incremental decoders and encoders (a way to use stateful codecs without the stream API). Functions codecs.getincrementaldecoder() and codecs.getincrementalencoder() have been added. | ||||
* | Use relative imports in a few places where I noticed the need. | Guido van Rossum | 2006-03-15 | 1 | -1/+2 |
| | | | | | (Ideally, all packages in Python 2.5 will use the relative import syntax for all their relative import needs.) | ||||
* | Avoid forward-declaring the methods array. | Martin v. Löwis | 2006-03-10 | 1 | -1/+1 |
| | | | | Rename unicodedata.db* to unicodedata.ucd* | ||||
* | Update Unicode database to Unicode 4.1. | Martin v. Löwis | 2006-03-09 | 1 | -1/+2 |
| | |||||
* | Fix the encodings package codec search function to only search | Marc-André Lemburg | 2006-02-19 | 1 | -1/+1 |
| | | | | | | inside its own package. Fixes problem reported in patch #1433198. Add codec search function for codec test codec. | ||||
* | Patch #1177307: UTF-8-Sig codec. | Martin v. Löwis | 2006-01-08 | 1 | -0/+57 |
| | |||||
* | Whitespace normalization. | Tim Peters | 2005-12-25 | 61 | -32998/+32953 |
| | |||||
* | Cosmetic change: make all hex literals use upper case hex so that they | Marc-André Lemburg | 2005-10-24 | 45 | -14236/+14281 |
| | | | | | | look more like the Unicode Consortium files. Add ending new-line to all source files. | ||||
* | Removed the decoding_map from the codecs where this is possible. | Marc-André Lemburg | 2005-10-24 | 45 | -25663/+22734 |
| | | | | | Replaced the tis_620, cp1140 and koi8_u codecs with new ones based on custom mapping files. | ||||
* | Replace the old EBCDIC codecs with new ones using the decoding table. | Marc-André Lemburg | 2005-10-21 | 4 | -981/+3027 |
| | |||||
* | Alias iso8859_1 to latin_1 which is the same encoding, but has | Marc-André Lemburg | 2005-10-21 | 1 | -0/+7 |
| | | | | a much faster codec implementation. | ||||
* | Add a few more Mac OS encodings. The mapping tables for these are | Marc-André Lemburg | 2005-10-21 | 5 | -0/+3414 |
| | | | | available at ftp.unicode.org. | ||||
* | Replace the old charmap codecs with new ones generated from the current | Marc-André Lemburg | 2005-10-21 | 49 | -5129/+29964 |
| | | | | | | | mapping tables available at ftp.unicode.org. These new codecs include and use character decoding tables which speeds up decoding by a few factors. | ||||
* | Bug #1245379: Add "unicode-1-1-utf-7" as an alias for "utf-7" as specified | Walter Dörwald | 2005-10-09 | 1 | -0/+1 |
| | | | | by RFC 1642. | ||||
* | No need to import exceptions, they are builtins | Neal Norwitz | 2005-09-01 | 1 | -3/+2 |
| | |||||
* | Make IDNA return an empty string when the input is empty. Fixes #1163178. | Martin v. Löwis | 2005-08-25 | 1 | -0/+6 |
| | | | | Will backport to 2.4. | ||||
* | Reset internal buffers when seek() is called. This fixes SF bug #1156259. | Walter Dörwald | 2005-03-14 | 1 | -0/+7 |
| | |||||
* | Fix wrong variable name. | Walter Dörwald | 2004-12-29 | 1 | -1/+1 |
| | |||||
* | Rearranged mappings to value sorting order. | Marc-André Lemburg | 2004-12-10 | 1 | -13/+13 |
| | |||||
* | SF patch #998993: The UTF-8 and the UTF-16 stateful decoders now support | Walter Dörwald | 2004-09-07 | 4 | -72/+49 |
| | | | | | | | | | | | decoding incomplete input (when the input stream is temporarily exhausted). codecs.StreamReader now implements buffering, which enables proper readline support for the UTF-16 decoders. codecs.StreamReader.read() has a new argument chars which specifies the number of characters to return. codecs.StreamReader.readline() and codecs.StreamReader.readlines() have a new argument keepends. Trailing "\n"s will be stripped from the lines if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and PyUnicode_DecodeUTF16Stateful. | ||||
* | Whitespace normalization. | Tim Peters | 2004-08-07 | 3 | -136/+136 |
| | |||||
* | Added new codecs and aliases for ISO_8859-11, ISO_8859-16 and | Marc-André Lemburg | 2004-08-05 | 4 | -1/+285 |
| | | | | | | TIS-620. Closes SF bug #1001895: Adding missing ISO 8859 codecs, especially Thai. |