summaryrefslogtreecommitdiffstats
path: root/Lib/encodings
Commit message (Collapse)AuthorAgeFilesLines
* Fixed bug #1915: Python compiles with --enable-unicode=no again. However ↵Christian Heimes2008-01-231-1/+2
| | | | several extension methods and modules do not work without unicode support.
* The incremental decoder for utf-7 must preserve its state between calls.Amaury Forgeot d'Arc2007-11-201-13/+10
| | | | | | | Solves issue1460. Might not be a backport candidate: a new API function was added, and some code may rely on details in utf-7.py.
* Fix for #1444: utf_8_sig.StreamReader was (indirectly through decode())Walter Dörwald2007-11-191-5/+11
| | | | | calling codecs.utf_8_decode() with final==True, which falled with incomplete byte sequences. Fix and test by James G. Sack.
* Fix #883466: don't allow Unicode as arguments to quopri and uu codecs.Georg Brandl2007-09-032-4/+6
|
* Backport r57105 and r57145 from the py3k branch: UTF-32 codecs.Walter Dörwald2007-08-174-0/+228
|
* Fix utf-8-sig incremental decoder, which didn't recognise a BOM when theWalter Dörwald2007-04-121-7/+12
| | | | first chunk fed to the decoder started with a BOM, but was longer than 3 bytes.
* Make the __import__ call in encodings.__init__ absolute with a level 0 call.Brett Cannon2007-02-161-4/+4
|
* Update the encoding package's search function to use absolute imports whenBrett Cannon2007-02-151-2/+4
| | | | | | | | | | | | calling __import__. This helps make the expected search locations for encoding modules be more explicit. One could use an explicit value for __path__ when making the call to __import__ to force the exact location searched for encodings. This would give the most strict search path possible if one is worried about malicious code being imported. The unfortunate side-effect of that is that if __path__ was modified on 'encodings' on purpose in a safe way it would not be picked up in future __import__ calls.
* Patch #1634778: add missing encoding aliases for iso8859_15 andGeorg Brandl2007-01-271-0/+4
| | | | iso8859_16.
* Change decode() so that it works with a buffer (i.e. unicode(..., 'utf-8-sig'))Walter Dörwald2006-11-231-1/+1
| | | | SF bug #1601501.
* Bug #1586613: fix zlib and bz2 codecs' incremental en/decoders.Georg Brandl2006-10-292-8/+51
|
* Bug #1446043: correctly raise a LookupError if an encoding name givenGeorg Brandl2006-09-301-1/+1
| | | | to encodings.search_function() contains a dot.
* importing types is not necessary if we use isinstanceNeal Norwitz2006-08-251-2/+2
|
* Correction of patch #1455898: In the mbcs decoder, set final=FalseMartin v. Löwis2006-08-021-19/+11
| | | | for stream decoder, but final=True for the decode function.
* Make import/lookup of mbcs fail on non-Windows systems.Martin v. Löwis2006-06-151-4/+8
|
* Patch #1455898: Incremental mode for "mbcs" codec.Martin v. Löwis2006-06-141-3/+4
|
* errors is an attribute in the incremental decoderWalter Dörwald2006-06-131-3/+3
| | | | not an argument.
* Fix passing errors to the encoder and decoder functions.Walter Dörwald2006-06-132-4/+4
|
* Whitespace normalization.Tim Peters2006-06-0445-45/+0
|
* Patch #1359618: Speed-up charmap encoder.Martin v. Löwis2006-06-0445-11441/+180
|
* Add a BufferedIncrementalEncoder class that can be used for implementingWalter Dörwald2006-04-141-6/+72
| | | | | | | | | an incremental encoder that must retain part of the data between calls to the encode() method. Fix the incremental encoder and decoder for the IDNA encoding. This closes SF patch #1453235.
* Make error message less misleading for u"a..b".encode("idna").Walter Dörwald2006-04-141-3/+3
|
* Make raise statements PEP 8 compatible.Walter Dörwald2006-04-141-11/+11
|
* Whitespace.Walter Dörwald2006-03-2724-0/+72
|
* Patch #1443155: Add the incremental codecs support for CJK codecs.Hye-Shik Chang2006-03-2624-432/+480
| | | | (reviewed by Walter Dörwald)
* Instead of relative imports, use (implicitly) absolute ones.Guido van Rossum2006-03-151-1/+1
|
* Whitespace normalization.Tim Peters2006-03-1548-50/+4
|
* Fix typo.Walter Dörwald2006-03-151-2/+2
|
* Patch #1436130: codecs.lookup() now returns a CodecInfo object (a subclassWalter Dörwald2006-03-1590-375/+1706
| | | | | | | of tuple) that provides incremental decoders and encoders (a way to use stateful codecs without the stream API). Functions codecs.getincrementaldecoder() and codecs.getincrementalencoder() have been added.
* Use relative imports in a few places where I noticed the need.Guido van Rossum2006-03-151-1/+2
| | | | | (Ideally, all packages in Python 2.5 will use the relative import syntax for all their relative import needs.)
* Avoid forward-declaring the methods array.Martin v. Löwis2006-03-101-1/+1
| | | | Rename unicodedata.db* to unicodedata.ucd*
* Update Unicode database to Unicode 4.1.Martin v. Löwis2006-03-091-1/+2
|
* Fix the encodings package codec search function to only searchMarc-André Lemburg2006-02-191-1/+1
| | | | | | inside its own package. Fixes problem reported in patch #1433198. Add codec search function for codec test codec.
* Patch #1177307: UTF-8-Sig codec.Martin v. Löwis2006-01-081-0/+57
|
* Whitespace normalization.Tim Peters2005-12-2561-32998/+32953
|
* Cosmetic change: make all hex literals use upper case hex so that theyMarc-André Lemburg2005-10-2445-14236/+14281
| | | | | | look more like the Unicode Consortium files. Add ending new-line to all source files.
* Removed the decoding_map from the codecs where this is possible.Marc-André Lemburg2005-10-2445-25663/+22734
| | | | | Replaced the tis_620, cp1140 and koi8_u codecs with new ones based on custom mapping files.
* Replace the old EBCDIC codecs with new ones using the decoding table.Marc-André Lemburg2005-10-214-981/+3027
|
* Alias iso8859_1 to latin_1 which is the same encoding, but hasMarc-André Lemburg2005-10-211-0/+7
| | | | a much faster codec implementation.
* Add a few more Mac OS encodings. The mapping tables for these areMarc-André Lemburg2005-10-215-0/+3414
| | | | available at ftp.unicode.org.
* Replace the old charmap codecs with new ones generated from the currentMarc-André Lemburg2005-10-2149-5129/+29964
| | | | | | | mapping tables available at ftp.unicode.org. These new codecs include and use character decoding tables which speeds up decoding by a few factors.
* Bug #1245379: Add "unicode-1-1-utf-7" as an alias for "utf-7" as specifiedWalter Dörwald2005-10-091-0/+1
| | | | by RFC 1642.
* No need to import exceptions, they are builtinsNeal Norwitz2005-09-011-3/+2
|
* Make IDNA return an empty string when the input is empty. Fixes #1163178.Martin v. Löwis2005-08-251-0/+6
| | | | Will backport to 2.4.
* Reset internal buffers when seek() is called. This fixes SF bug #1156259.Walter Dörwald2005-03-141-0/+7
|
* Fix wrong variable name.Walter Dörwald2004-12-291-1/+1
|
* Rearranged mappings to value sorting order.Marc-André Lemburg2004-12-101-13/+13
|
* SF patch #998993: The UTF-8 and the UTF-16 stateful decoders now supportWalter Dörwald2004-09-074-72/+49
| | | | | | | | | | | decoding incomplete input (when the input stream is temporarily exhausted). codecs.StreamReader now implements buffering, which enables proper readline support for the UTF-16 decoders. codecs.StreamReader.read() has a new argument chars which specifies the number of characters to return. codecs.StreamReader.readline() and codecs.StreamReader.readlines() have a new argument keepends. Trailing "\n"s will be stripped from the lines if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and PyUnicode_DecodeUTF16Stateful.
* Whitespace normalization.Tim Peters2004-08-073-136/+136
|
* Added new codecs and aliases for ISO_8859-11, ISO_8859-16 andMarc-André Lemburg2004-08-054-1/+285
| | | | | | TIS-620. Closes SF bug #1001895: Adding missing ISO 8859 codecs, especially Thai.