| | Commit message | Author | Age | Files | Lines |
---|---|---|---|---|---|
* | Patch #1177307: UTF-8-Sig codec. | Martin v. Löwis | 2006-01-08 | 1 | -0/+57 |
* | Whitespace normalization. | Tim Peters | 2005-12-25 | 61 | -32998/+32953 |
* | Cosmetic change: make all hex literals use upper case hex so that they look more like the Unicode Consortium files. Add ending new-line to all source files. | Marc-André Lemburg | 2005-10-24 | 45 | -14236/+14281 |
* | Removed the decoding_map from the codecs where this is possible. Replaced the tis_620, cp1140 and koi8_u codecs with new ones based on custom mapping files. | Marc-André Lemburg | 2005-10-24 | 45 | -25663/+22734 |
* | Replace the old EBCDIC codecs with new ones using the decoding table. | Marc-André Lemburg | 2005-10-21 | 4 | -981/+3027 |
* | Alias iso8859_1 to latin_1 which is the same encoding, but has a much faster codec implementation. | Marc-André Lemburg | 2005-10-21 | 1 | -0/+7 |
* | Add a few more Mac OS encodings. The mapping tables for these are available at ftp.unicode.org. | Marc-André Lemburg | 2005-10-21 | 5 | -0/+3414 |
* | Replace the old charmap codecs with new ones generated from the current mapping tables available at ftp.unicode.org. These new codecs include and use character decoding tables which speeds up decoding by a few factors. | Marc-André Lemburg | 2005-10-21 | 49 | -5129/+29964 |
* | Bug #1245379: Add "unicode-1-1-utf-7" as an alias for "utf-7" as specified by RFC 1642. | Walter Dörwald | 2005-10-09 | 1 | -0/+1 |
* | No need to import exceptions, they are builtins | Neal Norwitz | 2005-09-01 | 1 | -3/+2 |
* | Make IDNA return an empty string when the input is empty. Fixes #1163178. Will backport to 2.4. | Martin v. Löwis | 2005-08-25 | 1 | -0/+6 |
* | Reset internal buffers when seek() is called. This fixes SF bug #1156259. | Walter Dörwald | 2005-03-14 | 1 | -0/+7 |
* | Fix wrong variable name. | Walter Dörwald | 2004-12-29 | 1 | -1/+1 |
* | Rearranged mappings to value sorting order. | Marc-André Lemburg | 2004-12-10 | 1 | -13/+13 |
* | SF patch #998993: The UTF-8 and the UTF-16 stateful decoders now support decoding incomplete input (when the input stream is temporarily exhausted). codecs.StreamReader now implements buffering, which enables proper readline support for the UTF-16 decoders. codecs.StreamReader.read() has a new argument chars which specifies the number of characters to return. codecs.StreamReader.readline() and codecs.StreamReader.readlines() have a new argument keepends. Trailing "\n"s will be stripped from the lines if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and PyUnicode_DecodeUTF16Stateful. | Walter Dörwald | 2004-09-07 | 4 | -72/+49 |
* | Whitespace normalization. | Tim Peters | 2004-08-07 | 3 | -136/+136 |
* | Added new codecs and aliases for ISO_8859-11, ISO_8859-16 and TIS-620. Closes SF bug #1001895: Adding missing ISO 8859 codecs, especially Thai. | Marc-André Lemburg | 2004-08-05 | 4 | -1/+285 |
* | Whitespace normalization. | Tim Peters | 2004-07-31 | 1 | -95/+95 |
* | New codec: [ 996067 ] hp-roman8 codec | Marc-André Lemburg | 2004-07-28 | 1 | -0/+139 |
* | Added new codec hp-roman8 submitted as patch [ 996067 ] hp-roman8 codec. | Marc-André Lemburg | 2004-07-28 | 1 | -0/+7 |
* | Bring CJKCodecs 1.1 into trunk. This completely reorganizes source and installed layouts to make maintenance simple and easy. And it also adds four new codecs; big5hkscs, euc-jis-2004, shift-jis-2004 and iso2022-jp-2004. | Hye-Shik Chang | 2004-07-18 | 25 | -87/+260 |
* | Whitespace normalization. | Tim Peters | 2004-07-07 | 21 | -138/+118 |
* | Convert input to a string object. Fixes #909230. Backported 2.3. | Martin v. Löwis | 2004-03-23 | 1 | -0/+1 |
* | Add a new unicode codec: ptcp154 (Kazakh) | Hye-Shik Chang | 2004-03-19 | 2 | -0/+168 |
* | Fix wrong character mapping in koi8_u: SF bug #902501. | Marc-André Lemburg | 2004-02-23 | 1 | -1/+1 |
* | Let the default encodings search function lookup aliases before trying the codec import. This allows applications to install codecs which override (non-special-cased) builtin codecs. | Marc-André Lemburg | 2004-01-20 | 1 | -18/+26 |
* | Add some more code page aliases needed for completeness. | Marc-André Lemburg | 2004-01-20 | 1 | -0/+16 |
* | Fix a typo: s/iso_3022/iso2022/ | Hye-Shik Chang | 2004-01-20 | 1 | -1/+1 |
* | Add CJK codecs support as discussed on python-dev. (SF #873597) Several style fixes are suggested by Martin v. Loewis and Marc-Andre Lemburg. Thanks! | Hye-Shik Chang | 2004-01-17 | 21 | -9/+780 |
* | Revert previous change. MAL preferred the old version. | Raymond Hettinger | 2003-12-01 | 1 | -4/+41 |
* | Simplified the code. | Raymond Hettinger | 2003-12-01 | 1 | -41/+4 |
* | Fix typo in the comments. | Raymond Hettinger | 2003-09-24 | 1 | -1/+1 |
* | Added codec for bz2 compression. | Raymond Hettinger | 2003-09-23 | 2 | -0/+67 |
* | Support trailing dots in DNS names. Fixes #782510. Will backport to 2.3. | Martin v. Löwis | 2003-08-05 | 1 | -3/+15 |
* | more generic reference to python interpreter | Skip Montanaro | 2003-07-22 | 1 | -1/+1 |
* | Remove usage of re module from encodings package search function. | Marc-André Lemburg | 2003-05-16 | 1 | -4/+19 |
* | Whitespace normalization. | Tim Peters | 2003-04-24 | 3 | -10/+9 |
* | Implement IDNA (Internationalized Domain Names in Applications). | Martin v. Löwis | 2003-04-18 | 2 | -0/+409 |
* | Revert Patch #670715: iconv support. | Martin v. Löwis | 2003-04-03 | 2 | -39/+0 |
* | Handle iconv initialization errors | Neal Norwitz | 2003-02-28 | 1 | -1/+1 |
* | Patch #670715: Universal Unicode Codec for POSIX iconv. | Martin v. Löwis | 2003-01-26 | 2 | -0/+40 |
* | Whitespace normalization. | Tim Peters | 2002-12-24 | 1 | -1/+1 |
* | Add new encoding for Ukrainian Cyrillic | Neal Norwitz | 2002-10-17 | 1 | -0/+54 |
* | When looking for an alias, first look for the normalized name (which still may contain dots), then if that doesn't exist look for the name with dots replaced by underscores. This is a little more forgiving. | Guido van Rossum | 2002-10-04 | 1 | -1/+3 |
* | Undo the removal. Guido mentioned that the encoding name is in active use by some email headers. | Marc-André Lemburg | 2002-10-04 | 1 | -0/+1 |
* | Remove unneeded alias. | Marc-André Lemburg | 2002-10-04 | 1 | -1/+0 |
* | Fix doc-string. | Marc-André Lemburg | 2002-10-04 | 1 | -3/+3 |
* | Adapt lookup names to new more general encoding name normalization scheme. | Marc-André Lemburg | 2002-10-04 | 1 | -14/+14 |
* | Extending the encoding name normalization to handle more non-alphanumeric characters. | Marc-André Lemburg | 2002-10-04 | 1 | -8/+20 |
* | Oops, must convert hyphens to underscores in keys of aliases dict. | Guido van Rossum | 2002-09-26 | 1 | -1/+1 |
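
Several entries in this log describe behavior that can still be observed from the `codecs` module. The following is a minimal sketch, assuming a current Python 3 interpreter rather than the 2.x releases the commits targeted. It checks the alias lookups (iso8859_1 and "unicode-1-1-utf-7"), the UTF-8-Sig codec, the IDNA handling of empty input and trailing dots, and the `chars` and `keepends` arguments of `codecs.StreamReader`. The variable names and sample inputs are illustrative only and do not come from the commits.

```python
import codecs
import io

# Alias lookups: both spellings should resolve to the same codec entry.
same_latin_codec = (
    codecs.lookup("iso8859_1").name == codecs.lookup("latin_1").name
)
same_utf7_codec = (
    codecs.lookup("unicode-1-1-utf-7").name == codecs.lookup("utf-7").name
)

# UTF-8-Sig: the decoder drops a leading BOM, the encoder prepends one.
decoded = b"\xef\xbb\xbfhello".decode("utf-8-sig")
encoded = "hello".encode("utf-8-sig")

# IDNA: empty input encodes to an empty result; a trailing dot is kept.
empty_idna = "".encode("idna")
dotted_idna = "python.org.".encode("idna")

# StreamReader: keepends=False strips the line ending, and chars limits
# the number of decoded characters returned by read().
reader = codecs.getreader("utf-8")(io.BytesIO(b"first\nsecond\n"))
first_line = reader.readline(keepends=False)
next_char = reader.read(chars=1)

if __name__ == "__main__":
    print(same_latin_codec, same_utf7_codec)
    print(repr(decoded), repr(encoded))
    print(repr(empty_idna), repr(dotted_idna))
    print(repr(first_line), repr(next_char))
```

The comparisons use the `name` attribute of the object returned by `codecs.lookup()` instead of a hard-coded codec name, since the canonical spelling reported by the registry is an implementation detail.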