Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Revert previous change. MAL preferred the old version. | Raymond Hettinger | 2003-12-01 | 1 | -4/+41 |
| | |||||
* | Simplifed the code. | Raymond Hettinger | 2003-12-01 | 1 | -41/+4 |
| | |||||
* | Fix typo in the comments. | Raymond Hettinger | 2003-09-24 | 1 | -1/+1 |
| | |||||
* | Added codec for bz2 compression. | Raymond Hettinger | 2003-09-23 | 2 | -0/+67 |
| | |||||
* | Support trailing dots in DNS names. Fixes #782510. Will backport to 2.3. | Martin v. Löwis | 2003-08-05 | 1 | -3/+15 |
| | |||||
* | more generic reference to python interpreter | Skip Montanaro | 2003-07-22 | 1 | -1/+1 |
| | |||||
* | Remove usage of re module from encodings package search function. | Marc-André Lemburg | 2003-05-16 | 1 | -4/+19 |
| | |||||
* | Whitespace normalization. | Tim Peters | 2003-04-24 | 3 | -10/+9 |
| | |||||
* | Implement IDNA (Internationalized Domain Names in Applications). | Martin v. Löwis | 2003-04-18 | 2 | -0/+409 |
| | |||||
* | Revert Patch #670715: iconv support. | Martin v. Löwis | 2003-04-03 | 2 | -39/+0 |
| | |||||
* | Handle iconv initialization erorrs | Neal Norwitz | 2003-02-28 | 1 | -1/+1 |
| | |||||
* | Patch #670715: Universal Unicode Codec for POSIX iconv. | Martin v. Löwis | 2003-01-26 | 2 | -0/+40 |
| | |||||
* | Whitespace normalization. | Tim Peters | 2002-12-24 | 1 | -1/+1 |
| | |||||
* | Add new encoding for Ukrainian Cyrillic | Neal Norwitz | 2002-10-17 | 1 | -0/+54 |
| | |||||
* | When looking for an alias, first look for the normalized name (which | Guido van Rossum | 2002-10-04 | 1 | -1/+3 |
| | | | | | still may contain dots), then if that doesn't exist look for the name with dots replaced by underscores. This is a little more forgiving. | ||||
* | Undo the removal. Guido mentioned that the encoding name is in active | Marc-André Lemburg | 2002-10-04 | 1 | -0/+1 |
| | | | | by some email headers. | ||||
* | Remove unneeded alias. | Marc-André Lemburg | 2002-10-04 | 1 | -1/+0 |
| | |||||
* | Fix doc-string. | Marc-André Lemburg | 2002-10-04 | 1 | -3/+3 |
| | |||||
* | Adapt lookup names to new more general encoding name normalization | Marc-André Lemburg | 2002-10-04 | 1 | -14/+14 |
| | | | | scheme. | ||||
* | Extending the encoding name normalization to handle more non-alphanumeric | Marc-André Lemburg | 2002-10-04 | 1 | -8/+20 |
| | | | | characters. | ||||
* | Oops, must convert hyphens to underscores in keys of aliases dict. | Guido van Rossum | 2002-09-26 | 1 | -1/+1 |
| | |||||
* | Add yet another alias for ASCII found in the field. Will backport to | Guido van Rossum | 2002-09-25 | 1 | -0/+1 |
| | | | | 2.2.2. | ||||
* | Whitespace normalization. | Tim Peters | 2002-08-23 | 1 | -1/+1 |
| | |||||
* | Patch #505705: Remove eval in pickle and cPickle. | Martin v. Löwis | 2002-08-14 | 1 | -0/+23 |
| | |||||
* | Whitespace normalization. | Tim Peters | 2002-08-08 | 73 | -5769/+5762 |
| | |||||
* | Revert #571603 since it is ok to import codecs that are not subdirectories | Martin v. Löwis | 2002-07-29 | 1 | -9/+12 |
| | | | | of encodings. Skip modules that don't have a getregentry function. | ||||
* | Patch #571603: Refer to encodings package explicitly. | Martin v. Löwis | 2002-07-28 | 1 | -1/+1 |
| | |||||
* | Palm OS encoding from Sjoerd Mullender | Marc-André Lemburg | 2002-07-12 | 1 | -0/+67 |
| | |||||
* | Fix for bug #222395: UTF-16 et al. don't handle .readline(). | Marc-André Lemburg | 2002-04-05 | 3 | -2/+9 |
| | | | | They now raise an NotImplementedError to hint to the truth ;-) | ||||
* | Corrected import behaviour for codecs which live outside the encodings | Marc-André Lemburg | 2002-02-11 | 2 | -17/+12 |
| | | | | package. | ||||
* | Add IANA character set aliases to the encodings alias dictionary | Marc-André Lemburg | 2002-02-10 | 2 | -106/+355 |
| | | | | | | | and make alias lookup lazy. Note that only those IANA character set aliases were added for which we actually have codecs in the encodings package. | ||||
* | Patch #487275: Add windows-1251 charset alias. | Martin v. Löwis | 2001-12-02 | 1 | -0/+1 |
| | |||||
* | Python part of the UTF-7 codec by Brian Quinlan. | Marc-André Lemburg | 2001-09-20 | 1 | -0/+27 |
| | |||||
* | Patch #435971: UTF-7 codec by Brian Quinlan. | Marc-André Lemburg | 2001-09-20 | 1 | -0/+4 |
| | |||||
* | Patch #462635 by Andrew Kuchling correcting bugs in the new | Marc-André Lemburg | 2001-09-20 | 5 | -11/+21 |
| | | | | | codecs -- the self argument does matter for Python functions (it does not for C functions which most other codecs use). | ||||
* | Fixed search function error reporting in the encodings package | Marc-André Lemburg | 2001-09-19 | 1 | -7/+11 |
| | | | | | | | | __init__.py module to raise errors which can be catched as LookupErrors as well as SystemErrors. Modified the error messages to include more information about the failing module. | ||||
* | Fix typo (PyChecker) | Andrew M. Kuchling | 2001-08-13 | 1 | -1/+1 |
| | |||||
* | Expose nl_langinfo through locale where available. | Martin v. Löwis | 2001-08-10 | 1 | -0/+2 |
| | |||||
* | This patch by Martin v. Loewis changes the UTF-16 codec to only | Marc-André Lemburg | 2001-06-19 | 1 | -3/+33 |
| | | | | | | | | | | | write a BOM at the start of the stream and also to only read it as BOM at the start of a stream. Subsequent reading/writing of BOMs will read/write the BOM as ZWNBSP character. This is in sync with the Unicode specifications. Note that UTF-16 files will now *have* to start with a BOM mark in order to be readable by the codec. | ||||
* | Patch #429957: Add support for cp1140, which is identical to cp037, | Martin v. Löwis | 2001-06-07 | 2 | -0/+50 |
| | | | | | with the addition of the euro character. Also added a few EDBDIC aliases. | ||||
* | Add some useful Windows encodings - patch #423221. | Mark Hammond | 2001-06-04 | 1 | -0/+5 |
| | |||||
* | Moved the encoding map building logic from the individual mapping | Marc-André Lemburg | 2001-05-16 | 53 | -159/+53 |
| | | | | | | codec files to codecs.py and added logic so that multi mappings in the decoding maps now result in mappings to None (undefined mapping) in the encoding maps. | ||||
* | Add quoted-printable codec | Guido van Rossum | 2001-05-15 | 2 | -0/+59 |
| | |||||
* | This patch changes the way the string .encode() method works slightly | Marc-André Lemburg | 2001-05-15 | 6 | -0/+407 |
| | | | | | | | | | | | | | | | | | | | | | | | | | and introduces a new method .decode(). The major change is that strg.encode() will no longer try to convert Unicode returns from the codec into a string, but instead pass along the Unicode object as-is. The same is now true for all other codec return types. The underlying C APIs were changed accordingly. Note that even though this does have the potential of breaking existing code, the chances are low since conversion from Unicode previously took place using the default encoding which is normally set to ASCII rendering this auto-conversion mechanism useless for most Unicode encodings. The good news is that you can now use .encode() and .decode() with much greater ease and that the door was opened for better accessibility of the builtin codecs. As demonstration of the new feature, the patch includes a few new codecs which allow string to string encoding and decoding (rot13, hex, zip, uu, base64). Written by Marc-Andre Lemburg. Copyright assigned to the PSF. | ||||
* | This patch changes the default behaviour of the builtin charmap | Marc-André Lemburg | 2001-01-03 | 52 | -267/+359 |
| | | | | | | | | | | | | | | | | codec to not apply Latin-1 mappings for keys which are not found in the mapping dictionaries, but instead treat them as undefined mappings. The patch was originally written by Martin v. Loewis with some additional (cosmetic) changes and an updated test script by Marc-Andre Lemburg. The standard codecs were recreated from the most current files available at the Unicode.org site using the Tools/scripts/gencodec.py tool. This patch closes the bugs #116285 and #119960. | ||||
* | Changed .getaliases() support to register the new aliases in the | Marc-André Lemburg | 2000-12-12 | 1 | -4/+12 |
| | | | | | | | | | | | encodings package aliases mapping dictionary rather than in the internal cache used by the search function. This enables aliases to take advantage of the full normalization process applied to encoding names which was previously not available. The patch restricts alias registration to new aliases. Existing aliases cannot be overridden anymore. | ||||
* | Spelling fixes supplied by Rob W. W. Hooft. All these are fixes in either | Thomas Wouters | 2000-07-16 | 1 | -2/+2 |
| | | | | | | | | | | comments, docstrings or error messages. I fixed two minor things in test_winreg.py ("didn't" -> "Didn't" and "Didnt" -> "Didn't"). There is a minor style issue involved: Guido seems to have preferred English grammar (behaviour, honour) in a couple places. This patch changes that to American, which is the more prominent style in the source. I prefer English myself, so if English is preferred, I'd be happy to supply a patch myself ;) | ||||
* | Marc-Andre Lemburg <mal@lemburg.com>: | Marc-André Lemburg | 2000-06-13 | 1 | -2/+2 |
| | | | | | Removed import of string module -- use string methods directly. Thanks to Finn Bock. | ||||
* | Marc-Andre Lemburg <mal@lemburg.com>: | Marc-André Lemburg | 2000-06-07 | 1 | -0/+22 |
| | | | | | Added some more codec aliases. Some of them are needed by the new locale.py encoding support. | ||||
* | New codec which always raises an exception when used. This | Marc-André Lemburg | 2000-06-07 | 1 | -0/+34 |
| | | | | | codec can be used to effectively switch off string coercion to Unicode. |