path: root/Lib/encodings
Commit message | Author | Date | Files | Lines
* Convert input to a string object. Fixes #909230. Backported 2.3. | Martin v. Löwis | 2004-03-23 | 1 | -0/+1
* Add a new unicode codec: ptcp154 (Kazakh) | Hye-Shik Chang | 2004-03-19 | 2 | -0/+168
* Fix wrong character mapping in koi8_u: SF bug #902501. | Marc-André Lemburg | 2004-02-23 | 1 | -1/+1
* Let the default encodings search function lookup aliases before trying the codec import. This allows applications to install codecs which override (non-special-cased) builtin codecs. | Marc-André Lemburg | 2004-01-20 | 1 | -18/+26
* Add some more code page aliases needed for completeness. | Marc-André Lemburg | 2004-01-20 | 1 | -0/+16
* Fix a typo: s/iso_3022/iso2022/ | Hye-Shik Chang | 2004-01-20 | 1 | -1/+1
* Add CJK codecs support as discussed on python-dev. (SF #873597) Several style fixes are suggested by Martin v. Loewis and Marc-Andre Lemburg. Thanks! | Hye-Shik Chang | 2004-01-17 | 21 | -9/+780
* Revert previous change. MAL preferred the old version. | Raymond Hettinger | 2003-12-01 | 1 | -4/+41
* Simplifed the code. | Raymond Hettinger | 2003-12-01 | 1 | -41/+4
* Fix typo in the comments. | Raymond Hettinger | 2003-09-24 | 1 | -1/+1
* Added codec for bz2 compression. | Raymond Hettinger | 2003-09-23 | 2 | -0/+67
* Support trailing dots in DNS names. Fixes #782510. Will backport to 2.3. | Martin v. Löwis | 2003-08-05 | 1 | -3/+15
* more generic reference to python interpreter | Skip Montanaro | 2003-07-22 | 1 | -1/+1
* Remove usage of re module from encodings package search function. | Marc-André Lemburg | 2003-05-16 | 1 | -4/+19
* Whitespace normalization. | Tim Peters | 2003-04-24 | 3 | -10/+9
* Implement IDNA (Internationalized Domain Names in Applications). | Martin v. Löwis | 2003-04-18 | 2 | -0/+409
* Revert Patch #670715: iconv support. | Martin v. Löwis | 2003-04-03 | 2 | -39/+0
* Handle iconv initialization erorrs | Neal Norwitz | 2003-02-28 | 1 | -1/+1
* Patch #670715: Universal Unicode Codec for POSIX iconv. | Martin v. Löwis | 2003-01-26 | 2 | -0/+40
* Whitespace normalization. | Tim Peters | 2002-12-24 | 1 | -1/+1
* Add new encoding for Ukrainian Cyrillic | Neal Norwitz | 2002-10-17 | 1 | -0/+54
* When looking for an alias, first look for the normalized name (which still may contain dots), then if that doesn't exist look for the name with dots replaced by underscores. This is a little more forgiving. | Guido van Rossum | 2002-10-04 | 1 | -1/+3
* Undo the removal. Guido mentioned that the encoding name is in active by some email headers. | Marc-André Lemburg | 2002-10-04 | 1 | -0/+1
* Remove unneeded alias. | Marc-André Lemburg | 2002-10-04 | 1 | -1/+0
* Fix doc-string. | Marc-André Lemburg | 2002-10-04 | 1 | -3/+3
* Adapt lookup names to new more general encoding name normalization scheme. | Marc-André Lemburg | 2002-10-04 | 1 | -14/+14
* Extending the encoding name normalization to handle more non-alphanumeric characters. | Marc-André Lemburg | 2002-10-04 | 1 | -8/+20
* Oops, must convert hyphens to underscores in keys of aliases dict. | Guido van Rossum | 2002-09-26 | 1 | -1/+1
* Add yet another alias for ASCII found in the field. Will backport to 2.2.2. | Guido van Rossum | 2002-09-25 | 1 | -0/+1
* Whitespace normalization. | Tim Peters | 2002-08-23 | 1 | -1/+1
* Patch #505705: Remove eval in pickle and cPickle. | Martin v. Löwis | 2002-08-14 | 1 | -0/+23
* Whitespace normalization. | Tim Peters | 2002-08-08 | 73 | -5769/+5762
* Revert #571603 since it is ok to import codecs that are not subdirectories of encodings. Skip modules that don't have a getregentry function. | Martin v. Löwis | 2002-07-29 | 1 | -9/+12
* Patch #571603: Refer to encodings package explicitly. | Martin v. Löwis | 2002-07-28 | 1 | -1/+1
* Palm OS encoding from Sjoerd Mullender | Marc-André Lemburg | 2002-07-12 | 1 | -0/+67
* Fix for bug #222395: UTF-16 et al. don't handle .readline(). They now raise an NotImplementedError to hint to the truth ;-) | Marc-André Lemburg | 2002-04-05 | 3 | -2/+9
* Corrected import behaviour for codecs which live outside the encodings package. | Marc-André Lemburg | 2002-02-11 | 2 | -17/+12
* Add IANA character set aliases to the encodings alias dictionary and make alias lookup lazy. Note that only those IANA character set aliases were added for which we actually have codecs in the encodings package. | Marc-André Lemburg | 2002-02-10 | 2 | -106/+355
* Patch #487275: Add windows-1251 charset alias. | Martin v. Löwis | 2001-12-02 | 1 | -0/+1
* Python part of the UTF-7 codec by Brian Quinlan. | Marc-André Lemburg | 2001-09-20 | 1 | -0/+27
* Patch #435971: UTF-7 codec by Brian Quinlan. | Marc-André Lemburg | 2001-09-20 | 1 | -0/+4
* Patch #462635 by Andrew Kuchling correcting bugs in the new codecs -- the self argument does matter for Python functions (it does not for C functions which most other codecs use). | Marc-André Lemburg | 2001-09-20 | 5 | -11/+21
* Fixed search function error reporting in the encodings package __init__.py module to raise errors which can be catched as LookupErrors as well as SystemErrors. Modified the error messages to include more information about the failing module. | Marc-André Lemburg | 2001-09-19 | 1 | -7/+11
* Fix typo (PyChecker) | Andrew M. Kuchling | 2001-08-13 | 1 | -1/+1
* Expose nl_langinfo through locale where available. | Martin v. Löwis | 2001-08-10 | 1 | -0/+2
* This patch by Martin v. Loewis changes the UTF-16 codec to only write a BOM at the start of the stream and also to only read it as BOM at the start of a stream. Subsequent reading/writing of BOMs will read/write the BOM as ZWNBSP character. This is in sync with the Unicode specifications. Note that UTF-16 files will now *have* to start with a BOM mark in order to be readable by the codec. | Marc-André Lemburg | 2001-06-19 | 1 | -3/+33
* Patch #429957: Add support for cp1140, which is identical to cp037, with the addition of the euro character. Also added a few EDBDIC aliases. | Martin v. Löwis | 2001-06-07 | 2 | -0/+50
* Add some useful Windows encodings - patch #423221. | Mark Hammond | 2001-06-04 | 1 | -0/+5
* Moved the encoding map building logic from the individual mapping codec files to codecs.py and added logic so that multi mappings in the decoding maps now result in mappings to None (undefined mapping) in the encoding maps. | Marc-André Lemburg | 2001-05-16 | 53 | -159/+53
* Add quoted-printable codec | Guido van Rossum | 2001-05-15 | 2 | -0/+59