summaryrefslogtreecommitdiffstats
path: root/Modules/_codecsmodule.c
Commit message (Collapse)AuthorAgeFilesLines
* The incremental decoder for utf-7 must preserve its state between calls.Amaury Forgeot d'Arc2007-11-201-6/+13
| | | | | | | Solves issue1460. Might not be a backport candidate: a new API function was added, and some code may rely on details in utf-7.py.
* Backport r57105 and r57145 from the py3k branch: UTF-32 codecs.Walter Dörwald2007-08-171-0/+204
|
* Document that codecs.lookup() returns a CodecInfo object.Walter Dörwald2007-07-191-4/+5
| | | | (fixes SF bug #1754453).
* Klocwork made another run and found a bunch more problems.Neal Norwitz2006-08-121-1/+2
| | | | | | | This is the first batch of fixes that should be easy to verify based on context. This fixes problem numbers: 220 (ast), 323-324 (symtable), 321-322 (structseq), 215 (array), 210 (hotshot), 182 (codecs), 209 (etree).
* Correction of patch #1455898: In the mbcs decoder, set final=FalseMartin v. Löwis2006-08-021-1/+1
| | | | for stream decoder, but final=True for the decode function.
* Patch #1455898: Incremental mode for "mbcs" codec.Martin v. Löwis2006-06-141-5/+10
|
* Patch #1359618: Speed-up charmap encoder.Martin v. Löwis2006-06-041-0/+10
|
* Make use of METH_O and METH_NOARGS where possible.Georg Brandl2006-05-291-51/+15
| | | | Use Py_UnpackTuple instead of PyArg_ParseTuple where possible.
* Explain why 'consumed' is initialized.Martin v. Löwis2006-05-271-8/+4
|
* Fix more ssize_t issues.Martin v. Löwis2006-04-221-3/+3
|
* Merge ssize_t branch.Martin v. Löwis2006-02-151-23/+48
|
* Whitespace cleanup.Walter Dörwald2005-11-021-27/+27
|
* SF bug #1251300: On UCS-4 builds the "unicode-internal" codec will now complainWalter Dörwald2005-08-301-2/+2
| | | | | about illegal code points. The codec now supports PEP 293 style error handlers. (This is a variant of the Nik Haldimann's patch that detects truncated data)
* Build with --disable-unicode again. Fixes #1158607.Martin v. Löwis2005-03-081-0/+14
| | | | Will backport to 2.4.
* SF patch #998993: The UTF-8 and the UTF-16 stateful decoders now supportWalter Dörwald2004-09-071-24/+57
| | | | | | | | | | | decoding incomplete input (when the input stream is temporarily exhausted). codecs.StreamReader now implements buffering, which enables proper readline support for the UTF-16 decoders. codecs.StreamReader.read() has a new argument chars which specifies the number of characters to return. codecs.StreamReader.readline() and codecs.StreamReader.readlines() have a new argument keepends. Trailing "\n"s will be stripped from the lines if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and PyUnicode_DecodeUTF16Stateful.
* Change some declarations from ``char *`` to ``const char *``. Also addedBrett Cannon2004-07-101-6/+8
| | | | | docstrings for decode and encode; accidentally were left out of the PyMethodDev table.
* Add generic codecs.encode() and .decode() APIs that don't imposeMarc-André Lemburg2004-07-101-4/+72
| | | | any restriction on the return type (like unicode.encode() et al. do).
* Fix for [ 543344 ] Interpreter crashes when recoding; suggestedMarc-André Lemburg2003-02-041-1/+3
| | | | | | by Michael Stone (mbrierst). Python 2.1.4, 2.2.2 candidate.
* Add docstrings to register, lookup, register_errorWalter Dörwald2002-10-311-4/+36
| | | | and lookup_error. This closes SF patch #630622.
* PEP 293 implemention (from SF patch http://www.python.org/sf/432401)Walter Dörwald2002-09-021-0/+28
|
* Patch #505705: Remove eval in pickle and cPickle.Martin v. Löwis2002-08-141-1/+45
|
* Excise DL_EXPORT/DL_IMPORT from Modules/*. Required adding a prototypeMark Hammond2002-08-021-1/+1
| | | | | | for Py_Main(). Thanks to Kalle Svensson and Skip Montanaro for the patches.
* Patch #569753: Remove support for WIN16.Martin v. Löwis2002-06-301-5/+5
| | | | Rename all occurrences of MS_WIN32 to MS_WINDOWS.
* Patch #477750: Use METH_ constants in Modules.Martin v. Löwis2002-01-171-29/+29
|
* Patch #435971: UTF-7 codec by Brian Quinlan.Marc-André Lemburg2001-09-201-0/+42
|
* Patch #445762: Support --disable-unicodeMartin v. Löwis2001-08-171-0/+4
| | | | | | | | - Do not compile unicodeobject, unicodectype, and unicodedata if Unicode is disabled - check for Py_USING_UNICODE in all places that use Unicode functions - disables unicode literals, and the builtin functions - add the types.StringTypes list - remove Unicode literals from most tests.
* experimental UCS-4 support: don't assume that MS_WIN32 impliesFredrik Lundh2001-06-261-3/+3
| | | | HAVE_USABLE_WCHAR_T
* Fix for bug [ #433047 ] missing args to PyArg_ParseTupleMarc-André Lemburg2001-06-171-2/+2
|
* Added a true unicode_internal_encode function and fixed theMarc-André Lemburg2000-09-211-9/+43
| | | | | unicode_internal_decode function to support Unicode objects directly rather than by generating a copy of the object.
* Changing the CNRI copyright notice according to CNRI's instructions.Guido van Rossum2000-08-031-1/+1
| | | | | This is a notice without a date, which apparently is not a claim to copyright but only advice to the reader. IANAL. :-)
* Bunch of minor ANSIfications: 'void initfunc()' -> 'void initfunc(void)',Thomas Wouters2000-07-211-1/+1
| | | | | | | | | | | | | | | | | | and a couple of functions that were missed in the previous batches. Not terribly tested, but very carefully scrutinized, three times. All these were found by the little findkrc.py that I posted to python-dev, which means there might be more lurking. Cases such as this: long func(a, b) long a; long b; /* flagword */ { and other cases where the last ; in the argument list isn't followed by a newline and an opening curly bracket. Regexps to catch all are welcome, of course ;)
* Modified the encode interfaces of all builtin codecs to acceptMarc-André Lemburg2000-07-051-50/+105
| | | | | any object which can be converted to Unicode by means of PyUnicode_FromObject().
* Marc-Andre Lemburg:Guido van Rossum2000-03-281-0/+46
| | | | | | | | | | | | | | | The attached patch set includes a workaround to get Python with Unicode compile on BSDI 4.x (courtesy Thomas Wouters; the cause is a bug in the BSDI wchar.h header file) and Python interfaces for the MBCS codec donated by Mark Hammond. Also included are some minor corrections w/r to the docs of the new "es" and "es#" parser markers (use PyMem_Free() instead of free(); thanks to Mark Hammond for finding these). The unicodedata tests are now in a separate file (test_unicodedata.py) to avoid problems if the module cannot be found.
* Internal module _codecs -- Provides access to the codec registry andGuido van Rossum2000-03-101-0/+529
the builtin codecs. Written by Marc-Andre Lemburg.