| Commit message (Collapse) | Author | Age | Files | Lines |
| |
|
|
|
| |
only available if asserts are left in the code, in cases where they
can't be triggered from Python code.
|
| |
|
|
|
| |
Fix gb18030 codec's bug that doesn't map two-byte characters on
GB18030 extension in encoding. (bug reported by Bjorn Stabell)
|
| |
|
|
|
|
| |
Bug #1728403: Fix a bug that CJKCodecs StreamReader hangs when it
reads a file that ends with incomplete sequence and sizehint argument
for .read() is specified.
|
| |
|
|
|
| |
Bug #1572832: fix a bug in ISO-2022 codecs which may cause segfault
when encoding non-BMP unicode characters. (Submitted by Ray Chason)
|
| |
|
|
|
|
|
|
|
| |
Fixed a few bugs on cjkcodecs:
- gbk and gb18030 codec now handle U+30FB KATAKANA MIDDLE DOT correctly.
- iso2022_jp_2 codec now encodes into G0 for KS X 1001, GB2312
codepoints to conform the standard.
- iso2022_jp_3 and iso2022_jp_2004 codec can encode JIS X 0213:2
codepoints now.
|
| |
|
|
| |
decoder. (found by Neal Norwitz)
|
| |
|
|
| |
Use Py_UnpackTuple instead of PyArg_ParseTuple where possible.
|
| | |
|
| |
|
|
|
|
| |
Add empty __init__ methods for stateful multibytecodec instances.
This resolves a problem found by Thomas Wouters:
http://mail.python.org/pipermail/python-dev/2006-April/064051.html
|
| | |
|
| |
|
|
| |
changes.
|
| | |
|
| | |
|
| |
|
|
| |
(reviewed by Walter Dörwald)
|
| |
|
|
| |
SS2 (single-shift 2) escape sequences correctly.
|
| | |
|
| |
|
|
| |
- Do some minor code clean-ups.
|
| | |
|
| | |
|
| |
|
|
| |
build problem on AIX.
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
In C++, it's an error to pass a string literal to a char* function
without a const_cast(). Rather than require every C++ extension
module to put a cast around string literals, fix the API to state the
const-ness.
I focused on parts of the API where people usually pass literals:
PyArg_ParseTuple() and friends, Py_BuildValue(), PyMethodDef, the type
slots, etc. Predictably, there were a large set of functions that
needed to be fixed as a result of these changes. The most pervasive
change was to make the keyword args list passed to
PyArg_ParseTupleAndKewords() to be a const char *kwlist[].
One cast was required as a result of the changes: A type object
mallocs the memory for its tp_doc slot and later frees it.
PyTypeObject says that tp_doc is const char *; but if the type was
created by type_new(), we know it is safe to cast to char *.
|
| |
|
|
|
| |
from U+FFFD to U+FFFE which is considered more appropriate. (from
MAL's comment)
|
| |
|
|
|
|
|
| |
Fix segfault. I tried to write a test, but it wouldn't crash
when running regrtest. This really should have some sort of test.
Should definitely be backported.
|
| | |
|
| |
|
|
|
| |
found on legacy C compilers of HP-UX, IRIX and Tru64. (Reported
by roadkill, Richard Townsend, Maik Hertha and Minsik Kim)
|
| |
|
|
|
|
| |
aren't intended to be part of Python distributiuon. This was
accidently imported on mass converting from standalone version of
CJKCodecs.
|
| |
|
|
| |
And unifdef(1) compatibility blocks.
|
| |
|
|
| |
yet.
|
| |
|
|
| |
and width clearly don't need to be signed.
|
| | |
|
| |
|
|
|
|
|
|
| |
to extern. It's not legal C to say
static whatever[];
because the size isn't given. Presumably this is a gcc extension.
|
| |
|
|
|
|
| |
and installed layouts to make maintenance simple and easy. And it
also adds four new codecs; big5hkscs, euc-jis-2004, shift-jis-2004
and iso2022-jp-2004.
|
| | |
|
| | |
|
|
|
Several style fixes are suggested by Martin v. Loewis and
Marc-Andre Lemburg. Thanks!
|