| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
|
|
| |
and NEWS. Bugfix candidate? That's a dilemma for Anthony <wink>: /F
did fix a longstanding bug here, but the fix can cause code to raise an
exception that previously worked by accident.
|
|
|
|
| |
the error it really is (and always has been)
|
|
|
|
|
|
| |
\g<x> group reference followed by a character escape
(also restructured a few things on the way to fixing #449000)
|
| |
|
|
|
|
|
|
| |
- take locale into account for word boundary anchors (#410271)
- restored 2.0's *? behaviour (#233283, #408936 and others)
- speed up re.sub/re.subn
|
| |
|
|
|
|
|
| |
- removed __all__ cruft from internal modules (sorry, skip)
- don't assume ASCII for string escapes (sorry, per)
|
|
|
|
|
|
| |
also modified check_all function to suppress all warnings since they aren't
relevant to what this test is doing (allows quiet checking of regsub, for
instance)
|
| |
|
|
|
|
| |
compatibility patches.
|
| |
|
|
|
|
|
|
|
|
|
|
| |
-- added some more docstrings
-- fixed typo in scanner class (#125531)
-- the multiline flag (?m) should't affect the \Z operator (#127259)
-- fixed non-greedy backtracking bug (#123769, #127259)
-- added sre.DEBUG flag (currently dumps the parsed pattern structure)
-- fixed a couple of glitches in groupdict (the #126587 memory leak
had already been fixed by AMK)
|
|
|
|
|
|
|
|
| |
-- fixed negative lookbehind to work correctly at the beginning
of the target string (bug #117242)
-- improved syntax check; you can no longer refer to a group
inside itself (bug #110866)
|
| |
|
|
|
|
|
| |
character class. Fix provided by Andrew Kuchling. Closes bug
#116251.
|
|
|
|
| |
first scan. Closes bug #115040.
|
| |
|
|
|
|
|
| |
-- improved the SRE test harness: don't use asserts, test a few more
things (including more boundary conditions)
|
|
|
|
| |
"sre_parse.py missing '7' in DIGITS"
|
|
|
|
| |
I fixed the a bug in the regression test harness...)
|
|
|
|
|
|
|
| |
(this should fix Sjoerd's xmllib problem)
-- added skip field to INFO header
-- changed compiler to generate charset INFO header
-- changed trace messages to support post-mortem analysis
|
|
|
|
|
| |
-- added REPEAT_ONE operator
-- added ANY_ALL operator (used to represent "(?s).")
|
|
|
|
|
|
|
| |
-- fixed literal check in branch operator
(this broke test_tokenize, as reported by Mark Favas)
-- added REPEAT_ONE operator (still not enabled, though)
-- added some debugging stuff (maxlevel)
|
|
|
|
|
|
|
|
|
| |
-- reverted REPEAT operator to use "repeat context" strategy
(from 0.8.X), but done right this time.
-- got rid of backtracking stack; use nested SRE_MATCH calls
instead (should probably put it back again in 0.9.9 ;-)
-- properly reset state in scanner mode
-- don't use aggressive inlining by default
|
|
|
|
|
|
|
|
|
|
|
| |
+ added "regs" attribute
+ fixed "pos" and "endpos" attributes
+ reset "lastindex" and "lastgroup" in scanner methods
+ removed (?P#id) syntax; the "lastindex" and "lastgroup"
attributes are now always set
+ removed string module dependencies in sre_parse
+ better debugging support in sre_parse
+ various tweaks to build under 1.5.2
|
|
|
|
| |
- changed "group" operator to "groupref"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
the pattern must have a fixed width.
- got rid of array-module dependencies; the match pro-
gram is now stored inside the pattern object, rather
than in an extra string buffer.
- cleaned up a various of potential leaks, api abuses,
and other minors in the engine module.
- use mal's new isalnum macro, rather than my own work-
around.
- untabified test_sre.py. seems like I removed a couple
of trailing spaces in the process...
|
|
|
|
|
|
|
|
| |
"lastgroup" is the name of the last matched capturing group,
"lastindex" is the index of the same group. if no group was
matched, both attributes are set to None.
the (?P#) feature will be removed in the next relase.
|
|
|
|
|
|
|
|
|
|
| |
used by the code generator)
- changed max repeat value in engine (to match earlier array fix)
- added experimental "which part matched?" mechanism to sre; see
http://hem.passagen.se/eff/2000_07_01_bot-archive.htm#416954
or python-dev for details.
|
|
|
|
|
|
|
|
|
| |
speedup for some tests, including the python tokenizer.
-- added support for an optional charset anchor to the engine
(currently unused by the code generator).
-- removed workaround for array module bug.
|
|
|
|
| |
- minor comment touchups in the C module
|
|
|
|
|
|
|
|
| |
-- changed 1.6 to 2.0 in the file headers
-- fixed ISALNUM macro for the unicode locale. this
solution isn't perfect, but the best I can do with
Python's current unicode database.
|
|
|
|
| |
- fixed parser flag propagation (of all stupid bugs...)
|
|
|
|
| |
- removed evil tabs from sre_parse and sre_compile
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
-- added pickling support (only works if sre is imported)
-- fixed wordsize problems in engine
(instead of casting literals down to the character size,
cast characters up to the literal size (same as the code
word size). this prevents false hits when you're matching
a unicode pattern against an 8-bit string. (unfortunately,
this broke another test, but I think the test should be
changed in this case; more on that on python-dev)
-- added sre.purge function
(unofficial, clears the cache)
|
|
|
|
| |
- untabified sre_constants.py
|
|
|
|
| |
(closes #3 and #7 from the status report)
|
| |
|
|
|
|
|
|
| |
- fixed compiler problems when using locale/unicode flags
- fixed group/octal code parsing in sub/subn templates
|
|
|
|
|
|
|
|
|
| |
(those semantics are weird...)
- got rid of $Id$'s (for the moment, at least). in other
words, there should be no more "empty" checkins.
- internal: some minor cleanups.
|
|
|
|
| |
(closes the "SRE: symbolic reference" bug)
|
|
|
|
|
|
|
|
|
|
|
| |
tests in sre_patch back to previous version
- fixed return value from findall
- renamed a bunch of functions inside _sre (way too
many leading underscores...)
</F>
|
|
|
|
|
| |
- removed some hacks that worked around 1.6 alpha bugs
- removed bogus test code from sre_parse
|
| |
|
|
|
|
| |
latest public snapshot.""
|
|
|
|
|
| |
This one should work with unicode expressions, and compile
a bit more silently.
|
|
|
|
| |
untabifies the files. No actual code changes were made.
|
|
NOTE: THIS IS VERY ROUGH ALPHA CODE!
|