| Commit message | Author | Age | Files | Lines |
| |
(this should fix Sjoerd's xmllib problem)
-- added skip field to INFO header
-- changed compiler to generate charset INFO header
-- changed trace messages to support post-mortem analysis
| |
-- added REPEAT_ONE operator
-- added ANY_ALL operator (used to represent "(?s).")
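The behaviour that the ANY_ALL operator stands for can be seen from the Python side. The sketch below uses the modern `re` module, on the assumption that it keeps the same "(?s)." semantics as the SRE release described in this log; the variable names are illustrative only.

```python
import re

# Without the (?s) flag, "." refuses to match a newline.
plain = re.compile(r".")
# With (?s) (DOTALL), "." matches any character, newline included.
dotall = re.compile(r"(?s).")

newline_plain = plain.match("\n")
newline_dotall = dotall.match("\n")

print(newline_plain)                      # None
print(newline_dotall.group(0) == "\n")    # True
```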
| |
-- fixed literal check in branch operator
(this broke test_tokenize, as reported by Mark Favas)
-- added REPEAT_ONE operator (still not enabled, though)
-- added some debugging stuff (maxlevel)
| |
-- reverted REPEAT operator to use "repeat context" strategy
(from 0.8.X), but done right this time.
-- got rid of backtracking stack; use nested SRE_MATCH calls
instead (should probably put it back again in 0.9.9 ;-)
-- properly reset state in scanner mode
-- don't use aggressive inlining by default
| |
+ added "regs" attribute
+ fixed "pos" and "endpos" attributes
+ reset "lastindex" and "lastgroup" in scanner methods
+ removed (?P#id) syntax; the "lastindex" and "lastgroup"
attributes are now always set
+ removed string module dependencies in sre_parse
+ better debugging support in sre_parse
+ various tweaks to build under 1.5.2
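As a rough illustration of the "regs", "pos" and "endpos" match attributes mentioned above, here is a small sketch against the modern `re` module (an assumption: the attributes behave there as they did in this release). The pattern and sample string are made up for the example.

```python
import re

pattern = re.compile(r"(\w+)-(\d+)")

# pos/endpos restrict the search window without slicing the string,
# so reported offsets still refer to the full string.
m = pattern.search("xx abc-123 yy", 3, 10)

window = (m.pos, m.endpos)   # the values handed to search()
spans = m.regs               # one (start, end) pair per group, group 0 first

print(window)   # (3, 10)
print(spans)    # ((3, 10), (3, 6), (7, 10))
```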
| |
- reorganized some code to get rid of -Wall and -W4
warnings
- fixed default argument handling for sub/subn/split
methods (reported by Peter Schneider-Kamp).
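For reference, this is how the default arguments of the sub/subn/split methods behave in the modern `re` module; whether this matches the exact bug that was fixed here is an assumption, since the log does not describe it. The sample text is illustrative.

```python
import re

pattern = re.compile(r"\s+")
text = "a  b c   d"

# count defaults to 0, meaning "replace every occurrence".
all_replaced = pattern.sub("-", text)
first_only = pattern.sub("-", text, count=1)

# subn returns the new string together with the number of replacements.
replaced, n = pattern.subn("-", text)

# maxsplit defaults to 0, meaning "split at every occurrence".
pieces = pattern.split(text)
two_pieces = pattern.split(text, maxsplit=1)

print(all_replaced, first_only, n, pieces, two_pieces)
```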
| |
- changed "group" operator to "groupref"
| |
the pattern must have a fixed width.
- got rid of array-module dependencies; the match program
is now stored inside the pattern object, rather than in
an extra string buffer.
- cleaned up various potential leaks, api abuses,
and other minor issues in the engine module.
- use mal's new isalnum macro, rather than my own workaround.
- untabified test_sre.py. seems like I removed a couple
of trailing spaces in the process...
| |
"lastgroup" is the name of the last matched capturing group,
"lastindex" is the index of the same group. if no group was
matched, both attributes are set to None.
the (?P#) feature will be removed in the next release.
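The "lastgroup"/"lastindex" semantics described above can be checked with a short sketch. It uses the modern `re` module and a made-up tokenizer-style pattern; both are assumptions for illustration, not code from this checkin.

```python
import re

# Two named alternatives plus one unnamed group.
pattern = re.compile(r"(?P<word>[a-z]+)|(?P<number>\d+)|(\s+)")

m_word = pattern.match("abc")
m_num = pattern.match("42")
m_space = pattern.match("  ")
m_nogroup = re.match(r"abc", "abc")   # pattern without any group

print(m_word.lastgroup, m_word.lastindex)        # word 1
print(m_num.lastgroup, m_num.lastindex)          # number 2
print(m_space.lastgroup, m_space.lastindex)      # None 3 (group has no name)
print(m_nogroup.lastgroup, m_nogroup.lastindex)  # None None
```

Dispatching on "lastgroup" like this is one way to find out which alternative matched without probing every group in turn.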
| |
used by the code generator)
- changed max repeat value in engine (to match earlier array fix)
- added experimental "which part matched?" mechanism to sre; see
http://hem.passagen.se/eff/2000_07_01_bot-archive.htm#416954
or python-dev for details.
| |
speedup for some tests, including the python tokenizer.
-- added support for an optional charset anchor to the engine
(currently unused by the code generator).
-- removed workaround for array module bug.
| |
-- changed 1.6 to 2.0 in the file headers
-- fixed ISALNUM macro for the unicode locale. this
solution isn't perfect, but the best I can do with
Python's current unicode database.
| |
- fixed parser flag propagation (of all stupid bugs...)
| |
- removed evil tabs from sre_parse and sre_compile
| |
-- added pickling support (only works if sre is imported)
-- fixed wordsize problems in engine
(instead of casting literals down to the character size,
cast characters up to the literal size (same as the code
word size). this prevents false hits when you're matching
a unicode pattern against an 8-bit string. unfortunately,
this broke another test, but I think the test should be
changed in this case; more on that on python-dev)
-- added sre.purge function
(unofficial, clears the cache)
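A minimal sketch of the pickling support and the purge function, written against the modern `re` module. The log names the function `sre.purge`; using `re.purge` here is an assumption about the present-day spelling, and the pattern is illustrative.

```python
import pickle
import re

pattern = re.compile(r"(\d+)-(\d+)", re.IGNORECASE)

# Round-trip the compiled pattern through pickle; it is rebuilt
# from its source string and flags when loaded.
restored = pickle.loads(pickle.dumps(pattern))

same_source = restored.pattern == pattern.pattern
same_flags = restored.flags == pattern.flags
result = restored.match("10-20").groups()

# Drop all cached compiled patterns held by the module.
re.purge()

print(same_source, same_flags, result)
```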
| |
- untabified sre_constants.py
| |
- fixed compiler problems when using locale/unicode flags
- fixed group/octal code parsing in sub/subn templates
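To show what group and octal codes in sub/subn templates look like, here is a small sketch using the modern `re` module. It assumes the template rules are the same as in this release; the sample strings are invented.

```python
import re

# Numeric group references in the template.
swapped = re.sub(r"(\w+) (\w+)", r"\2 \1", "hello world")

# Named group references via \g<name>.
named = re.sub(
    r"(?P<first>\w+) (?P<second>\w+)",
    r"\g<second> \g<first>",
    "hello world",
)

# \g<1>0 means "group 1, then a literal 0" (not group 10).
padded = re.sub(r"(\d)", r"\g<1>0", "7")

# Three octal digits in the template are an octal escape: \101 is "A".
octal = re.sub(r"x", r"\101", "x")

print(swapped, named, padded, octal)
```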
| |
group reset problem. in the meantime, I added some
optimizations:
- added "inline" directive to LOCAL
(this assumes that AC_C_INLINE does what it's
supposed to do). to compile SRE on a non-unix
platform that doesn't support inline, you have
to add a "#define inline" somewhere...
- added code to generate a SRE_OP_INFO primitive
- added code to do fast prefix search
(enabled by the USE_FAST_SEARCH define; default
is on, in this release)
| |
(those semantics are weird...)
- got rid of $Id$'s (for the moment, at least). in other
words, there should be no more "empty" checkins.
- internal: some minor cleanups.
| |
(test_sre still complains about split, but that's caused by
the group reset bug, not split itself)
- added more mark slots
(should be dynamically allocated, but 100 is better than 32.
and checking for the upper limit is better than overwriting
the memory ;-)
- internal: renamed the cursor helper class
- internal: removed some bloat from sre_compile
| |
- removed some hacks that worked around 1.6 alpha bugs
- removed bogus test code from sre_parse
| |
This one should work with unicode expressions, and compile
a bit more silently.
| |
untabifies the files. No actual code changes were made.
|
|
NOTE: THIS IS VERY ROUGH ALPHA CODE!
|