summaryrefslogtreecommitdiffstats
path: root/Doc/lib/libre.tex
diff options
context:
space:
mode:
authorSkip Montanaro <skip@pobox.com>2002-03-04 23:08:28 (GMT)
committerSkip Montanaro <skip@pobox.com>2002-03-04 23:08:28 (GMT)
commita8e1d81bdc9cb45cc071eff2991a1e96d4b22e7c (patch)
tree663009a3a16231ede237090b821708e6a5104704 /Doc/lib/libre.tex
parentdd989e1ce795dac837b51e40883dbd6f38dd9dd8 (diff)
downloadcpython-a8e1d81bdc9cb45cc071eff2991a1e96d4b22e7c.zip
cpython-a8e1d81bdc9cb45cc071eff2991a1e96d4b22e7c.tar.gz
cpython-a8e1d81bdc9cb45cc071eff2991a1e96d4b22e7c.tar.bz2
add simple example of avoiding backtracking
Diffstat (limited to 'Doc/lib/libre.tex')
-rw-r--r--Doc/lib/libre.tex28
1 files changed, 22 insertions, 6 deletions
diff --git a/Doc/lib/libre.tex b/Doc/lib/libre.tex
index 1c14072..ab51c10 100644
--- a/Doc/lib/libre.tex
+++ b/Doc/lib/libre.tex
@@ -802,10 +802,6 @@ The string passed to \function{match()} or \function{search()}.
\subsection{Examples}
-%\begin{list}{}{\leftmargin 0.7in \labelwidth 0.65in}
-
-%\item[Simulating scanf]
-
\leftline{\strong{Simulating \cfunction{scanf()}}}
Python does not currently have an equivalent to \cfunction{scanf()}.
@@ -851,7 +847,27 @@ you would use a \cfunction{scanf()} format like
The equivalent regular expression would be
\begin{verbatim}
- ([^\s]+) - (\d+) errors, (\d+) warnings
+ (\S+) - (\d+) errors, (\d+) warnings
+\end{verbatim}
+
+\leftline{\strong{Avoiding backtracking}}
+
+If you create regular expressions that require the engine to perform a lot
+of backtracking, you may encounter a RuntimeError exception with the message
+\code{maximum recursion limit exceeded}. For example,
+
+\begin{verbatim}
+ >>> s = "<" + "that's a very big string!"*1000 + ">"
+ >>> re.match('<.*?>', s)
+ Traceback (most recent call last):
+ File "<stdin>", line 1, in ?
+ File "/usr/local/lib/python2.3/sre.py", line 132, in match
+ return _compile(pattern, flags).match(string)
+ RuntimeError: maximum recursion limit exceeded
\end{verbatim}
-%\end{list}
+You can often restructure your regular expression to avoid backtracking.
+The above regular expression can be recast as \regexp{\textless
+[\textasciicircum \textgreater]*\textgreater}. As a further
+benefit, such regular expressions will run faster than their backtracking
+equivalents.