summaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
authorFred Drake <fdrake@acm.org>2002-11-12 23:12:54 (GMT)
committerFred Drake <fdrake@acm.org>2002-11-12 23:12:54 (GMT)
commit3d03968c75f834dcd9d58b4aaf7475d9d2d72d43 (patch)
treee07e655bbad2b277d01a247a20fd387572c1012a
parent82c72310718024f0a50f684b497ef9e9b62971e5 (diff)
downloadcpython-3d03968c75f834dcd9d58b4aaf7475d9d2d72d43.zip
cpython-3d03968c75f834dcd9d58b4aaf7475d9d2d72d43.tar.gz
cpython-3d03968c75f834dcd9d58b4aaf7475d9d2d72d43.tar.bz2
Clarified meaning of \w and \W with respect to the UNICODE and LOCALE flags.
Closes SF bug #635595.
-rw-r--r--Doc/lib/libre.tex10
1 files changed, 5 insertions, 5 deletions
diff --git a/Doc/lib/libre.tex b/Doc/lib/libre.tex
index b8aeb40..ddeee2a 100644
--- a/Doc/lib/libre.tex
+++ b/Doc/lib/libre.tex
@@ -347,10 +347,10 @@ equivalent to the set \regexp{[ \e t\e n\e r\e f\e v]}.
equivalent to the set \regexp{[\textasciicircum\ \e t\e n\e r\e f\e v]}.
\item[\code{\e w}]When the \constant{LOCALE} and \constant{UNICODE}
-flags are not specified,
-matches any alphanumeric character; this is equivalent to the set
+flags are not specified, matches any alphanumeric character and the
+underscore; this is equivalent to the set
\regexp{[a-zA-Z0-9_]}. With \constant{LOCALE}, it will match the set
-\regexp{[0-9_]} plus whatever characters are defined as letters for
+\regexp{[0-9_]} plus whatever characters are defined as alphanumeric for
the current locale. If \constant{UNICODE} is set, this will match the
characters \regexp{[0-9_]} plus whatever is classified as alphanumeric
in the Unicode character properties database.
@@ -359,9 +359,9 @@ in the Unicode character properties database.
flags are not specified, matches any non-alphanumeric character; this
is equivalent to the set \regexp{[{\textasciicircum}a-zA-Z0-9_]}. With
\constant{LOCALE}, it will match any character not in the set
-\regexp{[0-9_]}, and not defined as a letter for the current locale.
+\regexp{[0-9_]}, and not defined as alphanumeric for the current locale.
If \constant{UNICODE} is set, this will match anything other than
-\regexp{[0-9_]} and characters marked at alphanumeric in the Unicode
+\regexp{[0-9_]} and characters marked as alphanumeric in the Unicode
character properties database.
\item[\code{\e Z}]Matches only at the end of the string.