diff options
author | Martin v. Löwis <martin@v.loewis.de> | 2010-12-30 08:36:37 (GMT) |
---|---|---|
committer | Martin v. Löwis <martin@v.loewis.de> | 2010-12-30 08:36:37 (GMT) |
commit | 0dbebc02edfb5489248ab42f14f90cf9435e5796 (patch) | |
tree | b4eab3a7b1e433202b61fb370b971431d967352e | |
parent | 627284c67fcc812c75803d47ed06dd115f813b50 (diff) | |
download | cpython-0dbebc02edfb5489248ab42f14f90cf9435e5796.zip cpython-0dbebc02edfb5489248ab42f14f90cf9435e5796.tar.gz cpython-0dbebc02edfb5489248ab42f14f90cf9435e5796.tar.bz2 |
Issue #10542: Document that identifiers use XID_Start XID_Continue*.
-rw-r--r-- | Doc/reference/lexical_analysis.rst | 6 |
1 files changed, 5 insertions, 1 deletions
diff --git a/Doc/reference/lexical_analysis.rst b/Doc/reference/lexical_analysis.rst index 9fdb350..4b49738 100644 --- a/Doc/reference/lexical_analysis.rst +++ b/Doc/reference/lexical_analysis.rst @@ -292,9 +292,11 @@ Unicode Character Database as included in the :mod:`unicodedata` module. Identifiers are unlimited in length. Case is significant. .. productionlist:: - identifier: `id_start` `id_continue`* + identifier: `xid_start` `xid_continue`* id_start: <all characters in general categories Lu, Ll, Lt, Lm, Lo, Nl, the underscore, and characters with the Other_ID_Start property> id_continue: <all characters in `id_start`, plus characters in the categories Mn, Mc, Nd, Pc and others with the Other_ID_Continue property> + xid_start: <all characters in `id_start` whose NFKC normalization is in "id_start xid_continue*"> + xid_continue: <all characters in `id_continue` whose NFKC normalization is in "id_continue*"> The Unicode category codes mentioned above stand for: @@ -308,6 +310,8 @@ The Unicode category codes mentioned above stand for: * *Mc* - spacing combining marks * *Nd* - decimal numbers * *Pc* - connector punctuations +* *Other_ID_Start* - explicit list of characters in `PropList.txt <http://unicode.org/Public/UNIDATA/PropList.txt>`_ to support backwards compatibility +* *Other_ID_Continue* - likewise All identifiers are converted into the normal form NFKC while parsing; comparison of identifiers is based on NFKC. |