summaryrefslogtreecommitdiffstats
path: root/Lib/html
Commit message (Collapse)AuthorAgeFilesLines
* bpo-41748: Handles unquoted attributes with commas (#24072)Karl Dubost2021-02-011-1/+1
| | | | | | | | | | | | | | | | | | * bpo-41748: Adds tests for unquoted attributes with comma * bpo-41748: Handles unquoted attributes with comma * bpo-41748: Addresses review comments * bpo-41748: Addresses review comments * Adds more test cases * Simplifies the regex for handling spaces * bpo-41748: Moves attributes tests under the right class * bpo-41748: Addresses review about duplicate attributes * bpo-41748: Adds NEWS.d entry for this patch
* bpo-37328: remove deprecated HTMLParser.unescape (GH-14186)Inada Naoki2019-08-271-8/+0
| | | It is deprecated since Python 3.4.
* bpo-30629: Remove second call of str.lower() in html.parser.parse_endtag. ↵Motoki Naruse2017-06-171-1/+1
| | | | | | (#2099) elem is the result of .lower() 6 lines above the handle_endtag call. Patch by Motoki Naruse
* Revert "Fixed a typo in the HTMLParser.feed docstrings" (#1771)Serhiy Storchaka2017-05-241-1/+1
| | | | | * Revert "Fixed a typo in the HTMLParser.feed docstrings. The docstring started with an 'r', like a The docstring was correct. I read the patch in opposite direction, as *adding* the "r" prefix. This reverts commit 5ba185039f1bd465d3f82531324fd3fe1ee42f0c.
* Fixed a typo in the HTMLParser.feed docstrings. The docstring started with ↵Jani Šumak2017-05-231-1/+1
| | | | an 'r', like a rawstring. (#1759)
* #27364: fix "incorrect" uses of escape character in the stdlib.R David Murray2016-09-081-2/+2
| | | | | | | And most of the tools. Patch by Emanual Barry, reviewed by me, Serhiy Storchaka, and Martin Panter.
* Issue #27076: Doc, comment and tests spelling fixesMartin Panter2016-05-261-1/+1
| | | | Most fixes to Doc/ and Lib/ directories by Ville Skyttä.
* Merge spelling fixes from 3.4 into 3.5Martin Panter2015-10-311-1/+1
|\
| * Fix some spelling errors in documentation and code commentsMartin Panter2015-10-311-1/+1
| |
* | #23144: merge with 3.4.Ezio Melotti2015-09-061-1/+9
|\ \ | |/
| * #23144: Make sure that HTMLParser.feed() returns all the data, even when ↵Ezio Melotti2015-09-061-1/+9
| | | | | | | | convert_charrefs is True.
* | Issue #23181: More "codepoint" -> "code point".Serhiy Storchaka2015-01-181-2/+2
|\ \ | |/
| * Issue #23181: More "codepoint" -> "code point".Serhiy Storchaka2015-01-181-2/+2
| |
* | #21047: set the default value for the *convert_charrefs* argument of ↵Ezio Melotti2014-08-021-8/+2
| | | | | | | | HTMLParser to True. Patch by Berker Peksag.
* | Add an __all__ to html.entities.Ezio Melotti2014-08-021-0/+3
| |
* | #15114: the strict mode and argument of HTMLParser, HTMLParser.error, and ↵Ezio Melotti2014-08-021-94/+12
|/ | | | the HTMLParserError exception have been removed.
* #20288: merge with 3.3.Ezio Melotti2014-02-011-3/+3
|\
| * #20288: fix handling of invalid numeric charrefs in HTMLParser.Ezio Melotti2014-02-011-3/+3
| |
* | #13633: Added a new convert_charrefs keyword arg to HTMLParser that, when ↵Ezio Melotti2013-11-231-17/+45
| | | | | | | | True, automatically converts all character references.
* | #19688: add back and deprecate the internal HTMLParser.unescape() method.Ezio Melotti2013-11-221-0/+7
| |
* | #2927: Added the unescape() function to the html module.Ezio Melotti2013-11-192-34/+118
| |
* | #19480: merge with 3.3.Ezio Melotti2013-11-071-9/+12
|\ \ | |/
| * #19480: HTMLParser now accepts all valid start-tag names as defined by the ↵Ezio Melotti2013-11-071-9/+12
| | | | | | | | HTML5 standard.
* | #15114: The html.parser module now raises a DeprecationWarning when the ↵Ezio Melotti2013-11-021-4/+10
| | | | | | | | strict argument of HTMLParser or the HTMLParser.error method are used.
* | #18020: improve html.escape speed by an order of magnitude. Patch by Matt ↵Ezio Melotti2013-07-071-7/+6
| | | | | | | | Bryant.
* | #17802: merge with 3.3.Ezio Melotti2013-05-011-0/+1
|\ \ | |/
| * #17802: Fix an UnboundLocalError in html.parser. Initial tests by Thomas ↵Ezio Melotti2013-05-011-0/+1
| | | | | | | | Barlow.
* | #14679: add an __all__ (that contains only HTMLParser) to html.parser.Ezio Melotti2013-05-011-0/+2
|/
* #16245: Fix the value of a few entities in html.entities.html5.Ezio Melotti2012-10-231-12/+12
|
* Reorder html.entities.html5 entities to make updates easier. Patch by ↵Ezio Melotti2012-10-231-109/+109
| | | | Iuliia Proskurnia.
* #15156: HTMLParser now uses the new "html.entities.html5" dictionary.Ezio Melotti2012-06-241-17/+15
|
* #11113: add a new "html5" dictionary containing the named character ↵Ezio Melotti2012-06-241-0/+2236
| | | | references defined by the HTML5 standard and the equivalent Unicode character(s) to the html.entities module.
* #15114: the strict mode of HTMLParser and the HTMLParseError exception are ↵Ezio Melotti2012-06-231-9/+12
| | | | deprecated now that the parser is able to parse invalid markup.
* #14538: HTMLParser can now parse correctly start tags that contain a bare /.Ezio Melotti2012-04-191-3/+3
|
* HTMLParser is now able to handle slashes in the start tag.Ezio Melotti2012-02-211-7/+11
|
* Fix an index and clean up comments.Ezio Melotti2012-02-131-1/+2
|
* Improve handling of declarations in HTMLParser.Ezio Melotti2012-02-131-8/+22
|
* #13993: HTMLParser is now able to handle broken end tags when strict=False.Ezio Melotti2012-02-131-15/+27
|
* #13960: HTMLParser is now able to handle broken comments when strict=False.Ezio Melotti2012-02-101-1/+24
|
* #13358: HTMLParser now calls handle_data only once for each CDATA.Ezio Melotti2011-11-181-3/+4
|
* #1745761, #755670, #13357, #12629, #1200313: improve attribute handling in ↵Ezio Melotti2011-11-141-9/+10
| | | | HTMLParser.
* #670664: Fix HTMLParser to correctly handle the content of ↵Ezio Melotti2011-11-011-4/+18
| | | | ``<script>...</script>`` and ``<style>...</style>``.
* #13273: fix a bug that prevented HTMLParser to properly detect some tags ↵Ezio Melotti2011-10-281-3/+2
| | | | when strict=False.
* Fix issue12938 - Update the docstring of html.escape. Include the ↵Senthil Kumaran2011-09-121-1/+2
| | | | information on single quote.
* #12888: Fix a bug in HTMLParser.unescape that prevented it to escape more ↵Ezio Melotti2011-09-051-1/+1
| | | | than 128 entities. Patch by Peter Otten.
* Merge 3.1Éric Araujo2011-05-251-1/+1
|\
| * Fix display of html.parser.HTMLParser.feed docstringÉric Araujo2011-05-041-1/+1
| |
| * Merged revisions 87542 via svnmerge fromSenthil Kumaran2010-12-281-7/+10
| | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r87542 | senthil.kumaran | 2010-12-28 23:55:16 +0800 (Tue, 28 Dec 2010) | 3 lines Fix Issue10759 - html.parser.unescape() fails on HTML entities with incorrect syntax ........
| * Merged revisions 81504 via svnmerge fromVictor Stinner2010-05-241-0/+3
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | svn+ssh://pythondev@svn.python.org/python/branches/py3k ................ r81504 | victor.stinner | 2010-05-24 23:46:25 +0200 (lun., 24 mai 2010) | 13 lines Recorded merge of revisions 81500-81501 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81500 | victor.stinner | 2010-05-24 23:33:24 +0200 (lun., 24 mai 2010) | 2 lines Issue #6662: Fix parsing of malformatted charref (&#bad;) ........ r81501 | victor.stinner | 2010-05-24 23:37:28 +0200 (lun., 24 mai 2010) | 2 lines Add the author of the last fix (Issue #6662) ........ ................
* | #7311: fix html.parser to accept non-ASCII attribute values.Ezio Melotti2011-04-071-1/+1
| |