summaryrefslogtreecommitdiffstats
path: root/Lib/html
Commit message (Collapse)AuthorAgeFilesLines
* gh-100210: Correct the comment link for unescaping HTML (#100212)Jean-Christophe Amiel2023-02-191-1/+1
| | | gh-100210: correct the comment link for unescaping HTML
* gh-97669: Create Tools/build/ directory (#97963)Victor Stinner2022-10-171-1/+1
| | | | | | | | | | | | | | | | | | | | | | | Create Tools/build/ directory. Move the following scripts from Tools/scripts/ to Tools/build/: * check_extension_modules.py * deepfreeze.py * freeze_modules.py * generate_global_objects.py * generate_levenshtein_examples.py * generate_opcode_h.py * generate_re_casefix.py * generate_sre_constants.py * generate_stdlib_module_names.py * generate_token.py * parse_html5_entities.py * smelly.py * stable_abi.py * umarshal.py * update_file.py * verify_ensurepip_wheels.py Update references to these scripts.
* gh-95813: Improve HTMLParser from the view of inheritance (#95874)Dong-hee Na2022-08-181-1/+2
| | | | | | | * gh-95813: Improve HTMLParser from the view of inheritance * gh-95813: Add unittest * Address code review
* gh-82927: Update files related to HTML entities. (GH-92504)Ezio Melotti2022-06-211-3/+6
|
* Add source for character mappings (#92014)slateny2022-05-061-0/+1
|
* bpo-45421: Remove dead code from html.parser (GH-28847)Alberto Mardegan2021-10-121-7/+0
| | | | | Support for HtmlParserError was removed back in 2014 with commit 73a4359eb0eb624c588c5d52083ea4944f9787ea, however this small block was missed.
* Fix typos in the Lib directory (GH-28775)Christian Clauss2021-10-061-1/+1
| | | | | Fix typos in the Lib directory as identified by codespell. Co-authored-by: Terry Jan Reedy <tjreedy@udel.edu>
* bpo-41748: Handles unquoted attributes with commas (#24072)Karl Dubost2021-02-011-1/+1
| | | | | | | | | | | | | | | | | | * bpo-41748: Adds tests for unquoted attributes with comma * bpo-41748: Handles unquoted attributes with comma * bpo-41748: Addresses review comments * bpo-41748: Addresses review comments * Adds more test cases * Simplifies the regex for handling spaces * bpo-41748: Moves attributes tests under the right class * bpo-41748: Addresses review about duplicate attributes * bpo-41748: Adds NEWS.d entry for this patch
* bpo-37328: remove deprecated HTMLParser.unescape (GH-14186)Inada Naoki2019-08-271-8/+0
| | | It is deprecated since Python 3.4.
* bpo-30629: Remove second call of str.lower() in html.parser.parse_endtag. ↵Motoki Naruse2017-06-171-1/+1
| | | | | | (#2099) elem is the result of .lower() 6 lines above the handle_endtag call. Patch by Motoki Naruse
* Revert "Fixed a typo in the HTMLParser.feed docstrings" (#1771)Serhiy Storchaka2017-05-241-1/+1
| | | | | * Revert "Fixed a typo in the HTMLParser.feed docstrings. The docstring started with an 'r', like a The docstring was correct. I read the patch in opposite direction, as *adding* the "r" prefix. This reverts commit 5ba185039f1bd465d3f82531324fd3fe1ee42f0c.
* Fixed a typo in the HTMLParser.feed docstrings. The docstring started with ↵Jani Šumak2017-05-231-1/+1
| | | | an 'r', like a rawstring. (#1759)
* #27364: fix "incorrect" uses of escape character in the stdlib.R David Murray2016-09-081-2/+2
| | | | | | | And most of the tools. Patch by Emanual Barry, reviewed by me, Serhiy Storchaka, and Martin Panter.
* Issue #27076: Doc, comment and tests spelling fixesMartin Panter2016-05-261-1/+1
| | | | Most fixes to Doc/ and Lib/ directories by Ville Skyttä.
* Merge spelling fixes from 3.4 into 3.5Martin Panter2015-10-311-1/+1
|\
| * Fix some spelling errors in documentation and code commentsMartin Panter2015-10-311-1/+1
| |
* | #23144: merge with 3.4.Ezio Melotti2015-09-061-1/+9
|\ \ | |/
| * #23144: Make sure that HTMLParser.feed() returns all the data, even when ↵Ezio Melotti2015-09-061-1/+9
| | | | | | | | convert_charrefs is True.
* | Issue #23181: More "codepoint" -> "code point".Serhiy Storchaka2015-01-181-2/+2
|\ \ | |/
| * Issue #23181: More "codepoint" -> "code point".Serhiy Storchaka2015-01-181-2/+2
| |
* | #21047: set the default value for the *convert_charrefs* argument of ↵Ezio Melotti2014-08-021-8/+2
| | | | | | | | HTMLParser to True. Patch by Berker Peksag.
* | Add an __all__ to html.entities.Ezio Melotti2014-08-021-0/+3
| |
* | #15114: the strict mode and argument of HTMLParser, HTMLParser.error, and ↵Ezio Melotti2014-08-021-94/+12
|/ | | | the HTMLParserError exception have been removed.
* #20288: merge with 3.3.Ezio Melotti2014-02-011-3/+3
|\
| * #20288: fix handling of invalid numeric charrefs in HTMLParser.Ezio Melotti2014-02-011-3/+3
| |
* | #13633: Added a new convert_charrefs keyword arg to HTMLParser that, when ↵Ezio Melotti2013-11-231-17/+45
| | | | | | | | True, automatically converts all character references.
* | #19688: add back and deprecate the internal HTMLParser.unescape() method.Ezio Melotti2013-11-221-0/+7
| |
* | #2927: Added the unescape() function to the html module.Ezio Melotti2013-11-192-34/+118
| |
* | #19480: merge with 3.3.Ezio Melotti2013-11-071-9/+12
|\ \ | |/
| * #19480: HTMLParser now accepts all valid start-tag names as defined by the ↵Ezio Melotti2013-11-071-9/+12
| | | | | | | | HTML5 standard.
* | #15114: The html.parser module now raises a DeprecationWarning when the ↵Ezio Melotti2013-11-021-4/+10
| | | | | | | | strict argument of HTMLParser or the HTMLParser.error method are used.
* | #18020: improve html.escape speed by an order of magnitude. Patch by Matt ↵Ezio Melotti2013-07-071-7/+6
| | | | | | | | Bryant.
* | #17802: merge with 3.3.Ezio Melotti2013-05-011-0/+1
|\ \ | |/
| * #17802: Fix an UnboundLocalError in html.parser. Initial tests by Thomas ↵Ezio Melotti2013-05-011-0/+1
| | | | | | | | Barlow.
* | #14679: add an __all__ (that contains only HTMLParser) to html.parser.Ezio Melotti2013-05-011-0/+2
|/
* #16245: Fix the value of a few entities in html.entities.html5.Ezio Melotti2012-10-231-12/+12
|
* Reorder html.entities.html5 entities to make updates easier. Patch by ↵Ezio Melotti2012-10-231-109/+109
| | | | Iuliia Proskurnia.
* #15156: HTMLParser now uses the new "html.entities.html5" dictionary.Ezio Melotti2012-06-241-17/+15
|
* #11113: add a new "html5" dictionary containing the named character ↵Ezio Melotti2012-06-241-0/+2236
| | | | references defined by the HTML5 standard and the equivalent Unicode character(s) to the html.entities module.
* #15114: the strict mode of HTMLParser and the HTMLParseError exception are ↵Ezio Melotti2012-06-231-9/+12
| | | | deprecated now that the parser is able to parse invalid markup.
* #14538: HTMLParser can now parse correctly start tags that contain a bare /.Ezio Melotti2012-04-191-3/+3
|
* HTMLParser is now able to handle slashes in the start tag.Ezio Melotti2012-02-211-7/+11
|
* Fix an index and clean up comments.Ezio Melotti2012-02-131-1/+2
|
* Improve handling of declarations in HTMLParser.Ezio Melotti2012-02-131-8/+22
|
* #13993: HTMLParser is now able to handle broken end tags when strict=False.Ezio Melotti2012-02-131-15/+27
|
* #13960: HTMLParser is now able to handle broken comments when strict=False.Ezio Melotti2012-02-101-1/+24
|
* #13358: HTMLParser now calls handle_data only once for each CDATA.Ezio Melotti2011-11-181-3/+4
|
* #1745761, #755670, #13357, #12629, #1200313: improve attribute handling in ↵Ezio Melotti2011-11-141-9/+10
| | | | HTMLParser.
* #670664: Fix HTMLParser to correctly handle the content of ↵Ezio Melotti2011-11-011-4/+18
| | | | ``<script>...</script>`` and ``<style>...</style>``.
* #13273: fix a bug that prevented HTMLParser to properly detect some tags ↵Ezio Melotti2011-10-281-3/+2
| | | | when strict=False.