cpython.git - https://github.com/python/cpython.git

	Commit message	Author	Age	Files	Lines
*	Issue #27076: Doc, comment and tests spelling fixes	Martin Panter	2016-05-26	1	-1/+1
\|	Most fixes to Doc/ and Lib/ directories by Ville Skyttä.
*	Merge spelling fixes from 3.4 into 3.5	Martin Panter	2015-10-31	1	-1/+1
\|\
\| *	Fix some spelling errors in documentation and code comments	Martin Panter	2015-10-31	1	-1/+1
\| \|
* \|	#23144: merge with 3.4.	Ezio Melotti	2015-09-06	1	-1/+9
\|\ \
\| \|/
\| *	#23144: Make sure that HTMLParser.feed() returns all the data, even when convert_charrefs is True.	Ezio Melotti	2015-09-06	1	-1/+9
\| \|
* \|	Issue #23181: More "codepoint" -> "code point".	Serhiy Storchaka	2015-01-18	1	-2/+2
\|\ \
\| \|/
\| *	Issue #23181: More "codepoint" -> "code point".	Serhiy Storchaka	2015-01-18	1	-2/+2
\| \|
* \|	#21047: set the default value for the convert_charrefs argument of HTMLParser to True. Patch by Berker Peksag.	Ezio Melotti	2014-08-02	1	-8/+2
\| \|
* \|	Add an __all__ to html.entities.	Ezio Melotti	2014-08-02	1	-0/+3
\| \|
* \|	#15114: the strict mode and argument of HTMLParser, HTMLParser.error, and the HTMLParseError exception have been removed.	Ezio Melotti	2014-08-02	1	-94/+12
\|/
*	#20288: merge with 3.3.	Ezio Melotti	2014-02-01	1	-3/+3
\|\
\| *	#20288: fix handling of invalid numeric charrefs in HTMLParser.	Ezio Melotti	2014-02-01	1	-3/+3
\| \|
* \|	#13633: Added a new convert_charrefs keyword arg to HTMLParser that, when True, automatically converts all character references.	Ezio Melotti	2013-11-23	1	-17/+45
\| \|
* \|	#19688: add back and deprecate the internal HTMLParser.unescape() method.	Ezio Melotti	2013-11-22	1	-0/+7
\| \|
* \|	#2927: Added the unescape() function to the html module.	Ezio Melotti	2013-11-19	2	-34/+118
\| \|
* \|	#19480: merge with 3.3.	Ezio Melotti	2013-11-07	1	-9/+12
\|\ \
\| \|/
\| *	#19480: HTMLParser now accepts all valid start-tag names as defined by the HTML5 standard.	Ezio Melotti	2013-11-07	1	-9/+12
\| \|
* \|	#15114: The html.parser module now raises a DeprecationWarning when the strict argument of HTMLParser or the HTMLParser.error method are used.	Ezio Melotti	2013-11-02	1	-4/+10
\| \|
* \|	#18020: improve html.escape speed by an order of magnitude. Patch by Matt Bryant.	Ezio Melotti	2013-07-07	1	-7/+6
\| \|
* \|	#17802: merge with 3.3.	Ezio Melotti	2013-05-01	1	-0/+1
\|\ \
\| \|/
\| *	#17802: Fix an UnboundLocalError in html.parser. Initial tests by Thomas Barlow.	Ezio Melotti	2013-05-01	1	-0/+1
\| \|
* \|	#14679: add an __all__ (that contains only HTMLParser) to html.parser.	Ezio Melotti	2013-05-01	1	-0/+2
\|/
*	#16245: Fix the value of a few entities in html.entities.html5.	Ezio Melotti	2012-10-23	1	-12/+12
\|
*	Reorder html.entities.html5 entities to make updates easier. Patch by Iuliia Proskurnia.	Ezio Melotti	2012-10-23	1	-109/+109
\|
*	#15156: HTMLParser now uses the new "html.entities.html5" dictionary.	Ezio Melotti	2012-06-24	1	-17/+15
\|
*	#11113: add a new "html5" dictionary containing the named character references defined by the HTML5 standard and the equivalent Unicode character(s) to the html.entities module.	Ezio Melotti	2012-06-24	1	-0/+2236
\|
*	#15114: the strict mode of HTMLParser and the HTMLParseError exception are deprecated now that the parser is able to parse invalid markup.	Ezio Melotti	2012-06-23	1	-9/+12
\|
*	#14538: HTMLParser can now parse correctly start tags that contain a bare /.	Ezio Melotti	2012-04-19	1	-3/+3
\|
*	HTMLParser is now able to handle slashes in the start tag.	Ezio Melotti	2012-02-21	1	-7/+11
\|
*	Fix an index and clean up comments.	Ezio Melotti	2012-02-13	1	-1/+2
\|
*	Improve handling of declarations in HTMLParser.	Ezio Melotti	2012-02-13	1	-8/+22
\|
*	#13993: HTMLParser is now able to handle broken end tags when strict=False.	Ezio Melotti	2012-02-13	1	-15/+27
\|
*	#13960: HTMLParser is now able to handle broken comments when strict=False.	Ezio Melotti	2012-02-10	1	-1/+24
\|
*	#13358: HTMLParser now calls handle_data only once for each CDATA.	Ezio Melotti	2011-11-18	1	-3/+4
\|
*	#1745761, #755670, #13357, #12629, #1200313: improve attribute handling in HTMLParser.	Ezio Melotti	2011-11-14	1	-9/+10
\|
*	#670664: Fix HTMLParser to correctly handle the content of ``<script>...</script>`` and ``<style>...</style>``.	Ezio Melotti	2011-11-01	1	-4/+18
\|
*	#13273: fix a bug that prevented HTMLParser to properly detect some tags when strict=False.	Ezio Melotti	2011-10-28	1	-3/+2
\|
*	Fix issue12938 - Update the docstring of html.escape. Include the information on single quote.	Senthil Kumaran	2011-09-12	1	-1/+2
\|
*	#12888: Fix a bug in HTMLParser.unescape that prevented it to escape more than 128 entities. Patch by Peter Otten.	Ezio Melotti	2011-09-05	1	-1/+1
\|
*	Merge 3.1	Éric Araujo	2011-05-25	1	-1/+1
\|\
\| *	Fix display of html.parser.HTMLParser.feed docstring	Éric Araujo	2011-05-04	1	-1/+1
\| \|
\| *	Merged revisions 87542 via svnmerge from	Senthil Kumaran	2010-12-28	1	-7/+10
\| \|	svn+ssh://pythondev@svn.python.org/python/branches/py3k
\| \|	........
\| \|	r87542 \| senthil.kumaran \| 2010-12-28 23:55:16 +0800 (Tue, 28 Dec 2010) \| 3 lines
\| \|	Fix Issue10759 - html.parser.unescape() fails on HTML entities with incorrect syntax
\| \|	........
\| *	Merged revisions 81504 via svnmerge from	Victor Stinner	2010-05-24	1	-0/+3
\| \|	svn+ssh://pythondev@svn.python.org/python/branches/py3k
\| \|	................
\| \|	r81504 \| victor.stinner \| 2010-05-24 23:46:25 +0200 (lun., 24 mai 2010) \| 13 lines
\| \|	Recorded merge of revisions 81500-81501 via svnmerge from
\| \|	svn+ssh://pythondev@svn.python.org/python/trunk
\| \|	........
\| \|	r81500 \| victor.stinner \| 2010-05-24 23:33:24 +0200 (lun., 24 mai 2010) \| 2 lines
\| \|	Issue #6662: Fix parsing of malformatted charref (&#bad;)
\| \|	........
\| \|	r81501 \| victor.stinner \| 2010-05-24 23:37:28 +0200 (lun., 24 mai 2010) \| 2 lines
\| \|	Add the author of the last fix (Issue #6662)
\| \|	........
\| \|	................
* \|	#7311: fix html.parser to accept non-ASCII attribute values.	Ezio Melotti	2011-04-07	1	-1/+1
\| \|
* \|	Fix Issue10759 - html.parser.unescape() fails on HTML entities with incorrect syntax	Senthil Kumaran	2010-12-28	1	-7/+10
\| \|
* \|	#1486713: Add a tolerant mode to HTMLParser.	R. David Murray	2010-12-03	1	-16/+83
\| \|	The motivation for adding this option is that the functionality it provides used to be provided by sgmllib in Python2, and was used by, for example, BeautifulSoup. Without this option, the Python3 version of BeautifulSoup and the many programs that use it are crippled.
\| \|	The original patch was by 'kxroberto'. I modified it heavily but kept his heuristics and test. I also added additional heuristics to fix #975556, #1046092, and part of #6191. This patch should be completely backward compatible: the behavior with the default strict=True is unchanged.
* \|	#2830: add html.escape() helper and move cgi.escape() uses in the standard library to it.	Georg Brandl	2010-10-15	1	-1/+20
\| \|	It defaults to quote=True and also escapes single quotes, which makes casual use safer. The cgi.escape() interface is not touched, but emits a (silent) PendingDeprecationWarning.
* \|	Recorded merge of revisions 81500-81501 via svnmerge from	Victor Stinner	2010-05-24	1	-0/+3
\| \|	svn+ssh://pythondev@svn.python.org/python/trunk
\| \|	........
\| \|	r81500 \| victor.stinner \| 2010-05-24 23:33:24 +0200 (lun., 24 mai 2010) \| 2 lines
\| \|	Issue #6662: Fix parsing of malformatted charref (&#bad;)
\| \|	........
\| \|	r81501 \| victor.stinner \| 2010-05-24 23:37:28 +0200 (lun., 24 mai 2010) \| 2 lines
\| \|	Add the author of the last fix (Issue #6662)
\| \|	........
\|/
*	#2834: Change re module semantics, so that str and bytes mixing is forbidden,	Antoine Pitrou	2008-08-19	1	-1/+1
\|	and str (unicode) patterns get full unicode matching by default. The re.ASCII flag is also introduced to ask for ASCII matching instead.
*	Change test_htmlparser to reflect the HTMLParser -> html.parser	Mark Dickinson	2008-05-21	1	-1/+1
\|	rename in r63439. Also fix one occurrence of unichr() in html.parser.