path: root/Tools
Commit message | Author | Age | Files | Lines
* Restructured Checker class to get rid of 'ext' table.Guido van Rossum1997-02-022-177/+165
| Links are now either in 'todo' or 'done', and ext links are handled more like local links, except that no further links are gathered (and sometimes they aren't checked, e.g. for mailto and news URLs). The -x option reverses its meaning: it now disables checking of ext links (they are moved to 'done' without checking). A new 'errors' table collects pages with bad links as we go -- redundant, but useful for the GUI version, which needs to report this as we go. Some new methods, including reset(). New checkpoint format.
|
| Adapted the GUI to the changes in the Checker class. Added Quit and "Start over" buttons, and a checkbox to disable checking external links. The details window now also shows bad links emanating from the selected page.
|
| Miscellaneous small changes.
* Add root URL entry box, separate start/stop/step buttons.Guido van Rossum1997-02-011-54/+131
| | | | If the user selects an item in 'To check', start checking there.
* Process <img> and <frame> tags. Don't bother skipping second href.Guido van Rossum1997-02-011-3/+12
|
* Check in another copy of tktools.py...Guido van Rossum1997-01-311-0/+367
|
* Tk interface to webchecker. Not fully featured yet, but usable.Guido van Rossum1997-01-311-0/+329
|
* Spin off checking of external page in a subroutine.Guido van Rossum1997-01-311-17/+20
| | | | | Increase MAXPAGE to 150K. Add back printing of __doc__ for usage message.
* Many misc changes.Guido van Rossum1997-01-311-95/+142
| - Faster HTML parser derived from SGMLParser (Fred Gansevles).
| - All manipulations of todo, done, ext, bad are done via methods, so a derived class can override them. Also moved the 'done' marking to dopage(), so run() is much simpler.
| - Added a method status() which returns a string containing the summary counts; added a "total" count.
| - Drop the guessing of the file type before opening the document -- we still need to check those links for validity!
| - Added a subroutine to close a connection which first slurps up the remaining data when it's an ftp URL -- apparently closing an ftp connection without reading till the end makes it hang.
| - Added -n option to skip running (only useful with -R).
| - The Checker object now has an instance variable which is set to 1 when it is changed. This is not pickled.
* Set proper User-agent header (Python-webchecker/<version>).Guido van Rossum1997-01-301-14/+21
| | | | | When -x is combined with -q, still do the checking, but don't print the errors in this phase -- they are reported by report_errors().
* Some refinements of the external-link checking code: insert the errorsGuido van Rossum1997-01-301-9/+22
| | | | | | in the 'bad' dictionary (sanitize them so they are picklable; the sanitation code is now a subroutine); don't check mailto: URLs; omit colon in Error message.
* Added -x option to check external links. Slooooow!Guido van Rossum1997-01-301-10/+32
|
* Catch I/O errors when parsing robots.txt file.Guido van Rossum1997-01-301-5/+13
| | | | Add version number, printed at startup in non-quiet mode.
* Basic README fileGuido van Rossum1997-01-301-0/+14
|
* Added robots.txt support, using Skip Montanaro's parser.Guido van Rossum1997-01-301-3/+38
| | | | | Fixed occasional inclusion of unpicklable objects (Message in errors). Changed indent of a few messages.
* Skip Montanaro's robots.txt parser.Guido van Rossum1997-01-301-0/+97
|
* web tree checkerGuido van Rossum1997-01-301-0/+488
|
* mime types guesserGuido van Rossum1997-01-301-0/+190
|
* Version 1.1. Fix memory leak and expensive comparison with None.Guido van Rossum1997-01-181-3/+22
|
* /usr/local/bin/python -> /usr/bin/env pythonGuido van Rossum1996-11-2728-29/+28
|
* Changed so that it replaces /usr/local/bin/python with /usr/bin/env python.Guido van Rossum1996-11-271-6/+6
|
* Changed logic so it now replaces anything that has #! and python inGuido van Rossum1996-11-271-6/+5
| | | | the first line, replacing the entire line.
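The logic described in the entry above can be sketched roughly as follows. This is a minimal illustration, not the committed script: the function name `fix_shebang` and its `new_line` parameter are hypothetical, and the replacement text is assumed to be the `/usr/bin/env python` line mentioned in the neighbouring entries.

```python
def fix_shebang(text, new_line="#! /usr/bin/env python"):
    """Replace the whole first line if it contains '#!' and 'python'.

    Hypothetical helper: the name and default value are assumptions,
    chosen only to illustrate the rule stated in the commit message.
    """
    # Split off the first line; sep is "" when the text has no newline.
    first, sep, rest = text.partition("\n")
    if first.startswith("#!") and "python" in first:
        # Discard the old interpreter line entirely and keep the rest.
        return new_line + sep + rest
    # Anything else (no #! line, or a non-Python interpreter) is untouched.
    return text
```

Files whose first line names another interpreter, or has no `#!` at all, pass through unchanged under this sketch.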
* Script to fix cprt noticeGuido van Rossum1996-11-271-0/+78
|
* Removed the USA'isms that aren't really trueBarry Warsaw1996-11-201-2/+4
|
* /usr/bin/env is slightly more portableBarry Warsaw1996-11-201-1/+1
|
* Added a few meta infoBarry Warsaw1996-11-201-0/+4
|
* Output original addressBarry Warsaw1996-11-191-4/+4
|
* Updated country codes as per ISO3166 10-Feb-1994Barry Warsaw1996-11-181-36/+183
|
* implementation complete. need to update country codesBarry Warsaw1996-11-181-22/+64
|
* Added `ru' --> RussiaBarry Warsaw1996-11-181-0/+1
|
* A rewrite for better Python-ish styleBarry Warsaw1996-11-181-24/+35
|
* Fixed a bug: files that no longer exist remotely would never beGuido van Rossum1996-11-141-0/+24
| | | | | | | | | removed from .mirrorinfo. Now they are (even if -r is not specified -- the files are not removed, just their .mirrorinfo entry). Added a feature: the -s pattern option is also used to skip local files when removing (i.e. -r won't remove local files matching the -s patterns).
* (texi2html.py): Minor improvement to HTML 3.0 output for code examples.Fred Drake1996-10-251-2/+7
|
* New permission notice, includes CNRI.Guido van Rossum1996-10-251-9/+16
|
* (texi2html.py): Added CLASS=Navigation attribute to DIV around top/bottomFred Drake1996-10-111-2/+2
| | | | | | | navigation links for HTML 3 version. Forced a blank line above the footnotes separator for HTML 2; at least one page did not get this spaced correctly.
* (texi2html.py): Tightened the generated HTML slightly. Added -p option toFred Drake1996-10-091-159/+251
| print section titles even when the debugging output is not enabled. Added -3 option to generate HTML 3.0 constructs where meaningful.
|
| Removed repetitive garbage generation: the old version added simple descriptive comments after every datadesc/funcdesc/*desc entry:
|
|     function(args) -- function of module xxxx
|         Description....
|
| These comments are no longer generated:
|
|     function(args)
|         Description....
* (texi2html.py): Use "if __name__ == '__main__':" test. Adjust indentationFred Drake1996-09-131-1324/+1319
| | | | | | | | | to 4 spaces per level (no longer 8). (Makefile): Use .pyc versions of partparse.py and texi2html.py to generate converted documentation formats. This reduces the startup costs; probably doesn't affect anyone but me in reality, but helps when working on the docs.
* Note obsolescenceGuido van Rossum1996-09-111-1/+5
|
* Added NumVersion definition.Guido van Rossum1996-08-261-0/+3
|
* Insert filename in IOError message.Guido van Rossum1996-08-261-1/+4
|
* OK, now it uses sys.[exec_]prefix and supportsGuido van Rossum1996-08-262-131/+68
| | | | | a -o option to specify an output directory. *Much* more convenient. Removed a whole lot of hullabaloo from the README file, too.
* Get rid of it totallyGuido van Rossum1996-08-261-314/+0
|
* Added warning that this is unsupportedGuido van Rossum1996-08-261-0/+2
|
* Changes for 1.4 install targetsGuido van Rossum1996-08-261-6/+5
|
* Allow whitespace before '#' in preprocessor directives.Guido van Rossum1996-08-221-3/+4
|
* Some improvements by Fred Drake.Guido van Rossum1996-08-091-60/+58
|
* More changes by Sjoerd & JackGuido van Rossum1996-07-211-42/+136
|
* Addition by Jack to test for missing type supportGuido van Rossum1996-07-211-0/+20
|
* Add test that script name ends in .py.Guido van Rossum1996-06-171-9/+10
| | | | Cosmetic changes to usage message (refer to "make install" now).
* Get struct _frozen defn from Python.h; new variable names.Guido van Rossum1996-06-171-5/+5
|
* Removed most of the README text since it is all about the Mac specificGuido van Rossum1996-05-241-161/+5
| | | | examples, which no longer live here.
* Moved customization to bgenlocations.py module.Guido van Rossum1996-04-122-6/+13
|