Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Restructured Checker class to get rid of 'ext' table. | Guido van Rossum | 1997-02-02 | 2 | -177/+165 |
| | | | | | | | | | | | | | | | | | | | Links are now either in 'todo' or 'done', and ext links are hadled more like local links except that no further links are gathered (and sometimes they aren't checked, e.g. for mailto and news URLs). The -x option reverses its meaning: it disables checking of ext links (they are moved to 'done' without checking). A new 'errors' table collects pages with bad links as we go -- redundant, but useful for the GUI version which needs to report this as we go. Some new methods, including reset(). New checkpoint format. Adapted the GUI to the changes in the Checker class. Added Quit and "Start over" buttons, and a checkbox to disable checking external links. The details window now also shows bad links emanating from the selected page. Miscellaneous small chages. | ||||
* | Add root URL entry box, separate start/stop/step buttons. | Guido van Rossum | 1997-02-01 | 1 | -54/+131 |
| | | | | If the users selects an item in 'To check', start checking there. | ||||
* | Process <img> and <frame> tags. Don't bother skipping second href. | Guido van Rossum | 1997-02-01 | 1 | -3/+12 |
| | |||||
* | Check in another copy of tktools.py... | Guido van Rossum | 1997-01-31 | 1 | -0/+367 |
| | |||||
* | Tk interface to webchecker. Not fully featured yet, but usable. | Guido van Rossum | 1997-01-31 | 1 | -0/+329 |
| | |||||
* | Spin off checking of external page in a subroutine. | Guido van Rossum | 1997-01-31 | 1 | -17/+20 |
| | | | | | Increase MAXPAGE to 150K. Add back printing of __doc__ for usage message. | ||||
* | Many misc changes. | Guido van Rossum | 1997-01-31 | 1 | -95/+142 |
| | | | | | | | | | | | | | | | | | | | | | | | - Faster HTML parser derivede from SGMLparser (Fred Gansevles). - All manipulations of todo, done, ext, bad are done via methods, so a derived class can override. Also moved the 'done' marking to dopage(), so run() is much simpler. - Added a method status() which returns a string containing the summary counts; added a "total" count. - Drop the guessing of the file type before opening the document -- we still need to check those links for validity! - Added a subroutine to close a connection which first slurps up the remaining data when it's an ftp URL -- apparently closing an ftp connection without reading till the end makes it hang. - Added -n option to skip running (only useful with -R). - The Checker object now has an instance variable which is set to 1 when it is changed. This is not pickled. | ||||
* | Set proper User-agent header (Python-webchecker/<version>). | Guido van Rossum | 1997-01-30 | 1 | -14/+21 |
| | | | | | When -x is combined with -q, still do the checking, but don't print the error in this phase -- they are reported by report_errors(). | ||||
* | Some refinements of the external-link checking code: insert the errors | Guido van Rossum | 1997-01-30 | 1 | -9/+22 |
| | | | | | | in the 'bad' dictionary (sanitize them so they are picklable; the sanitation code is now a subroutine); don't check mailto: URLs; omit colon in Error message. | ||||
* | Added -x option to check external links. Slooooow! | Guido van Rossum | 1997-01-30 | 1 | -10/+32 |
| | |||||
* | Catch I/O errors when parsing robots.txt file. | Guido van Rossum | 1997-01-30 | 1 | -5/+13 |
| | | | | Add version number, printed at startup in non-quited mode. | ||||
* | Basic README file | Guido van Rossum | 1997-01-30 | 1 | -0/+14 |
| | |||||
* | Added robots.txt support, using Skip Montanaro's parser. | Guido van Rossum | 1997-01-30 | 1 | -3/+38 |
| | | | | | Fixed occasional inclusion of unpicklable objects (Message in errors). Changed indent of a few messages. | ||||
* | Skip Montanaro's robots.txt parser. | Guido van Rossum | 1997-01-30 | 1 | -0/+97 |
| | |||||
* | web tree checker | Guido van Rossum | 1997-01-30 | 1 | -0/+488 |
| | |||||
* | mime types guesser | Guido van Rossum | 1997-01-30 | 1 | -0/+190 |
| | |||||
* | Version 1.1. Fix memory leak and expensive comparison with None. | Guido van Rossum | 1997-01-18 | 1 | -3/+22 |
| | |||||
* | /usr/local/bin/python -> /usr/bin/env python | Guido van Rossum | 1996-11-27 | 28 | -29/+28 |
| | |||||
* | Changed so that it replaces /usr/local/bin/python with /usr/bin/env python. | Guido van Rossum | 1996-11-27 | 1 | -6/+6 |
| | |||||
* | Changed logic so it now replaces anything that has #! and python in | Guido van Rossum | 1996-11-27 | 1 | -6/+5 |
| | | | | the first line, replacing the entire line. | ||||
* | Script to fix cprt notice | Guido van Rossum | 1996-11-27 | 1 | -0/+78 |
| | |||||
* | Removed the USA'isms that aren't really true | Barry Warsaw | 1996-11-20 | 1 | -2/+4 |
| | |||||
* | /usr/bin/env is slightly more portable | Barry Warsaw | 1996-11-20 | 1 | -1/+1 |
| | |||||
* | Added a few meta info | Barry Warsaw | 1996-11-20 | 1 | -0/+4 |
| | |||||
* | Output original address | Barry Warsaw | 1996-11-19 | 1 | -4/+4 |
| | |||||
* | Updated country codes as per ISO3166 10-Feb-1994 | Barry Warsaw | 1996-11-18 | 1 | -36/+183 |
| | |||||
* | implementation complete. need to update country codes | Barry Warsaw | 1996-11-18 | 1 | -22/+64 |
| | |||||
* | Added `ru' --> Russia | Barry Warsaw | 1996-11-18 | 1 | -0/+1 |
| | |||||
* | A rewrite for better Python-ish style | Barry Warsaw | 1996-11-18 | 1 | -24/+35 |
| | |||||
* | Fixed a bug: files that no longer exist remotely would never be | Guido van Rossum | 1996-11-14 | 1 | -0/+24 |
| | | | | | | | | | removed from .mirrorinfo. Now they are (even if -r is not specified -- the files are not removed, just their .mirrorinfo entry). Added a feature: the -s pattern option is also used to skip local files when removing (i.e. -r won't remove local files matching the -s patterns). | ||||
* | (texi2html.py): Minor improvement to HTML 3.0 output for code examples. | Fred Drake | 1996-10-25 | 1 | -2/+7 |
| | |||||
* | New permission notice, includes CNRI. | Guido van Rossum | 1996-10-25 | 1 | -9/+16 |
| | |||||
* | (texi2html.py): Added CLASS=Navigation attribute to DIV around top/bottom | Fred Drake | 1996-10-11 | 1 | -2/+2 |
| | | | | | | | navigation links for HTML 3 version. Forced a blank line above the footnotes separator for HTML 2; at least one page did not get this spaced correctly. | ||||
* | (texi2html.py): Tightened the generated HTML slightly. Added -p option to | Fred Drake | 1996-10-09 | 1 | -159/+251 |
| | | | | | | | | | | | | | | | | print section titles even when the debugging output is not enabled. Added -3 option to generate HTML 3.0 constructs where meaningful. Removed repititive garbage generation: the old version added simple descriptive comments after every datadesc/funcdesc/*desc entry: function(args) -- function of module xxxx Description.... These comments are no longer generated: function(args) Description.... | ||||
* | (texi2html.py): Use "if __name__ == '__main__':" test. Adjust indentation | Fred Drake | 1996-09-13 | 1 | -1324/+1319 |
| | | | | | | | | | to 4 spaces per level (no longer 8). (Makefile): Use .pyc versions of partparse.py and texi2html.py to generate converted documentation formats. This reduces the startup costs; probably doesn't affect anyone but me in reality, but helps when working on the docs. | ||||
* | Note obsolescence | Guido van Rossum | 1996-09-11 | 1 | -1/+5 |
| | |||||
* | Added NumVersion definition. | Guido van Rossum | 1996-08-26 | 1 | -0/+3 |
| | |||||
* | Insert filename in IOError message. | Guido van Rossum | 1996-08-26 | 1 | -1/+4 |
| | |||||
* | OK, now it uses sys.[exec_]prefix and supports | Guido van Rossum | 1996-08-26 | 2 | -131/+68 |
| | | | | | a -o option to specify an output directory. *Much* more convenient. Removed a whole lot of hullabaloo from the README file, too. | ||||
* | Get rid of it totally | Guido van Rossum | 1996-08-26 | 1 | -314/+0 |
| | |||||
* | Added warning that this is unsupported | Guido van Rossum | 1996-08-26 | 1 | -0/+2 |
| | |||||
* | Changes for 1.4 install targets | Guido van Rossum | 1996-08-26 | 1 | -6/+5 |
| | |||||
* | Allow whitespace before '#' in preprocessor directives. | Guido van Rossum | 1996-08-22 | 1 | -3/+4 |
| | |||||
* | Some improvements by Fred Drake. | Guido van Rossum | 1996-08-09 | 1 | -60/+58 |
| | |||||
* | More changes by Sjoerd & Jack | Guido van Rossum | 1996-07-21 | 1 | -42/+136 |
| | |||||
* | Addition by Jack to test for missing type support | Guido van Rossum | 1996-07-21 | 1 | -0/+20 |
| | |||||
* | Add test that script name ends in .py. | Guido van Rossum | 1996-06-17 | 1 | -9/+10 |
| | | | | Cosmetic changes to usage message (refer to "make install" now). | ||||
* | Get struct _frozen defn from Python.h; new variable names. | Guido van Rossum | 1996-06-17 | 1 | -5/+5 |
| | |||||
* | Removed most of the README text since it is all about the Mac specific | Guido van Rossum | 1996-05-24 | 1 | -161/+5 |
| | | | | examples, which no longer live here. | ||||
* | Moved customization to bgenlocations.py module. | Guido van Rossum | 1996-04-12 | 2 | -6/+13 |
| |