442 Commits

Author SHA1 Message Date
dan f4d82ed3f6 [svn] * README: Changed 1.5.3 in the FTP URL to 1.6.
* NEWS: Released Wget version 1.6.

* po/*.po: 'Project-Id-Version's were very haphazard, saying
either "wget" or "GNU wget", and with versions of 1.5.2-b[124], 1.5.3, the
nonexistent 1.5.4, and 1.6-pre.  Standardized all to "GNU Wget 1.7-dev".
Perhaps this is wrong to do because some of the translations haven't been
updated since the versions they state, but I know some of the files were updated
specifically for 1.6, and none of them used this version.  In any case, the
'POT-Creation-Date's and 'PO-Revision-Date's remain the best indicator of
whether a translation's out of date.
2000-12-31 04:05:29 -08:00
dan f105b3e43f [svn] Changed my address from dan-wget@dilvish.speed.net to wget@harkless.org
everywhere -- dilvish will probably be going away soon.
2000-12-31 03:18:37 -08:00
hniksic 9a4fcf6a68 [svn] Updated. 2000-12-09 18:23:47 -08:00
hniksic 649c599e0e [svn] Add a dummy Makefile.in. 2000-12-09 17:55:43 -08:00
hniksic c9ccd27730 [svn] Applied Karl Eichwalder's patches.
<20001125153354.51087DD2DC@tux.gnu.franken.de>
<20001125165332.92862DD2DB@tux.gnu.franken.de>
2000-12-05 15:56:34 -08:00
hniksic 7828e81c79 [svn] Committed C. Frankel's SSL patch. 2000-12-05 15:09:41 -08:00
hniksic 916cb86723 [svn] Determine ALL_LINGUAS dynamically. 2000-11-23 13:41:37 -08:00
hniksic b0b1c815c1 [svn] A bunch of new features:
- use mmap() to read whole files in core instead of allocating memory
  and read'ing it.

- use a new, more general, HTML parser (html-parse.c) and interface to
  it from Wget (html-url.c).

- respect <meta name=robots content=nofollow> (easy with the new HTML
  parser).

- use hash tables instead of linked lists in places where the lists
  were used to facilitate mappings.

- rewrite the code in host.c to be more readable and faster (hash
  tables instead of home-grown lists.)

- make convert_links properly convert partial URLs to complete ones
  for those URLs that have *not* been downloaded.

- use HTTP persistent connections where available.  Very
  simple-minded: caches the last connection to the server.

Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
2000-11-19 12:50:10 -08:00
hniksic ccf31643ab [svn] vsnprintf() fixup. 2000-11-16 08:37:49 -08:00
hniksic 2097b8beaf [svn] Compile in vsnprintf.c under Windows.
Published by Herold Heiko in
<412FC0AFD62ED31191B40008C7E9A11A2AABF0@srvnt04.previnet.it>.
2000-11-16 01:52:45 -08:00
hniksic 268ac1fc7f [svn] Another contributed fix. 2000-11-01 16:21:53 -08:00
hniksic 29cdc8da20 [svn] Updated long_to_string(); enhanced opt.downloaded to use
64-bit types where available.
Published in <sxswvenqsmn.fsf@florida.arsdigita.de> and
<sxssnpbqshp.fsf@florida.arsdigita.de>.
2000-11-01 13:51:25 -08:00
dan 5f96643297 [svn] NEWS: Hrvoje pointed out that relative URL grokking deserves mention. 2000-10-30 20:24:14 -08:00
dan 24c465b5ad [svn] retr.c (retrieve_url): Manually applied T. Bharath
<TBharath@responsenetworks.com>'s patch to get wget to grok illegal relative URL
redirects.  Reformatted and re-commented it.
2000-10-27 20:18:20 -07:00
dan 4d925ebc24 [svn] NEWS: Forgot to update regarding new --bind-address option. 2000-10-24 15:41:32 -07:00
dan 71994021f3 [svn] TODO: Generalize --html-extension to something like --mime-extension. 2000-10-20 16:20:24 -07:00
dan d9dd14a995 [svn] * AUTHORS: Added -E to the list of my stuff.
* TODO: We need to check the HTTP spec w.r.t. simplification of absolute URLs.

* MAILING-LIST: I didn't realize <wget@sunsite.auc.dk> allowed posting by
  non-subscribers.  <bug-wget@gnu.org> soon to be an alias for it.

* NEWS: Always forget to update this file when making user-vis. changes.
2000-10-20 15:29:42 -07:00
dan 6dd2357558 [svn] TODO: -k needs to convert '?' to "%3F" in links to saved files containing the
'?' character (e.g. CGI output).
2000-10-20 14:44:26 -07:00
dan b77a3437a8 [svn] Added Polish and Russian message translation files. 2000-10-20 00:20:55 -07:00
dan da17e06a1e [svn] TODO: Make -I and -X allow an optional hostname before the directory name?
When simplifying paths, wget needs to stop at any '?' character.
2000-10-19 23:06:03 -07:00
dan b3e2c0ff97 [svn] Implemented and documented new -E / --html-extension / html_extension option. 2000-10-19 22:55:46 -07:00
dan de7c00c095 [svn] TODO: Add option to save local filenames without extra %-encoding. 2000-10-18 23:29:20 -07:00
dan cbf018d0c0 [svn] --retr-symlinks was not previously documented properly. Based on my newfound
understanding of what its limitations are, added a TODO item.  Also made a minor
tweak in html.c to silence a warning.
2000-10-09 15:43:11 -07:00
dan 2358c437c5 [svn] TODO: Make wget follow (illegal) relative URL HTTP redirects. 2000-09-25 17:42:50 -07:00
dan 737daec8e6 [svn] TODO: Make wget return nonzero in situations like bad HTTP auth. 2000-09-25 15:09:25 -07:00
dan 86ac3fdcbd [svn] * MACHINES: Previously said to send updates to "me" (Hrvoje) -- now says to
email the mailing list or bug-wget@gnu.org.

* MAILING-LIST: Added mention of bug-wget@gnu.org.

* NEWS: Added --waitretry and --page-requisites.
2000-08-30 14:07:40 -07:00
dan 7931200609 [svn] * *.{gmo,po,pot}: Regenerated after modifying wget --help output.
* ftp.c (ftp_retrieve_list): Use new INFINITE_RECURSION #define.

* html.c: htmlfindurl() now takes final `dash_p_leaf_HTML' parameter.
Wrapped some > 80-column lines.  When -p is specified and we're at a
leaf node, do not traverse <A>, <AREA>, or <LINK> tags other than
<LINK REL="stylesheet">.

* html.h (htmlfindurl): Now takes final `dash_p_leaf_HTML' parameter.

* init.c: Added new -p / --page-requisites / page_requisites option.

* main.c (print_help): Clarified that -l inf and -l 0 both allow
infinite recursion.  Changed the unhelpful --mirror description
to simply give the options it's equivalent to.  Added new -p option.
(main): Added some comments; handle new -p / --page-requisites.

* options.h (struct options): Added new page_requisites field.

* recur.c: Changed "URL-s" to "URLs" and "HTML-s" to "HTMLs".
Calculate and pass down new `dash_p_leaf_HTML' parameter to
get_urls_html().  Use new INFINITE_RECURSION #define.

* retr.c: Changed "URL-s" to "URLs".  get_urls_html() now takes
final `dash_p_leaf_HTML' parameter.

* url.c: get_urls_html() and htmlfindurl() now take final
`dash_p_leaf_HTML' parameter.

* url.h (get_urls_html): Now takes final `dash_p_leaf_HTML' parameter.

* wget.h: Added some comments and new INFINITE_RECURSION #define.

* wget.texi (Recursive Retrieval Options): Documented new -p option.
2000-08-30 04:26:21 -07:00
dan 93146c9432 [svn] wget works on FreeBSD 4.x as well as the previously-mentioned 2.x. Changed
MACHINES entry to have a * in the version location.
2000-08-25 15:36:34 -07:00
dan 9ec2f1d31d [svn] * MACHINES: Alphabetized, changed "architectures" to "OSes and architectures",
added missing company names, removed needless ^L, made AIX entry more general
  to reflect my testing, removed the non-factual "this version of", and fixed
  some grammatical errors.
2000-08-25 15:23:22 -07:00
dan 88c07d546e [svn] TODO: -k should convert "hostless absolute" URLs, like <A HREF="/index.html">.
However, Brian McMahon <bm@iucr.org> wants the old incorrect behavior to still
be available as an option, as he depends on it to allow mirrors of his site to
send CGI queries to his original site, but still get graphics off of the mirror
site.  Perhaps this would be better dealt with by adding an option to tell -k
not to convert certain URL patterns?
2000-07-21 16:16:10 -07:00
dan fe387ce432 [svn] TODO: Timestamps are sometimes not copied over on files retrieved by FTP. 2000-05-24 13:29:18 -07:00
dan 6d218bc4ab [svn] TODO: Wget does not currently handle "fragment identifiers" (the part of a URL
starting with the '#' character) properly.
2000-05-22 19:40:09 -07:00
dan b05feb3ae2 [svn] Damir Dzeko <ddzeko@zesoi.fer.hr> did not document his new --referer option.
Did so (--help output and wget.texi).  Also tweaked --help output for --execute.
2000-05-22 19:29:38 -07:00
dan 7c9156e74a [svn] * AUTHORS: Added myself to this file, as Hrvoje got confirmation of my FSF
copyright assignment.
2000-05-22 16:36:30 -07:00
dan f7c83b6ee3 [svn] TODO: Make `-k' check for files that were downloaded in the past and convert
links to them in newly-downloaded documents.
2000-05-17 19:19:59 -07:00
dan e5b8bc39c0 [svn] Forgot to update the ChangeLog when I added my three entries to the TODO
yesterday.
2000-04-06 14:55:02 -07:00
dan 1c083869b0 [svn] Previously forgot to add a note about new --follow-tags and -G / --ignore-tags. 2000-04-04 19:02:08 -07:00
dan 4454f6ce0a [svn] * TODO: Removed done item: we now have an option (-G) that makes it easy to
download a single HTML document and all its constituents.

* po/*.{gmo,po,pot}: Regenerated after adding new options.

* po/hr.po: Hrvoje forgot '\n's on his translations of my altered messages,
causing msgfmt to balk and `make install' to fail.


* wget.texi (Recursive Retrieval Options): In -K description, added a link to
the discussion of interaction with -N.
(Recursive Accept/Reject Options): Did some alphabetizing and added descriptions
of new --follow-tags and -G / --ignore-tags options.
(Following Links): Changed "the loads of" to "loads of".
(Wgetrc Commands): Added descriptions of new follow_tags and ignore_tags
commands.


* html.c (idmatch): Implemented checking of my new --follow-tags and
--ignore-tags options.

* init.c (commands): Added comment reminding people adding new entries doing
allocation to add corresponding freeing in cleanup().
(commands): Added new followtags and ignoretags commands.
(cleanup): Free storage for new followtags and ignoretags.

* main.c: Use of "comma-separated list" was random -- normalized it.  Did some
alphabetization.  Added comments pointing out "Options without arguments" and
"Options accepting an argument" sections of long_options[].  Added new options
--follow-tags and -G / --ignore-tags.  Added comment that Damir's --referer is
currently undocumented.  Added comment that Heiko's --waitretry is partially
undocumented (mentioned in --help but not in wget.texi).  Moved improperly
sorted 24, 129, and 'G' cases.

* options.h (struct options): Added new fields follow_tags and ignore_tags.

* wget.h: Added "#define EQ 0" so we can say "strcmp(a, b) == EQ".
2000-03-10 22:48:06 -08:00
dan 4331c39c9a [svn] Implemented the item I formerly had in the TODO: When -K and -N are used
together, we compare local file X.orig (if extant) against server file X.
Previously -k and -N were worthless in combination because the local converted
files always differed from the server versions.
2000-03-01 22:33:48 -08:00
dan e0a58713f7 [svn] Upped version number from 1.5.3 to 1.5.3+dev. Because the development source
is available via anonymous CVS and desirable features are being added, it's
quite possible for end-users to be getting their hands on development versions.
They may report bugs, so if we don't change the version number, we'll have to
continually follow up the statement "I'm using version 1.5.3" with the question
"The FTP archive or the CVS source?"  Better to just make this development
version have a unique number.  Once we're ready to actually release the next
version, we can up the version from 1.5.3+dev to 1.5.4, or 1.6, or whatever it
turns out to be (depending on how much development gets done).

Also made minor updates (dates, email addresses) to wget.texi.
2000-02-29 16:50:52 -08:00
dan e5408e7db8 [svn] Implemented new -K / --backup-converted / backup_converted = on option. 2000-02-29 16:17:23 -08:00
kwget 31d6616c48 [svn] Initial revision 1999-12-01 23:42:23 -08:00