Commit Graph

66 Commits

Author SHA1 Message Date
hniksic 78706dc5ea [svn] * retr.c (fd_read_body): Report the amount of data *written* as
amount_read.  This is not entirely logical, but that's what the
callers expect, and it's not easy to change.

* ftp.c (ftp_loop_internal): Ditto.

* http.c (http_loop): Be smarter about assigning restval; if we're
in the nth pass of a download, simply use the information we have
about how much data has been retrieved as restval.

* ftp.c (getftp): Ditto for FTP "REST" command.

* http.c (gethttp): When the server doesn't respect range, skip
the first RESTVAL bytes of the read body.  Never truncate the
output file.

* retr.c (fd_read_body): Support skipping initial STARTPOS octets.
2003-11-30 15:39:04 -08:00
hniksic 6bf6f58ca3 [svn] Document the existence of --preserve-permissions. 2003-11-07 05:41:20 -08:00
hniksic b46a8df58d [svn] Updated. 2003-11-06 12:26:05 -08:00
hniksic aa998eb05b [svn] Some more updating. 2003-10-08 08:17:26 -07:00
hniksic 19471588bf [svn] Updated some items. 2003-10-08 08:10:55 -07:00
hniksic 10137bd186 [svn] New option --dns-cache. 2003-09-10 12:41:54 -07:00
hniksic 73fc84d889 [svn] New TODO item. 2002-04-15 18:06:44 -07:00
hniksic 886470568c [svn] New TODO item. 2002-04-15 11:27:11 -07:00
hniksic 5390ada318 [svn] Support FWTK-style proxies.
Pbublished in <sxslmbsxptu.fsf@florida.arsdigita.de>.
2002-04-12 20:04:47 -07:00
hniksic cee95e0e79 [svn] New TODO entry. 2002-01-15 07:16:19 -08:00
hniksic 980e39f9b4 [svn] New TODO item. 2001-12-01 11:04:30 -08:00
hniksic f5625139a1 [svn] New item. 2001-12-01 10:44:08 -08:00
hniksic 7ab7f93f8d [svn] Make -p work with framed pages.
Published in <sxsu1vby71t.fsf@florida.arsdigita.de>.
2001-11-30 19:06:41 -08:00
hniksic 406fb8bbef [svn] Don't allow more than 20 successive redirections. 2001-11-30 13:33:31 -08:00
hniksic a4db28e20f [svn] Ignore -np when in -p mode.
Published in <sxsg06w2c52.fsf@florida.arsdigita.de>.
2001-11-30 13:17:53 -08:00
hniksic 2aa6f0c0e6 [svn] New items. 2001-11-30 03:15:24 -08:00
hniksic df05e7ff10 [svn] Handle <base href=...> when converting links.
Published in <sxsadxaae3t.fsf@florida.arsdigita.de>.
2001-11-25 10:40:55 -08:00
hniksic 2e6e3f21f8 [svn] Attempt to quote '?' as "%3F" when linking to local files.
Given up on the attempt, as it breaks local browsing.
2001-11-25 09:44:28 -08:00
hniksic 222e9465b7 [svn] Implemented breadth-first retrieval.
Published in <sxsherjczw2.fsf@florida.arsdigita.de>.
2001-11-24 19:10:34 -08:00
hniksic 2cfb2d2ef6 [svn] Added support for cookies. 2001-04-08 15:25:24 -07:00
dan c33a1f97fe [svn] TODO: -p should probably go "_two_ more hops" on <FRAMESET> pages.
wget.texi (Recursive Retrieval Options): Explained that you need
to use -r -l1 -p to get the two levels of requisites for a
<FRAMESET> page.  Also made a few other wording improvements.
2001-03-26 19:22:17 -08:00
dan f81e644ae3 [svn] TODO: Oops. Hostless absolute link conversion _is_ working. My test that led
me to believe it wasn't was exposing a different bug -- URLs specified on the
commandline as opposed to being recursed to don't always get re-converted at the
end of the Wget run.
2001-03-20 05:54:49 -08:00
dan 7ed63bd4d6 [svn] TODO: For some reason on 2000-11-19, Hrvoje removed the item about converting
hostless absolute links.  That isn't working yet, so I've put the item back,
with a modified wording.
2001-03-16 18:01:53 -08:00
dan 8bf001f5d8 [svn] TODO: Only normal recursion should respect -np -- page-requisite recursion
should not.
2001-03-12 19:53:34 -08:00
janp 2a36d475db [svn] Removed an obsolete TODO item about adding VMS and MS FTP server support. 2001-03-08 15:11:51 -08:00
dan c12d5e7522 [svn] Add a --range parameter allowing you to explicitly specify a range of bytes to
get from a file over HTTP (FTP only supports ranges ending at the end of the
file, though forcibly disconnecting from the server at the desired endpoint
might be workable).
2001-03-05 17:20:58 -08:00
dan fd963a79d1 [svn] TODO: Make "ftp://<host>/%2F<file>" cause an initial "CWD /". 2001-02-27 18:03:59 -08:00
dan f55d94935a [svn] TODO: Re-use FTP connection if multiple URLs on one host specified. 2001-02-27 17:29:45 -08:00
dan 63ecb8215a [svn] TODO: "Timestamps are sometimes not copied over on files retrieved by FTP."
removed.  Hopefully all the failures I was seeing were due to the fact that it
wasn't documented that non-globbing, non-recursive FTP downloads need -N to get
the remote timestamp to be preserved.
2001-02-23 15:37:55 -08:00
dan 776d65e829 [svn] * configure.in: Look for perl and pod2man and make substitutions.
* Makefile.in (install): Do install.man if we have pod2man.

* Makefile.in: Make wget man page and install it if we have pod2man.  Added some
  missing '$(srcdir)/'s.  Added missing dependencies on install targets
  (allowing you to just do `make install' rather than forcing you to do `make &&
  make install').  Also, Makefile rules should always use output file parameters
  if available rather than redirecting stdout with '>', or you falsely satisfy
  dependencies if the tool you're running is missing or fails -- fixed call of
  texi2pod.pl that did this wrong.

* texi2pod.pl: Removed from CVS.  Now automatically generated.

* texi2pod.pl.in: This new file is processed into texi2pod.pl, getting the
  appropriate path to the Perl 5+ executable on this system and becoming
  executable (CVS files, by contrast, don't arrive executable).
2001-02-22 18:09:36 -08:00
dan 4586f490e5 [svn] TODO: Remove empty directories created due to --accept/--reject. 2001-02-22 15:51:01 -08:00
dan 26ef854b23 [svn] If -c used with -N, check to make sure a file hasn't changed on the server
before "continuing" to download it (preventing a bogus hybrid file).
2001-01-11 20:51:59 -08:00
dan 552ba9c965 [svn] TODO: Add a "rollback" option to have --continue throw away a configurable
number of bytes at the end of a file before resuming download.  Apparently, some
stupid proxies insert a "transfer interrupted" string we need to get rid of.
2001-01-09 21:37:59 -08:00
dan 7140f393b2 [svn] When did the bug get fixed where recursive FTP would fail if logging in put
you in a directory other than "/"?  I don't see a src/ChangeLog entry for
it.  In any case, my testing shows that it's fixed in 1.7-dev, but TODO and
a comment in src/ftp.c were not changed to reflect this.
2001-01-09 20:17:06 -08:00
dan 86ee0537d5 [svn] TODO: If -c used on a file that's already completely downloaded, don't
re-download it (unless normal --timestamping processing would cause you to do
so).
2001-01-09 19:45:19 -08:00
dan 40a77731a2 [svn] TODO: A poster to the wget list notes that gcc's nice man page is automatically
generated from its .texi file.  This may be a way to get back a wget man page.
2001-01-04 20:44:46 -08:00
dan 2aece99f0b [svn] TODO: We should make a simple man page referring to info doco. 2001-01-03 14:40:18 -08:00
hniksic 7828e81c79 [svn] Committed C. Frankel's SSL patch. 2000-12-05 15:09:41 -08:00
hniksic 6663d70a0a [svn] New TODO item. 2000-11-21 06:58:46 -08:00
hniksic b0b1c815c1 [svn] A bunch of new features:
- use mmap() to read whole files in core instead of allocating memory
  and read'ing it.

- use a new, more general, HTML parser (html-parse.c) and interface to
  it from Wget (html-url.c).

- respect <meta name=robots content=nofollow> (easy with the new HTML
  parser).

- use hash tables instead of linked lists in places where the lists
  were used to facilitate mappings.

- rewrite the code in host.c to be more readable and faster (hash
  tables instead of home-grown lists.)

- make convert_links properly convert partial URLs to complete ones
  for those URLs that have *not* been downloaded.

- use HTTP persistent connections where available.  very
  simple-minded, caches the last connection to the server.

Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
2000-11-19 12:50:10 -08:00
hniksic fa90bec240 [svn] One more todo item. 2000-11-14 14:45:43 -08:00
hniksic bfca6f02b4 [svn] Spelling fixlet. 2000-11-06 02:16:37 -08:00
hniksic ac96041552 [svn] As of recently, path simplification does stop at '?'. 2000-11-06 02:03:57 -08:00
dan 24c465b5ad [svn] retr.c (retrieve_url): Manually applied T. Bharath
<TBharath@responsenetworks.com>'s patch to get wget to grok illegal relative URL
redirects.  Reformatted and re-commented it.
2000-10-27 20:18:20 -07:00
dan e863bff640 [svn] --mime-extensions would be more appropriate than --mime-extension.
^
2000-10-24 15:40:22 -07:00
dan 71994021f3 [svn] TODO: Generalize --html-extension to something like --mime-extension. 2000-10-20 16:20:24 -07:00
dan d9dd14a995 [svn] * AUTHORS: Added -E to thst list of my stuff.
* TODO: We need to check the HTTP spec w.r.t. simplification of absolute URLs.

* MAILING-LIST: I didn't realize <wget@sunsite.auc.dk> allowed posting by
  non-subscribers.  <bug-wget@gnu.org> soon to be an alias for it.

* NEWS: Always forget to update this file when making user-vis. changes.
2000-10-20 15:29:42 -07:00
dan 6dd2357558 [svn] TODO: -k needs to convert '?' to "%3F" in links to saved files containing the
'?' character (e.g. CGI output).
2000-10-20 14:44:26 -07:00
dan da17e06a1e [svn] TODO: Make -I and -X allow an optional hostname before the directory name?
When simplifying paths, wget needs to stop at any '?' character.
2000-10-19 23:06:03 -07:00
dan b3e2c0ff97 [svn] Implemented and documented new -E / --html-extension / html_extension option. 2000-10-19 22:55:46 -07:00