mirror of https://github.com/moparisthebest/wget synced 2024-07-03 16:38:41 -04:00
56 Commits

Author SHA1 Message Date
hniksic
980e39f9b4 [svn] New TODO item. 2001-12-01 11:04:30 -08:00
hniksic
f5625139a1 [svn] New item. 2001-12-01 10:44:08 -08:00
hniksic
7ab7f93f8d [svn] Make -p work with framed pages.
Published in <sxsu1vby71t.fsf@florida.arsdigita.de>.
2001-11-30 19:06:41 -08:00
hniksic
406fb8bbef [svn] Don't allow more than 20 successive redirections. 2001-11-30 13:33:31 -08:00
hniksic
a4db28e20f [svn] Ignore -np when in -p mode.
Published in <sxsg06w2c52.fsf@florida.arsdigita.de>.
2001-11-30 13:17:53 -08:00
hniksic
2aa6f0c0e6 [svn] New items. 2001-11-30 03:15:24 -08:00
hniksic
df05e7ff10 [svn] Handle <base href=...> when converting links.
Published in <sxsadxaae3t.fsf@florida.arsdigita.de>.
2001-11-25 10:40:55 -08:00
hniksic
2e6e3f21f8 [svn] Attempt to quote '?' as "%3F" when linking to local files.
Given up on the attempt, as it breaks local browsing.
2001-11-25 09:44:28 -08:00
hniksic
222e9465b7 [svn] Implemented breadth-first retrieval.
Published in <sxsherjczw2.fsf@florida.arsdigita.de>.
2001-11-24 19:10:34 -08:00
hniksic
2cfb2d2ef6 [svn] Added support for cookies. 2001-04-08 15:25:24 -07:00
dan
c33a1f97fe [svn] TODO: -p should probably go "_two_ more hops" on <FRAMESET> pages.
wget.texi (Recursive Retrieval Options): Explained that you need
to use -r -l1 -p to get the two levels of requisites for a
<FRAMESET> page.  Also made a few other wording improvements.
2001-03-26 19:22:17 -08:00
dan
f81e644ae3 [svn] TODO: Oops. Hostless absolute link conversion _is_ working. My test that led
me to believe it wasn't was exposing a different bug -- URLs specified on the
commandline as opposed to being recursed to don't always get re-converted at the
end of the Wget run.
2001-03-20 05:54:49 -08:00
dan
7ed63bd4d6 [svn] TODO: For some reason on 2000-11-19, Hrvoje removed the item about converting
hostless absolute links.  That isn't working yet, so I've put the item back,
with a modified wording.
2001-03-16 18:01:53 -08:00
dan
8bf001f5d8 [svn] TODO: Only normal recursion should respect -np -- page-requisite recursion
should not.
2001-03-12 19:53:34 -08:00
janp
2a36d475db [svn] Removed an obsolete TODO item about adding VMS and MS FTP server support. 2001-03-08 15:11:51 -08:00
dan
c12d5e7522 [svn] Add a --range parameter allowing you to explicitly specify a range of bytes to
get from a file over HTTP (FTP only supports ranges ending at the end of the
file, though forcibly disconnecting from the server at the desired endpoint
might be workable).
2001-03-05 17:20:58 -08:00
dan
fd963a79d1 [svn] TODO: Make "ftp://<host>/%2F<file>" cause an initial "CWD /". 2001-02-27 18:03:59 -08:00
dan
f55d94935a [svn] TODO: Re-use FTP connection if multiple URLs on one host specified. 2001-02-27 17:29:45 -08:00
dan
63ecb8215a [svn] TODO: "Timestamps are sometimes not copied over on files retrieved by FTP."
removed.  Hopefully all the failures I was seeing were due to the fact that it
wasn't documented that non-globbing, non-recursive FTP downloads need -N to get
the remote timestamp to be preserved.
2001-02-23 15:37:55 -08:00
dan
776d65e829 [svn] * configure.in: Look for perl and pod2man and make substitutions.
* Makefile.in (install): Do install.man if we have pod2man.

* Makefile.in: Make wget man page and install it if we have pod2man.  Added some
  missing '$(srcdir)/'s.  Added missing dependencies on install targets
  (allowing you to just do `make install' rather than forcing you to do `make &&
  make install').  Also, Makefile rules should always use output file parameters
  if available rather than redirecting stdout with '>', or you falsely satisfy
  dependencies if the tool you're running is missing or fails -- fixed call of
  texi2pod.pl that did this wrong.

* texi2pod.pl: Removed from CVS.  Now automatically generated.

* texi2pod.pl.in: This new file is processed into texi2pod.pl, getting the
  appropriate path to the Perl 5+ executable on this system and becoming
  executable (CVS files, by contrast, don't arrive executable).
2001-02-22 18:09:36 -08:00
dan
4586f490e5 [svn] TODO: Remove empty directories created due to --accept/--reject. 2001-02-22 15:51:01 -08:00
dan
26ef854b23 [svn] If -c used with -N, check to make sure a file hasn't changed on the server
before "continuing" to download it (preventing a bogus hybrid file).
2001-01-11 20:51:59 -08:00
dan
552ba9c965 [svn] TODO: Add a "rollback" option to have --continue throw away a configurable
number of bytes at the end of a file before resuming download.  Apparently, some
stupid proxies insert a "transfer interrupted" string we need to get rid of.
2001-01-09 21:37:59 -08:00
dan
7140f393b2 [svn] When did the bug get fixed where recursive FTP would fail if logging in put
you in a directory other than "/"?  I don't see a src/ChangeLog entry for
it.  In any case, my testing shows that it's fixed in 1.7-dev, but TODO and
a comment in src/ftp.c were not changed to reflect this.
2001-01-09 20:17:06 -08:00
dan
86ee0537d5 [svn] TODO: If -c used on a file that's already completely downloaded, don't
re-download it (unless normal --timestamping processing would cause you to do
so).
2001-01-09 19:45:19 -08:00
dan
40a77731a2 [svn] TODO: A poster to the wget list notes that gcc's nice man page is automatically
generated from its .texi file.  This may be a way to get back a wget man page.
2001-01-04 20:44:46 -08:00
dan
2aece99f0b [svn] TODO: We should make a simple man page referring to info doco. 2001-01-03 14:40:18 -08:00
hniksic
7828e81c79 [svn] Committed C. Frankel's SSL patch. 2000-12-05 15:09:41 -08:00
hniksic
6663d70a0a [svn] New TODO item. 2000-11-21 06:58:46 -08:00
hniksic
b0b1c815c1 [svn] A bunch of new features:
- use mmap() to read whole files in core instead of allocating memory
  and read'ing it.

- use a new, more general, HTML parser (html-parse.c) and interface to
  it from Wget (html-url.c).

- respect <meta name=robots content=nofollow> (easy with the new HTML
  parser).

- use hash tables instead of linked lists in places where the lists
  were used to facilitate mappings.

- rewrite the code in host.c to be more readable and faster (hash
  tables instead of home-grown lists.)

- make convert_links properly convert partial URLs to complete ones
  for those URLs that have *not* been downloaded.

- use HTTP persistent connections where available.  very
  simple-minded, caches the last connection to the server.

Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
2000-11-19 12:50:10 -08:00
hniksic
fa90bec240 [svn] One more todo item. 2000-11-14 14:45:43 -08:00
hniksic
bfca6f02b4 [svn] Spelling fixlet. 2000-11-06 02:16:37 -08:00
hniksic
ac96041552 [svn] As of recently, path simplification does stop at '?'. 2000-11-06 02:03:57 -08:00
dan
24c465b5ad [svn] retr.c (retrieve_url): Manually applied T. Bharath
<TBharath@responsenetworks.com>'s patch to get wget to grok illegal relative URL
redirects.  Reformatted and re-commented it.
2000-10-27 20:18:20 -07:00
dan
e863bff640 [svn] --mime-extensions would be more appropriate than --mime-extension.
2000-10-24 15:40:22 -07:00
dan
71994021f3 [svn] TODO: Generalize --html-extension to something like --mime-extension. 2000-10-20 16:20:24 -07:00
dan
d9dd14a995 [svn] * AUTHORS: Added -E to the list of my stuff.
* TODO: We need to check the HTTP spec w.r.t. simplification of absolute URLs.

* MAILING-LIST: I didn't realize <wget@sunsite.auc.dk> allowed posting by
  non-subscribers.  <bug-wget@gnu.org> soon to be an alias for it.

* NEWS: Always forget to update this file when making user-vis. changes.
2000-10-20 15:29:42 -07:00
dan
6dd2357558 [svn] TODO: -k needs to convert '?' to "%3F" in links to saved files containing the
'?' character (e.g. CGI output).
2000-10-20 14:44:26 -07:00
dan
da17e06a1e [svn] TODO: Make -I and -X allow an optional hostname before the directory name?
When simplifying paths, wget needs to stop at any '?' character.
2000-10-19 23:06:03 -07:00
dan
b3e2c0ff97 [svn] Implemented and documented new -E / --html-extension / html_extension option. 2000-10-19 22:55:46 -07:00
dan
de7c00c095 [svn] TODO: Add option to save local filenames without extra %-encoding. 2000-10-18 23:29:20 -07:00
dan
cbf018d0c0 [svn] --retr-symlinks was not previously documented properly. Based on my newfound
understanding of what its limitations are, added a TODO item.  Also made a minor
tweak in html.c to silence a warning.
2000-10-09 15:43:11 -07:00
dan
2358c437c5 [svn] TODO: Make wget follow (illegal) relative URL HTTP redirects. 2000-09-25 17:42:50 -07:00
dan
737daec8e6 [svn] TODO: Make wget return nonzero in situations like bad HTTP auth. 2000-09-25 15:09:25 -07:00
dan
51642074f4 [svn] Just fixed a typo. 2000-07-21 18:36:44 -07:00
dan
88c07d546e [svn] TODO: -k should convert "hostless absolute" URLs, like <A HREF="/index.html">.
However, Brian McMahon <bm@iucr.org> wants the old incorrect behavior to still
be available as an option, as he depends on it to allow mirrors of his site to
send CGI queries to his original site, but still get graphics off of the mirror
site.  Perhaps this would be better dealt with by adding an option to tell -k
not to convert certain URL patterns?
2000-07-21 16:16:10 -07:00
dan
e1d4d0995f [svn] -k should convert "hostless absolute" URLs, like <A HREF="/index.html">. 2000-07-19 18:19:58 -07:00
dan
fe387ce432 [svn] TODO: Timestamps are sometimes not copied over on files retrieved by FTP. 2000-05-24 13:29:18 -07:00
dan
6d218bc4ab [svn] TODO: Wget does not currently handle "fragment identifiers" (the part of a URL
starting with the '#' character) properly.
2000-05-22 19:40:09 -07:00
dan
f7c83b6ee3 [svn] TODO: Make `-k' check for files that were downloaded in the past and convert
links to them in newly-downloaded documents.
2000-05-17 19:19:59 -07:00