mirror of https://github.com/moparisthebest/wget synced 2024-07-03 16:38:41 -04:00
Commit Graph

182 Commits

Author SHA1 Message Date
hniksic
9179f2e699 [svn] Don't fiddle with "interactive timeout" if read timeout is unset. 2003-12-12 14:40:39 -08:00
hniksic
a8155e7bcc [svn] Correctly calculate bandwidth as total data read divided by download
time.
2003-12-05 18:32:57 -08:00
hniksic
78706dc5ea [svn] * retr.c (fd_read_body): Report the amount of data *written* as
amount_read.  This is not entirely logical, but that's what the
callers expect, and it's not easy to change.

* ftp.c (ftp_loop_internal): Ditto.

* http.c (http_loop): Be smarter about assigning restval; if we're
in the nth pass of a download, simply use the information we have
about how much data has been retrieved as restval.

* ftp.c (getftp): Ditto for FTP "REST" command.

* http.c (gethttp): When the server doesn't respect range, skip
the first RESTVAL bytes of the read body.  Never truncate the
output file.

* retr.c (fd_read_body): Support skipping initial STARTPOS octets.
2003-11-30 15:39:04 -08:00
hniksic
2f357b5eb8 [svn] Better documentation for fd_read_body args. 2003-11-29 20:41:08 -08:00
hniksic
3390f7eb45 [svn] Try to reuse connections that return error codes. 2003-11-29 17:34:33 -08:00
hniksic
8f286c1238 [svn] Remove obsolete finder_t typedef. 2003-11-27 15:34:32 -08:00
hniksic
dfe1f43dc0 [svn] Use fd_read_hunk for reading HTTP response head. Support HTTP/0.9
responses.
2003-11-27 15:29:36 -08:00
hniksic
d9fea91a0a [svn] Remove the "rbuf" buffering layer. Provide peeking primitives instead. 2003-11-20 17:48:11 -08:00
hniksic
0716c335a0 [svn] Renamed xread/xwrite/xclose to fd_read/fd_write/fd_close. The "x" prefix is
not appropriate because the semantics have diverged from read/write/close
too significantly.
2003-11-20 07:19:59 -08:00
hniksic
34a06b07b0 [svn] Doc fix. 2003-11-08 11:29:36 -08:00
hniksic
94ca33d2fb [svn] Update the progress gauge even when the data does not arrive. 2003-11-08 08:56:52 -08:00
hniksic
0bd6576cde [svn] Timer code update. 2003-11-07 20:55:44 -08:00
hniksic
c8e92cdbd6 [svn] Get rid of ssl_iread/ssl_iwrite. Have the SSL code register its
routines for reading and writing data.
2003-11-05 17:12:03 -08:00
hniksic
8cd9b4cd8a [svn] New function xsleep that resumes sleeps interrupted by signals
on systems that support nanosleep.
2003-11-03 13:57:04 -08:00
hniksic
e2e9b753e4 [svn] Retired the `boolean' type. Renamed FREE_MAYBE to xfree_null and moved the
definition from wget.h to xmalloc.h.  Moved the DEFAULT_LOGFILE
define to log.h.  Moved the INFINITE_RECURSION define to recur.h.
2003-11-02 11:56:37 -08:00
hniksic
29e36b051b [svn] Fix -p so it doesn't imply FTP recursion. 2003-10-30 05:00:29 -08:00
hniksic
711bf72609 [svn] Remove VERY_LONG_TYPE; use LARGE_INT instead. Remove special code
for handling VERY_LONG_TYPE overflows.
Make opt.quota a LARGE_INT.
2003-10-11 06:57:11 -07:00
hniksic
f70c6cacb9 [svn] Initialize DUMMY storage for DT. 2003-10-02 14:21:07 -07:00
hniksic
451ddf3cf9 [svn] Display percentage in title bar.
By Gisle Vanem.
2003-09-30 14:24:36 -07:00
hniksic
b8fa2039a7 [svn] Doc update. 2003-09-22 06:34:55 -07:00
hniksic
95c647eb44 [svn] Split off non-URL related stuff from url.c to convert.c. 2003-09-21 15:47:14 -07:00
hniksic
b8e416c6c7 [svn] Adjust bandwidth limitation sleep for the error of previous sleeps.
Allow decimal numbers in bandwidth limit specification.
2003-09-20 21:05:12 -07:00
hniksic
7870937036 [svn] Allow decimal values for --timeout, --wait, and --waitretry.
Message-ID: <m3llsjhx6t.fsf@hniksic.iskon.hr>
2003-09-20 16:12:18 -07:00
hniksic
e7998ad916 [svn] Cosmetic fixes. 2003-09-20 14:08:41 -07:00
hniksic
fb67dd6201 [svn] Call ssl_iread with the correct arguments. 2003-09-18 04:43:00 -07:00
hniksic
d54b2a086d [svn] Improved --limit-rate for small bandwidths.
Message-ID: <m3znh5n2og.fsf@hniksic.iskon.hr>
2003-09-15 14:48:43 -07:00
hniksic
9228f0bf53 [svn] Update progress code to use higher timer resolution.
Message-ID: <m37k49oivp.fsf@hniksic.iskon.hr>
2003-09-15 14:14:15 -07:00
hniksic
b2be7522c7 [svn] Update the license to include the OpenSSL exception. 2002-05-17 19:16:36 -07:00
hniksic
9f2cc07924 [svn] Minor cleanup of iread/iwrite.
Published in <sxsk7raj1t2.fsf@florida.arsdigita.de>.
2002-04-13 22:19:27 -07:00
hniksic
5cb8a6f44d [svn] Add the POST method.
Published in <sxssn5yj4eb.fsf@florida.arsdigita.de>.
2002-04-13 21:22:47 -07:00
hniksic
5390ada318 [svn] Support FWTK-style proxies.
Published in <sxslmbsxptu.fsf@florida.arsdigita.de>.
2002-04-12 20:04:47 -07:00
abbotti
cfd7b9a951 [svn] Use new function to test filename for common html suffixes.
Submitted by Ian Abbott in <3CB72D29.4898.1F34872@localhost> with minor
changes to formatting and comments.
2002-04-12 11:53:39 -07:00
hniksic
6c4c785cc2 [svn] Remove redirection cycle detection.
Published in <sxs7kpzhcwa.fsf@florida.arsdigita.de>.
2002-01-30 11:12:20 -08:00
hniksic
eb0f9de05d [svn] Simplify indentation in sleep_between_retrievals. 2001-12-17 07:00:25 -08:00
hniksic
75699d6213 [svn] Use the new function `random_number' that doesn't depend on RAND_MAX
being defined.
Published in <sxsheqqq6xb.fsf@florida.arsdigita.de>.
2001-12-17 06:05:08 -08:00
hniksic
0fdc1bd8c0 [svn] Fix downloading of duplicate URLs.
Published in <sxsvgfmu2bj.fsf@florida.arsdigita.de>.
2001-12-04 13:03:35 -08:00
hniksic
406fb8bbef [svn] Don't allow more than 20 successive redirections. 2001-11-30 13:33:31 -08:00
hniksic
a4db28e20f [svn] Ignore -np when in -p mode.
Published in <sxsg06w2c52.fsf@florida.arsdigita.de>.
2001-11-30 13:17:53 -08:00
hniksic
fb98d1e4b0 [svn] Remove implicit declaration warnings. 2001-11-29 10:22:18 -08:00
hniksic
d4c6949a05 [svn] Use rand instead of random.
Published in <sxsu1vgataq.fsf@florida.arsdigita.de>.
2001-11-27 11:49:28 -08:00
hniksic
9b202ba261 [svn] * retr.c (retrieve_from_file): Initialize `new_file' to NULL to
prevent seg fault.
Submitted by Ian Abbott in <3C03DE8A.845.D0E495@localhost>.
2001-11-27 11:14:34 -08:00
hniksic
bae3162c03 [svn] New option `--limit-rate'.
Published in <sxsd725cn56.fsf@florida.arsdigita.de>.
2001-11-26 12:07:13 -08:00
hniksic
84396de673 [svn] Allocate a timer directly in get_contents.
Published in <sxsherhbsvl.fsf@florida.arsdigita.de>.
2001-11-26 04:49:10 -08:00
hniksic
024cb5ed3a [svn] A lot of host name changes.
Published in <sxs3d32856s.fsf@florida.arsdigita.de>.
2001-11-25 21:36:33 -08:00
hniksic
2c41d783c6 [svn] New option --random-wait.
Submitted by Alan Eldridge in
<200111042106.fA4L63b75804@wwweasel.geeksrus.net>.
2001-11-25 13:23:15 -08:00
hniksic
3afb9c659a [svn] Recursion and progress bar tweaks.
Published in <sxsd727cvc0.fsf@florida.arsdigita.de>.
2001-11-25 13:03:30 -08:00
hniksic
df05e7ff10 [svn] Handle <base href=...> when converting links.
Published in <sxsadxaae3t.fsf@florida.arsdigita.de>.
2001-11-25 10:40:55 -08:00
hniksic
05463c7121 [svn] Commit progress bar tweaks.
Published in <sxsd727cvc0.fsf@florida.arsdigita.de>.
2001-11-24 20:46:26 -08:00
hniksic
222e9465b7 [svn] Implemented breadth-first retrieval.
Published in <sxsherjczw2.fsf@florida.arsdigita.de>.
2001-11-24 19:10:34 -08:00
hniksic
177b6ee8fd [svn] Print the actual error message when parsing of redirection URL fails. 2001-11-24 06:02:10 -08:00
hniksic
cdcf67a5bd [svn] Big progress bar update.
Published in <sxsn11ehyn1.fsf@florida.arsdigita.de>.
2001-11-22 20:59:52 -08:00
hniksic
4d352013ac [svn] Replace opt.no_flush with a function to disable/enable flushing.
Published in <sxssnb6imyz.fsf@florida.arsdigita.de>.
2001-11-22 12:13:13 -08:00
hniksic
d5be8ecca4 [svn] Rewrite parsing and handling of URLs.
Published in <sxs4rnnlklo.fsf@florida.arsdigita.de>.
2001-11-21 16:24:28 -08:00
hniksic
f178e6c613 [svn] Clean up handling of schemes.
Published in <sxswv0n7h7s.fsf@florida.arsdigita.de>.
2001-11-18 16:12:05 -08:00
hniksic
0b056d1720 [svn] Update copyright notices. 2001-05-27 12:35:15 -07:00
hniksic
72eca0976b [svn] Commit several minor changes:
* main.c (print_help): Document `--no-http-keep-alive'.

* utils.c (numdigit): Handle negative numbers *correctly*.

* hash.c (make_nocase_string_hash_table): Use term "nocase" rather
than the confusing "unsigned".

* utils.c (string_set_contains): Renamed from string_set_exists.

* hash.c (hash_table_contains): Renamed from hash_table_exists.

* cookies.c: Move case-insensitive hash tables to hash.c.

Published in <sxsheyq9vvl.fsf@florida.arsdigita.de>.
2001-05-12 13:06:41 -07:00
hniksic
45eea7387f [svn] New timer functions. Published in <sxs8zkp28vk.fsf@florida.arsdigita.de>. 2001-04-24 16:09:45 -07:00
hniksic
6ba7711d6a [svn] Print download rate even when percentages are not available.
Published in <sxs1yqjd4ct.fsf@florida.arsdigita.de>.
2001-04-23 08:28:34 -07:00
hniksic
61bb00adc0 [svn] Various url.c-related changes.
Published in <sxsvgo8nmub.fsf@florida.arsdigita.de>.

* retr.c (retrieve_url): Call uri_merge, not url_concat.
* html-url.c (collect_tags_mapper): Call uri_merge, not
url_concat.
* url.c (mkstruct): Use encode_string instead of xstrdup followed
by URL_CLEANSE.
(path_simplify_with_kludge): Deleted.
(contains_unsafe): Deleted.
(construct): Renamed to uri_merge_1.
(url_concat): Renamed to uri_merge.
* url.c (str_url): Use encode_string instead of the unnecessary
CLEANDUP.
(encode_string_maybe): New function, returns input string if no
encoding is needed.
(encode_string): Call encode_string_maybe to do the dirty work,
xstrdup if no work needed.
* wget.h (XDIGIT_TO_xchar): Define here.
* url.c (decode_string): Use new name.
(encode_string): Ditto.
* http.c (XDIGIT_TO_xchar): Rename HEXD2asc to XDIGIT_TO_xchar.
(dump_hash): Use new name.
* wget.h: Rename ASC2HEXD and HEXD2ASC to XCHAR_TO_XDIGIT and
XDIGIT_TO_XCHAR respectively.
2001-04-13 21:11:35 -07:00
hniksic
66a0c39e22 [svn] Make sure opt.dfp is rewound only on the first retrieval.
(See the code comment for a more detailed explanation.)
Published in <sxsy9tkqizz.fsf@florida.arsdigita.de>.
2001-04-01 16:50:04 -07:00
hniksic
728584d072 [svn] Record downloaded files and downloaded HTML files in all cases.
Published under the subject "Link conversion fix" in
<sxsn1a2n2zd.fsf@florida.arsdigita.de>.
2001-03-30 18:05:54 -08:00
hniksic
1a6058b1ec [svn] Applied Philipp Thomas's safe-ctype patch. Published in
<20010330025159.U21662@jeffreys.suse.de>.
2001-03-30 14:36:59 -08:00
hniksic
71a53ffe25 [svn] Applied Jonas Jensen's patch to correctly calculate the number of bytes
in the first line of the download that.
Published in <sxsu25hioxj.fsf@florida.arsdigita.de>.
2001-03-01 14:25:20 -08:00
hniksic
366b832e3f [svn] Applied Jonas Jensen's download rate patch with my additions.
Published in <015b01c09084$ce2d9330$0100000a@bob> and
<sxspugq3xai.fsf@florida.arsdigita.de>.
2001-02-10 16:00:30 -08:00
hniksic
2e8fc46b7b [svn] Include <netdb.h> where h_errno is used. Likewise for <errno.h> and errno.
From <sxsvgsi7wcw.fsf@florida.arsdigita.de>.
2000-12-17 10:12:02 -08:00
hniksic
b0e9dfd4e2 [svn] Fix opt.wait. 2000-12-05 16:24:40 -08:00
hniksic
7828e81c79 [svn] Committed C. Frankel's SSL patch. 2000-12-05 15:09:41 -08:00
hniksic
2ffb47eabf [svn] Committed <sxsbsv854j9.fsf@florida.arsdigita.de>. 2000-11-22 08:58:28 -08:00
hniksic
b0b1c815c1 [svn] A bunch of new features:
- use mmap() to read whole files in core instead of allocating memory
  and read'ing it.

- use a new, more general, HTML parser (html-parse.c) and interface to
  it from Wget (html-url.c).

- respect <meta name=robots content=nofollow> (easy with the new HTML
  parser).

- use hash tables instead of linked lists in places where the lists
  were used to facilitate mappings.

- rewrite the code in host.c to be more readable and faster (hash
  tables instead of home-grown lists.)

- make convert_links properly convert partial URLs to complete ones
  for those URLs that have *not* been downloaded.

- use HTTP persistent connections where available.  very
  simple-minded, caches the last connection to the server.

Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
2000-11-19 12:50:10 -08:00
hniksic
eef4a668b7 [svn] Update copyright blurbs with the year 2000. 2000-11-01 17:50:03 -08:00
hniksic
986c445029 [svn] Fixed minor memory leaks. 2000-11-01 16:18:27 -08:00
hniksic
b7a8c6d3f5 [svn] Gracefully handle opt.downloaded overflowing.
Published in <sxsd7gfnv17.fsf@florida.arsdigita.de>.
2000-11-01 15:17:31 -08:00
hniksic
6d13e17142 [svn] Detect redirection cycles.
Published in <sxsd7ggtjac.fsf@florida.arsdigita.de>.
2000-10-31 20:21:50 -08:00
hniksic
0dd418242a [svn] Committed my patches from <sxsbsw16sbu.fsf@florida.arsdigita.de>
and <sxsvgu824xk.fsf@florida.arsdigita.de>.
2000-10-31 11:25:32 -08:00
hniksic
b095202cad [svn] Applied Adrian Aichner's patch from
<20001029223711.28688.qmail@web10601.mail.yahoo.com>.
2000-10-30 13:07:04 -08:00
dan
24c465b5ad [svn] retr.c (retrieve_url): Manually applied T. Bharath
<TBharath@responsenetworks.com>'s patch to get wget to grok illegal relative URL
redirects.  Reformatted and re-commented it.
2000-10-27 20:18:20 -07:00
dan
f4673bcdaf [svn] --delete-after wasn't implemented for files retrieved by FTP or corresponding to
files specified on the commandline.  Made --convert-links be ignored when
--delete-after is specified.  Added note about this fact to --delete-after docs
and made general improvements to them, including the clarification that
--delete-after only deletes local files.
2000-10-23 20:43:47 -07:00
dan
7931200609 [svn] * *.{gmo,po,pot}: Regenerated after modifying wget --help output.
* ftp.c (ftp_retrieve_list): Use new INFINITE_RECURSION #define.

* html.c: htmlfindurl() now takes final `dash_p_leaf_HTML' parameter.
Wrapped some > 80-column lines.  When -p is specified and we're at a
leaf node, do not traverse <A>, <AREA>, or <LINK> tags other than
<LINK REL="stylesheet">.

* html.h (htmlfindurl): Now takes final `dash_p_leaf_HTML' parameter.

* init.c: Added new -p / --page-requisites / page_requisites option.

* main.c (print_help): Clarified that -l inf and -l 0 both allow
infinite recursion.  Changed the unhelpful --mirror description
to simply give the options it's equivalent to.  Added new -p option.
(main): Added some comments; handle new -p / --page-requisites.

* options.h (struct options): Added new page_requisites field.

* recur.c: Changed "URL-s" to "URLs" and "HTML-s" to "HTMLs".
Calculate and pass down new `dash_p_leaf_HTML' parameter to
get_urls_html().  Use new INFINITE_RECURSION #define.

* retr.c: Changed "URL-s" to "URLs".  get_urls_html() now takes
final `dash_p_leaf_HTML' parameter.

* url.c: get_urls_html() and htmlfindurl() now take final
`dash_p_leaf_HTML' parameter.

* url.h (get_urls_html): Now takes final `dash_p_leaf_HTML' parameter.

* wget.h: Added some comments and new INFINITE_RECURSION #define.

* wget.texi (Recursive Retrieval Options): Documented new -p option.
2000-08-30 04:26:21 -07:00
hniksic
7794db052c [svn] Committed Jan Prikryl's patch from
<20000709171425.A16267@launzatte.cg.tuwien.ac.at>.
2000-07-14 07:15:23 -07:00
hniksic
1dc66a6cf6 [svn] *** empty log message *** 2000-03-02 06:23:22 -08:00
dan
4331c39c9a [svn] Implemented the item I formerly had in the TODO: When -K and -N are used
together, we compare local file X.orig (if extant) against server file X.
Previously -k and -N were worthless in combination because the local converted
files always differed from the server versions.
2000-03-01 22:33:48 -08:00
kwget
31d6616c48 [svn] Initial revision 1999-12-01 23:42:23 -08:00