mirror of https://github.com/moparisthebest/wget synced 2024-07-03 16:38:41 -04:00
Commit Graph

54 Commits

Author SHA1 Message Date
hniksic
cdcf67a5bd [svn] Big progress bar update.
Published in <sxsn11ehyn1.fsf@florida.arsdigita.de>.
2001-11-22 20:59:52 -08:00
hniksic
d5be8ecca4 [svn] Rewrite parsing and handling of URLs.
Published in <sxs4rnnlklo.fsf@florida.arsdigita.de>.
2001-11-21 16:24:28 -08:00
hniksic
f4dcb55851 [svn] Split the copyright notice for easier translation. 2001-11-21 15:15:19 -08:00
hniksic
de7f766f58 [svn] Remove --wait/--waitretry compatibility code.
Published in <sxs6686py1q.fsf@florida.arsdigita.de>.
2001-11-19 07:42:56 -08:00
hniksic
e8e8797873 [svn] Rewrite shorthand URLs in a step separate from parsing.
Published in <sxspu6f7ecz.fsf@florida.arsdigita.de>.
2001-11-18 17:14:14 -08:00
hniksic
ee99e8998a [svn] Don't compile getopt.c on systems that already have getopt_long.
Published in <sxshersbs0w.fsf@florida.arsdigita.de>.
2001-11-17 20:52:18 -08:00
hniksic
0b056d1720 [svn] Update copyright notices. 2001-05-27 12:35:15 -07:00
hniksic
72eca0976b [svn] Commit several minor changes:
* main.c (print_help): Document `--no-http-keep-alive'.

* utils.c (numdigit): Handle negative numbers *correctly*.

* hash.c (make_nocase_string_hash_table): Use term "nocase" rather
than the confusing "unsigned".

* utils.c (string_set_contains): Renamed from string_set_exists.

* hash.c (hash_table_contains): Renamed from hash_table_exists.

* cookies.c: Move case-insensitive hash tables to hash.c.

Published in <sxsheyq9vvl.fsf@florida.arsdigita.de>.
2001-05-12 13:06:41 -07:00
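
Note on the `numdigit` entry above: the commit message does not say what handling negative numbers "correctly" means. The sketch below assumes the goal is to count the characters needed to print a number, including the minus sign. The function name `count_printed_chars` is hypothetical and is not taken from utils.c.

```c
/* Hypothetical sketch, not the code from utils.c.
   Assumption: the result is the number of characters needed to print
   NUMBER in decimal, with one extra character for a leading '-'. */
int count_printed_chars(long number)
{
    int count = 1;              /* every number has at least one digit */

    if (number < 0)
        ++count;                /* room for the minus sign */

    /* Integer division truncates toward zero in C99, so the loop works
       for negative values without negating them, which would overflow
       for the most negative long. */
    while ((number /= 10) != 0)
        ++count;

    return count;
}
```

A caller could use the result to size a buffer before formatting the number, adding one byte for the terminating NUL.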
hniksic
00773e7d15 [svn] Make `--cookies' respect its arg.
Published in <sxs4rv8ht5e.fsf@florida.arsdigita.de>.
2001-04-29 03:56:43 -07:00
hniksic
dfc1eb5766 [svn] Change default anonymous FTP password to "-wget@".
Published in <sxsu239htnl.fsf@florida.arsdigita.de>.
2001-04-29 03:53:55 -07:00
hniksic
d70101ea4e [svn] Commit various HTTP fixes.
Published in <sxsy9slhu7g.fsf@florida.arsdigita.de>.

* http.c (gethttp): Return RETRUNNEEDED when the retrieval is
unneeded because the file is already there and fully downloaded,
and -c is specified.
(http_loop): Handle RETRUNNEEDED.

* wget.h (uerr_t): New value RETRUNNEEDED.

* http.c (http_loop): Set no_truncate for files that both exist
and are non-empty.
(gethttp): Consider the download finished when restval >= contlen,
not only when restval==contlen.
(gethttp): Handle redirection before giving up due to -c.
(gethttp): Clarify error message which explains that -c will not
truncate the file.
(gethttp): When returning CONTNOTSUPPORTED, don't forget to free
the stuff that needs freeing and release the socket.

* main.c (print_help): Wget booleans accept "off", not "no".
2001-04-28 09:22:20 -07:00
hniksic
d80f6cbe8c [svn] Reimplemented UNSAFE_CHAR and RESERVED_CHAR.
Fixed snprintf.c to avoid ISDIGIT.
2001-04-24 17:20:30 -07:00
hniksic
5fa53b5a1d [svn] Implement better sorting and matching of cookies.
Fix previously broken command-line options.
Published in <sxspuek9255.fsf@florida.arsdigita.de>.
2001-04-10 09:04:18 -07:00
hniksic
2cfb2d2ef6 [svn] Added support for cookies. 2001-04-08 15:25:24 -07:00
hniksic
f1d534cd8c [svn] Make `-c' and `-O' work together.
Published in <sxsitkmgokh.fsf@florida.arsdigita.de>.
2001-04-03 05:24:49 -07:00
hniksic
4dbad3a894 [svn] Make sure -C is recognized. 2001-04-01 14:26:28 -07:00
hniksic
e4d49f0b40 [svn] Applied my patch "main.c tweaks", published in
<sxspueyonjt.fsf@florida.arsdigita.de>.
2001-03-30 15:17:59 -08:00
hniksic
1a6058b1ec [svn] Applied Philipp Thomas's safe-ctype patch. Published in
<20010330025159.U21662@jeffreys.suse.de>.
2001-03-30 14:36:59 -08:00
dan
b6febbc84b [svn] * po/*.po*: Updated after moving -nr to a different category.
* wget.texi: Moved -nr from "Recursive Retrieval Options" to "FTP Options" and
  gave it a @cindex entry. Alphabetized FTP options by long option name.

* main.c (print_help): -nr belongs in "FTP options" section of --help output,
  not "Recursive retrieval" section.  Alphabetized FTP options by long option
  name.
2001-02-23 13:31:54 -08:00
dan
2aa1d53fb2 [svn] * main.c (print_help): --help documentation for -N said it would re-download
files if they had the _same_ timestamp on server.

* po/*.po*: Updated after changing --help's description of -N.
2001-02-23 12:43:35 -08:00
hniksic
07f60f19a4 [svn] Applied Christian Fraenkel's patch "custom certificate patch for wget-1.7+dev;
2nd try", published in <200102082132220160.00474E73@pop.gmx.net>.
It adds two options for custom certificates.
2001-02-10 14:33:31 -08:00
dan
f84a94e913 [svn] Made clarifications and corrections to --continue's --help output and wget.texi
documentation.  Published in <200101100629.WAA09744@dilvish.speed.net>.
2001-01-09 22:51:51 -08:00
dan
f105b3e43f [svn] Changed my address from dan-wget@dilvish.speed.net to wget@harkless.org
everywhere -- dilvish will probably be going away soon.
2000-12-31 03:18:37 -08:00
hniksic
5099ec0306 [svn] Apply lint-expired fixes from <sxsn1du7ufa.fsf@florida.arsdigita.de>. 2000-12-17 10:52:52 -08:00
hniksic
2e8fc46b7b [svn] Include <netdb.h> where h_errno is used. Likewise for <errno.h> and errno.
From <sxsvgsi7wcw.fsf@florida.arsdigita.de>.
2000-12-17 10:12:02 -08:00
hniksic
d5cd691b12 [svn] Fix -O timestamp bug.
From <sxs7l57u6o6.fsf@florida.arsdigita.de>.
2000-12-10 16:47:44 -08:00
hniksic
62994ffc8c [svn] Applied Karl Eichwalder's patch from
<20001124044348.AF524DD15E@tux.gnu.franken.de>.
2000-11-24 05:51:36 -08:00
hniksic
1cddc05edb [svn] Committed memory debugging stuff.
Published in <sxs1yw34pt4.fsf@florida.arsdigita.de>.
2000-11-22 14:15:45 -08:00
hniksic
2ffb47eabf [svn] Committed <sxsbsv854j9.fsf@florida.arsdigita.de>. 2000-11-22 08:58:28 -08:00
hniksic
f2f77d87fd [svn] New option --no-http-keep-alive.
Published in <sxsd7fr1pdf.fsf@florida.arsdigita.de>.
2000-11-19 16:04:06 -08:00
hniksic
b0b1c815c1 [svn] A bunch of new features:
- use mmap() to read whole files in core instead of allocating memory
  and read'ing it.

- use a new, more general, HTML parser (html-parse.c) and interface to
  it from Wget (html-url.c).

- respect <meta name=robots content=nofollow> (easy with the new HTML
  parser).

- use hash tables instead of linked lists in places where the lists
  were used to facilitate mappings.

- rewrite the code in host.c to be more readable and faster (hash
  tables instead of home-grown lists.)

- make convert_links properly convert partial URLs to complete ones
  for those URLs that have *not* been downloaded.

- use HTTP persistent connections where available.  very
  simple-minded, caches the last connection to the server.

Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
2000-11-19 12:50:10 -08:00
hniksic
eef4a668b7 [svn] Update copyright blurbs with the year 2000. 2000-11-01 17:50:03 -08:00
hniksic
b7a8c6d3f5 [svn] Gracefully handle opt.downloaded overflowing.
Published in <sxsd7gfnv17.fsf@florida.arsdigita.de>.
2000-11-01 15:17:31 -08:00
hniksic
29cdc8da20 [svn] Updated long_to_string(); enhanced opt.downloaded to use
64-bit types where available.
Published in <sxswvenqsmn.fsf@florida.arsdigita.de> and
<sxssnpbqshp.fsf@florida.arsdigita.de>.
2000-11-01 13:51:25 -08:00
dan
1396b30055 [svn] Manually applied Rob Mayoff <mayoff@dqd.com>'s patch (vs. 1.5.3, not 1.5.3+dev)
to add --bind-address, making many necessary alphabetization, coding style,
comment, documentation, and naming fixes and additions.
2000-10-23 23:19:17 -07:00
dan
2fbb4936a0 [svn] main.c (print_help): Clarified that --delete-after deletes local files. 2000-10-23 20:52:34 -07:00
dan
f4673bcdaf [svn] --delete-after wasn't implemented for files retrieved by FTP or corresponding to
files specified on the commandline.  Made --convert-links be ignored when
--delete-after is specified.  Added note about this fact to --delete-after docs
and made general improvements to them, including the clarification that
--delete-after only deletes local files.
2000-10-23 20:43:47 -07:00
hniksic
778160a155 [svn] hniksic@iskon.hr -> hniksic@arsdigita.com 2000-10-23 08:43:04 -07:00
dan
b3e2c0ff97 [svn] Implemented and documented new -E / --html-extension / html_extension option. 2000-10-19 22:55:46 -07:00
dan
cbf018d0c0 [svn] --retr-symlinks was not previously documented properly. Based on my newfound
understanding of what its limitations are, added a TODO item.  Also made a minor
tweak in html.c to silence a warning.
2000-10-09 15:43:11 -07:00
dan
570f12e437 [svn] Accidentally said "system.wgetrc" instead of "system wgetrc". 2000-08-30 14:08:06 -07:00
dan
7931200609 [svn] * *.{gmo,po,pot}: Regenerated after modifying wget --help output.
* ftp.c (ftp_retrieve_list): Use new INFINITE_RECURSION #define.

* html.c: htmlfindurl() now takes final `dash_p_leaf_HTML' parameter.
Wrapped some > 80-column lines.  When -p is specified and we're at a
leaf node, do not traverse <A>, <AREA>, or <LINK> tags other than
<LINK REL="stylesheet">.

* html.h (htmlfindurl): Now takes final `dash_p_leaf_HTML' parameter.

* init.c: Added new -p / --page-requisites / page_requisites option.

* main.c (print_help): Clarified that -l inf and -l 0 both allow
infinite recursion.  Changed the unhelpful --mirror description
to simply give the options it's equivalent to.  Added new -p option.
(main): Added some comments; handle new -p / --page-requisites.

* options.h (struct options): Added new page_requisites field.

* recur.c: Changed "URL-s" to "URLs" and "HTML-s" to "HTMLs".
Calculate and pass down new `dash_p_leaf_HTML' parameter to
get_urls_html().  Use new INFINITE_RECURSION #define.

* retr.c: Changed "URL-s" to "URLs".  get_urls_html() now takes
final `dash_p_leaf_HTML' parameter.

* url.c: get_urls_html() and htmlfindurl() now take final
`dash_p_leaf_HTML' parameter.

* url.h (get_urls_html): Now takes final `dash_p_leaf_HTML' parameter.

* wget.h: Added some comments and new INFINITE_RECURSION #define.

* wget.texi (Recursive Retrieval Options): Documented new -p option.
2000-08-30 04:26:21 -07:00
dan
001392bf2b [svn] * main.c (print_help): -B / --base was not mentioned. 2000-08-23 15:40:20 -07:00
dan
1f0acebeb0 [svn] * main.c (print_help): Modified -nc description to mention that it also prevents
the creation of multiple versions of the same file with ".<number>" suffixes.
2000-08-22 20:11:55 -07:00
dan
eea2d24220 [svn] Heiko's --help output for --waitretry was over 80 columns. Shortened. It also
said that 0 seconds are waited after the first retry, which I believe is
incorrect and does not match what's written elsewhere (e.g. wget.texi).  Changed
to 1.
2000-06-09 13:59:56 -07:00
dan
b05feb3ae2 [svn] Damir Dzeko <ddzeko@zesoi.fer.hr> did not document his new --referer option.
Did so (--help output and wget.texi).  Also tweaked --help output for --execute.
2000-05-22 19:29:38 -07:00
hniksic
80cebf2a76 [svn] hniksic@srce.hr -> hniksic@iskon.hr 2000-04-21 02:34:39 -07:00
hniksic
6b4a85888e [svn] Commit several fixes. 2000-04-12 06:23:35 -07:00
dan
4454f6ce0a [svn] * TODO: Removed done item: we now have an option (-G) that makes it easy to
download a single HTML document and all its constituents.

* po/*.{gmo,po,pot}: Regenerated after adding new options.

* po/hr.po: Hrvoje forgot '\n's on his translations of my altered messages,
causing msgfmt to balk and `make install' to fail.


* wget.texi (Recursive Retrieval Options): In -K description, added a link to
the discussion of interaction with -N.
(Recursive Accept/Reject Options): Did some alphabetizing and added descriptions
of new --follow-tags and -G / --ignore-tags options.
(Following Links): Changed "the loads of" to "loads of".
(Wgetrc Commands): Added descriptions of new follow_tags and ignore_tags
commands.


* html.c (idmatch): Implemented checking of my new --follow-tags and
--ignore-tags options.

* init.c (commands): Added comment reminding people adding new entries doing
allocation to add corresponding freeing in cleanup().
(commands): Added new followtags and ignoretags commands.
(cleanup): Free storage for new followtags and ignoretags.

* main.c: Use of "comma-separated list" was random -- normalized it.  Did some
alphabetization.  Added comments pointing out "Options without arguments" and
"Options accepting an argument" sections of long_options[].  Added new options
--follow-tags and -G / --ignore-tags.  Added comment that Damir's --referer is
currently undocumented.  Added comment that Heiko's --waitretry is partially
undocumented (mentioned in --help but not in wget.texi).  Moved improperly
sorted 24, 129, and 'G' cases.

* options.h (struct options): Added new fields follow_tags and ignore_tags.

* wget.h: Added "#define EQ 0" so we can say "strcmp(a, b) == EQ".
2000-03-10 22:48:06 -08:00
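
Note on the wget.h entry above: the commit message quotes the idiom `strcmp(a, b) == EQ`. A minimal sketch of how such a macro reads in practice follows. Only the `#define EQ 0` line comes from the commit message; the helper `is_stylesheet_rel` is hypothetical and exists only for illustration.

```c
#include <string.h>

/* From the commit message: lets callers write strcmp(a, b) == EQ. */
#define EQ 0

/* Hypothetical helper, not part of wget: returns nonzero when REL is
   exactly the string "stylesheet". strcmp returns 0 for equal strings,
   so comparing against EQ makes the intent explicit. */
int is_stylesheet_rel(const char *rel)
{
    return strcmp(rel, "stylesheet") == EQ;
}
```

The macro changes nothing at run time. It only spares the reader from remembering that a zero result from `strcmp` means the strings match.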
dan
d2e1d7fe9d [svn] Hrvoje didn't regenerate the .info files after changing wget.texi.
Got rid of newly-introduced nested-if warnings in ftp.c and http.c.  Fixed
apparently completely untested code in main.c that was trying to provide --wait
/ --waitretry backwards compatibility, but had multiple fundamental bugs.
2000-03-02 13:17:47 -08:00