Commit Graph

34 Commits

Author SHA1 Message Date
hniksic b2be7522c7 [svn] Update the license to include the OpenSSL exception. 2002-05-17 19:16:36 -07:00
hniksic 8c3287dabf [svn] Remove the OPTIONS_DEFINED_HERE kludge.
Published in <sxs4rijqmhv.fsf@florida.arsdigita.de>.
2002-04-10 14:20:55 -07:00
hniksic 562ab4ae8a [svn] (DO_REALLOC_FROM_ALLOCA): Multiply with sizeof(type) when calling
xmalloc and memcpy.
Submitted by Andreas Damm in
<Pine.LNX.4.33.0202011632280.1281-200000@gate.rad-ab.u-net.com>.
2002-02-18 21:32:59 -08:00
hniksic 1365950c01 [svn] Logging system bugfixes and improvements.
Published in <sxsofl7vdyo.fsf@florida.arsdigita.de>.
2001-12-09 21:31:45 -08:00
hniksic 222e9465b7 [svn] Implemented breadth-first retrieval.
Published in <sxsherjczw2.fsf@florida.arsdigita.de>.
2001-11-24 19:10:34 -08:00
hniksic 8e082d2558 [svn] Remove accidentally committed DEBUG_MALLOC. 2001-11-22 21:01:32 -08:00
hniksic cdcf67a5bd [svn] Big progress bar update.
Published in <sxsn11ehyn1.fsf@florida.arsdigita.de>.
2001-11-22 20:59:52 -08:00
hniksic 4d352013ac [svn] Replace opt.no_flush with a function to disable/enable flushing.
Published in <sxssnb6imyz.fsf@florida.arsdigita.de>.
2001-11-22 12:13:13 -08:00
hniksic d5be8ecca4 [svn] Rewrite parsing and handling of URLs.
Published in <sxs4rnnlklo.fsf@florida.arsdigita.de>.
2001-11-21 16:24:28 -08:00
hniksic b4e6054de9 [svn] Check for do_realloc_newsize in loop condition because we're no longer
setting SIZEVAR here.
2001-06-26 02:48:51 -07:00
hniksic d015d3cbd3 [svn] Fix access violation bug in DO_REALLOC_FROM_ALLOCA.
Published in <sxspubriqay.fsf@florida.arsdigita.de>.
2001-06-26 01:45:09 -07:00
hniksic 0b056d1720 [svn] Update copyright notices. 2001-05-27 12:35:15 -07:00
hniksic d70101ea4e [svn] Commit various HTTP fixes.
Published in <sxsy9slhu7g.fsf@florida.arsdigita.de>.

* http.c (gethttp): Return RETRUNNEEDED when the retrieval is
unneeded because the file is already there and fully downloaded,
and -c is specified.
(http_loop): Handle RETRUNNEEDED.

* wget.h (uerr_t): New value RETRUNNEEDED.

* http.c (http_loop): Set no_truncate for files that both exist
and are non-empty.
(gethttp): Consider the download finished when restval >= contlen,
not only when restval==contlen.
(gethttp): Handle redirection before giving up due to -c.
(gethttp): Clarify error message which explains that -c will not
truncate the file.
(gethttp): When returning CONTNOTSUPPORTED, don't forget to free
the stuff that needs freeing and release the socket.

* main.c (print_help): Wget booleans accept "off", not "no".
2001-04-28 09:22:20 -07:00
hniksic 61bb00adc0 [svn] Various url.c-related changes.
Published in <sxsvgo8nmub.fsf@florida.arsdigita.de>.

* retr.c (retrieve_url): Call uri_merge, not url_concat.
* html-url.c (collect_tags_mapper): Call uri_merge, not
url_concat.
* url.c (mkstruct): Use encode_string instead of xstrdup followed
by URL_CLEANSE.
(path_simplify_with_kludge): Deleted.
(contains_unsafe): Deleted.
(construct): Renamed to uri_merge_1.
(url_concat): Renamed to uri_merge.
* url.c (str_url): Use encode_string instead of the unnecessary
CLEANDUP.
(encode_string_maybe): New function, returns input string if no
encoding is needed.
(encode_string): Call encode_string_maybe to do the dirty work,
xstrdup if no work needed.
* wget.h (XDIGIT_TO_xchar): Define here.
* url.c (decode_string): Use new name.
(encode_string): Ditto.
* http.c (XDIGIT_TO_xchar): Rename HEXD2asc to XDIGIT_TO_xchar.
(dump_hash): Use new name.
* wget.h: Rename ASC2HEXD and HEXD2ASC to XCHAR_TO_XDIGIT and
XDIGIT_TO_XCHAR respectively.
2001-04-13 21:11:35 -07:00
hniksic 2cfb2d2ef6 [svn] Added support for cookies. 2001-04-08 15:25:24 -07:00
hniksic a52aaeb451 [svn] Make sure -c doesn't truncate the file.
Published in <sxsofuhi4kc.fsf@florida.arsdigita.de>.
2001-04-01 13:04:21 -07:00
hniksic 1a6058b1ec [svn] Applied Philipp Thomas's safe-ctype patch. Published in
<20010330025159.U21662@jeffreys.suse.de>.
2001-03-30 14:36:59 -08:00
hniksic 07f60f19a4 [svn] Applied Christian Fraenkel's patch "custom certificate patch for wget-1.7+dev;
2nd try", published in <200102082132220160.00474E73@pop.gmx.net>.
It adds two options for custom certificates.
2001-02-10 14:33:31 -08:00
dan f105b3e43f [svn] Changed my address from dan-wget@dilvish.speed.net to wget@harkless.org
everywhere -- dilvish will probably be going away soon.
2000-12-31 03:18:37 -08:00
hniksic bdea0b3bf6 [svn] Made xfree alias to free if memory debugging is not used.
From <sxsd7ezd7p2.fsf@florida.arsdigita.de>.
2000-12-10 18:17:44 -08:00
hniksic 7828e81c79 [svn] Committed C. Frankel's SSL patch. 2000-12-05 15:09:41 -08:00
hniksic 1cddc05edb [svn] Committed memory debugging stuff.
Published in <sxs1yw34pt4.fsf@florida.arsdigita.de>.
2000-11-22 14:15:45 -08:00
hniksic 2ffb47eabf [svn] Committed <sxsbsv854j9.fsf@florida.arsdigita.de>. 2000-11-22 08:58:28 -08:00
hniksic b0b1c815c1 [svn] A bunch of new features:
- use mmap() to read whole files in core instead of allocating memory
  and read'ing it.

- use a new, more general, HTML parser (html-parse.c) and interface to
  it from Wget (html-url.c).

- respect <meta name=robots content=nofollow> (easy with the new HTML
  parser).

- use hash tables instead of linked lists in places where the lists
  were used to facilitate mappings.

- rewrite the code in host.c to be more readable and faster (hash
  tables instead of home-grown lists.)

- make convert_links properly convert partial URLs to complete ones
  for those URLs that have *not* been downloaded.

- use HTTP persistent connections where available.  very
  simple-minded, caches the last connection to the server.

Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
2000-11-19 12:50:10 -08:00
hniksic 0e2b74ce3b [svn] Commit "minor fixes". 2000-11-06 13:24:57 -08:00
hniksic c2c821b3c9 [svn] snprintf.c addition. 2000-11-04 14:49:46 -08:00
hniksic b7a8c6d3f5 [svn] Gracefully handle opt.downloaded overflowing.
Published in <sxsd7gfnv17.fsf@florida.arsdigita.de>.
2000-11-01 15:17:31 -08:00
dan b3e2c0ff97 [svn] Implemented and documented new -E / --html-extension / html_extension option. 2000-10-19 22:55:46 -07:00
dan 7931200609 [svn] * *.{gmo,po,pot}: Regenerated after modifying wget --help output.
* ftp.c (ftp_retrieve_list): Use new INFINITE_RECURSION #define.

* html.c: htmlfindurl() now takes final `dash_p_leaf_HTML' parameter.
Wrapped some > 80-column lines.  When -p is specified and we're at a
leaf node, do not traverse <A>, <AREA>, or <LINK> tags other than
<LINK REL="stylesheet">.

* html.h (htmlfindurl): Now takes final `dash_p_leaf_HTML' parameter.

* init.c: Added new -p / --page-requisites / page_requisites option.

* main.c (print_help): Clarified that -l inf and -l 0 both allow
infinite recursion.  Changed the unhelpful --mirrior description
to simply give the options it's equivalent to.  Added new -p option.
(main): Added some comments; handle new -p / --page-requisites.

* options.h (struct options): Added new page_requisites field.

* recur.c: Changed "URL-s" to "URLs" and "HTML-s" to "HTMLs".
Calculate and pass down new `dash_p_leaf_HTML' parameter to
get_urls_html().  Use new INFINITE_RECURSION #define.

* retr.c: Changed "URL-s" to "URLs".  get_urls_html() now takes
final `dash_p_leaf_HTML' parameter.

* url.c: get_urls_html() and htmlfindurl() now take final
`dash_p_leaf_HTML' parameter.

* url.h (get_urls_html): Now takes final `dash_p_leaf_HTML' parameter.

* wget.h: Added some comments and new INFINITE_RECURSION #define.

* wget.texi (Recursive Retrieval Options): Documented new -p option.
2000-08-30 04:26:21 -07:00
hniksic 6b4a85888e [svn] Commit several fixes. 2000-04-12 06:23:35 -07:00
hniksic c71f174ed6 [svn] Changes from <9t9pusol5a1.fsf@mraz.iskon.hr>. 2000-03-21 07:47:45 -08:00
dan 4454f6ce0a [svn] * TODO: Removed done item: we now have an option (-G) that makes it easy to
download a single HTML document and all its constituents.

* po/*.{gmo,po,pot}: Regenerated after adding new options.

* po/hr.po: Hrvoje forgot '\n's on his translations of my altered messages,
causing msgfmt to balk and `make install' to fail.


* wget.texi (Recursive Retrieval Options): In -K description, added a link to
the discussion of interaction with -N.
(Recursive Accept/Reject Options): Did some alphabetizing and added descriptions
of new --follow-tags and -G / --ignore-tags options.
(Following Links): Changed "the loads of" to "loads of".
(Wgetrc Commands): Added descriptions of new follow_tags and ignore_tags
commands.


* html.c (idmatch): Implemented checking of my new --follow-tags and
--ignore-tags options.

* init.c (commands): Added comment reminding people adding new entries doing
allocation to add corresponding freeing in cleanup().
(commands): Added new followtags and ignoretags commands.
(cleanup): Free storage for new followtags and ignoretags.

* main.c: Use of "comma-separated list" was random -- normalized it.  Did some
alphabetization.  Added comments pointing out "Options without arguments" and
"Options accepting an argument" sections of long_options[].  Added new options
--follow-tags and -G / --ignore-tags.  Added comment that Damir's --referer is
currently undocumented.  Added comment that Heiko's --waitretry is partially
undocumented (mentioned in --help but not in wget.texi).  Moved improperly
sorted 24, 129, and 'G' cases.

* options.h (struct options): Added new fields follow_tags and ignore_tags.

* wget.h: Added "#define EQ 0" so we can say "strcmp(a, b) == EQ".
2000-03-10 22:48:06 -08:00
dan 4331c39c9a [svn] Implemented the item I formerly had in the TODO: When -K and -N are used
together, we compare local file X.orig (if extant) against server file X.
Previously -k and -N were worthless in combination because the local converted
files always differed from the server versions.
2000-03-01 22:33:48 -08:00
kwget 31d6616c48 [svn] Initial revision 1999-12-01 23:42:23 -08:00