48 Commits

Author SHA1 Message Date
hniksic fc532c9bee [svn] Don't explicitly set entry->next to NULL since entry is already zeroed out. 2005-06-26 13:29:39 -07:00
hniksic 74fbb03b10 [svn] Use bool type for boolean variables and values. 2005-06-22 12:38:10 -07:00
hniksic 277e840a0f [svn] Remove K&R support. 2005-06-19 15:34:58 -07:00
hniksic 4206f96614 [svn] Gettext-ize previously missed messages.
By Benno Schulenberg.
2005-06-15 13:26:37 -07:00
hniksic 1b49b07598 [svn] Mark the content from the <link src="..."> tag as expecting HTML.
Submitted by Andreas Beckmann.
2005-05-27 06:53:47 -07:00
hniksic 1fab70a664 [svn] Fix warnings reported by gcc -Wstrict-prototypes -Wmissing-prototypes. 2005-05-08 02:58:39 -07:00
hniksic ef22bf610a [svn] Large file support added. Published in <87psyr6jn7.fsf@xemacs.org>. 2005-02-23 14:21:04 -08:00
hniksic 3f84a5e00e [svn] Squash embedded newlines in links. 2003-11-26 08:37:04 -08:00
hniksic eb88464568 [svn] Download URLs from <object data="...">. 2003-11-08 17:33:33 -08:00
hniksic c929628ba8 [svn] No need to initialize NEWEL->next to NULL, it's already NULL. 2003-11-07 04:08:05 -08:00
hniksic 095d89995e [svn] Abort on xfree(NULL). 2003-11-02 13:12:49 -08:00
hniksic e2e9b753e4 [svn] Retired the `boolean' type. Renamed FREE_MAYBE to xfree_null and moved the
definition from wget.h to xmalloc.h.  Moved the DEFAULT_LOGFILE
define to log.h.  Moved the INFINITE_RECURSION define to recur.h.
2003-11-02 11:56:37 -08:00
hniksic 5f0a2b3f08 [svn] Use new macros xnew, xnew0, xnew_array, and xnew0_array in various places. 2003-10-31 06:55:50 -08:00
hniksic 5535c9ff17 [svn] Expect links followed from Refreshes to contain URLs. 2003-10-13 07:32:25 -07:00
hniksic 1b3cdef574 [svn] Don't descend into HTML that was downloaded by following <img src=...>
and such.
2003-10-10 07:25:10 -07:00
hniksic ffc2d0f653 [svn] Make the first argument to hash_table_new a minimal count of items before
regrow, not raw size, which is more useful.
2003-10-09 19:46:09 -07:00
hniksic 233ebb78de [svn] Use hash table for tag lookup in html-url.c and html-parse.c. 2003-10-09 08:01:58 -07:00
hniksic ae1d264fcc [svn] Add FLAGS argument to map_html_tags. 2003-10-08 09:17:33 -07:00
hniksic a9c3c58c9f [svn] Switch to binary search for find_tag. 2003-10-08 09:00:10 -07:00
hniksic 95c647eb44 [svn] Split off non-URL related stuff from url.c to convert.c. 2003-09-21 15:47:14 -07:00
hniksic 7b5fb50cb1 [svn] Renamed wget.h XDIGIT-related macros to (hopefully) clearer names. 2003-09-19 07:08:37 -07:00
hniksic b2be7522c7 [svn] Update the license to include the OpenSSL exception. 2002-05-17 19:16:36 -07:00
hniksic 9de4909bea [svn] Update copyright years. 2002-04-11 18:23:23 -07:00
hniksic 8f93191f26 [svn] Convert URLs in <form action=...>.
Published in <sxsit6yf7jo.fsf@florida.arsdigita.de>.
2002-04-11 10:51:45 -07:00
hniksic 0e40fc9a3c [svn] Don't crash on <meta http-equiv=refresh>.
Published in <sxshep1q3je.fsf@florida.arsdigita.de>.
2002-01-31 19:34:31 -08:00
hniksic ca9319aaba [svn] Treat the "shortcut icon" link as inline.
Published in <sxsadwgko2q.fsf@florida.arsdigita.de>.
2001-12-18 17:15:34 -08:00
hniksic 3ddcea34a4 [svn] Mark <embed href=...> as external. 2001-12-12 23:18:59 -08:00
hniksic 390adeea93 [svn] Use "incomplete" rather than "relative". 2001-12-12 10:32:17 -08:00
hniksic d425985c37 [svn] Minor doc update. 2001-12-12 09:01:26 -08:00
hniksic 8817f4c1a4 [svn] Break up collect_tags_mapper into several functions.
Published in <sxsg06gla1h.fsf@florida.arsdigita.de>.
2001-12-12 07:43:01 -08:00
hniksic dd84231c6a [svn] Minor fixes prompted by `lint'.
Published in <sxsadwt2nkg.fsf@florida.arsdigita.de>.
2001-12-08 17:24:41 -08:00
hniksic 936b074bd9 [svn] Doc fix. 2001-12-04 16:02:49 -08:00
hniksic f4d019a423 [svn] Correctly convert links in <meta http-equiv=Refresh content="...">.
Published in <sxsadx3wp49.fsf@florida.arsdigita.de>.
2001-11-30 20:18:51 -08:00
hniksic a4db28e20f [svn] Ignore -np when in -p mode.
Published in <sxsg06w2c52.fsf@florida.arsdigita.de>.
2001-11-30 13:17:53 -08:00
hniksic df05e7ff10 [svn] Handle <base href=...> when converting links.
Published in <sxsadxaae3t.fsf@florida.arsdigita.de>.
2001-11-25 10:40:55 -08:00
hniksic 222e9465b7 [svn] Implemented breadth-first retrieval.
Published in <sxsherjczw2.fsf@florida.arsdigita.de>.
2001-11-24 19:10:34 -08:00
hniksic f178e6c613 [svn] Clean up handling of schemes.
Published in <sxswv0n7h7s.fsf@florida.arsdigita.de>.
2001-11-18 16:12:05 -08:00
hniksic 3d9dda6485 [svn] Process attributes in order in which they appear in the tag.
Submitted by Ian Abbott in <3B868388.6538.14A7848@localhost>
based on analysis by Edward Sabol.
2001-11-16 11:44:42 -08:00
hniksic 0b056d1720 [svn] Update copyright notices. 2001-05-27 12:35:15 -07:00
hniksic 9ae0328c3d [svn] Applied Roger Beeman's mktime_from_utc fix published in
<Pine.HPX.4.02.10104181128180.6232-100000@mail1.cisco.com>.
Also, minor doc fixes.
2001-04-24 17:50:22 -07:00
hniksic 61bb00adc0 [svn] Various url.c-related changes.
Published in <sxsvgo8nmub.fsf@florida.arsdigita.de>.

* retr.c (retrieve_url): Call uri_merge, not url_concat.
* html-url.c (collect_tags_mapper): Call uri_merge, not
url_concat.
* url.c (mkstruct): Use encode_string instead of xstrdup followed
by URL_CLEANSE.
(path_simplify_with_kludge): Deleted.
(contains_unsafe): Deleted.
(construct): Renamed to uri_merge_1.
(url_concat): Renamed to uri_merge.
* url.c (str_url): Use encode_string instead of the unnecessary
CLEANDUP.
(encode_string_maybe): New function, returns input string if no
encoding is needed.
(encode_string): Call encode_string_maybe to do the dirty work,
xstrdup if no work needed.
* wget.h (XDIGIT_TO_xchar): Define here.
* url.c (decode_string): Use new name.
(encode_string): Ditto.
* http.c (XDIGIT_TO_xchar): Rename HEXD2asc to XDIGIT_TO_xchar.
(dump_hash): Use new name.
* wget.h: Rename ASC2HEXD and HEXD2ASC to XCHAR_TO_XDIGIT and
XDIGIT_TO_XCHAR respectively.
2001-04-13 21:11:35 -07:00
hniksic 1a6058b1ec [svn] Applied Philipp Thomas's safe-ctype patch. Published in
<20010330025159.U21662@jeffreys.suse.de>.
2001-03-30 14:36:59 -08:00
dan 040aae87b5 [svn] html-url.c: A bunch of fixup of `--page-requisites'-related comments to reflect
Hrvoje's changes to my code when transplanting it into this new file, to fix
spelling mistakes, to clarify, etc.
2001-01-09 18:54:52 -08:00
dan bc5fd29baf [svn] 2001-01-09 Dan Harkless <wget@harkless.org>
* html-url.c: Addition and clarification of comments related to -p.

	* url.c (write_backup_file): Clarified a comment.
	  [Committed this fix separately.]
2001-01-09 18:28:24 -08:00
hniksic 1cddc05edb [svn] Committed memory debugging stuff.
Published in <sxs1yw34pt4.fsf@florida.arsdigita.de>.
2000-11-22 14:15:45 -08:00
hniksic 2ffb47eabf [svn] Committed <sxsbsv854j9.fsf@florida.arsdigita.de>. 2000-11-22 08:58:28 -08:00
hniksic 6e598c81e3 [svn] Committed a bunch of different tweaks of mine.
Published in <sxsr9463wrx.fsf@florida.arsdigita.de>.
2000-11-20 18:06:36 -08:00
hniksic b0b1c815c1 [svn] A bunch of new features:
- use mmap() to read whole files in core instead of allocating memory
  and read'ing it.

- use a new, more general, HTML parser (html-parse.c) and interface to
  it from Wget (html-url.c).

- respect <meta name=robots content=nofollow> (easy with the new HTML
  parser).

- use hash tables instead of linked lists in places where the lists
  were used to facilitate mappings.

- rewrite the code in host.c to be more readable and faster (hash
  tables instead of home-grown lists.)

- make convert_links properly convert partial URLs to complete ones
  for those URLs that have *not* been downloaded.

- use HTTP persistent connections where available.  very
  simple-minded, caches the last connection to the server.

Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
2000-11-19 12:50:10 -08:00
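
The first item in the 2000-11-19 commit above, reading a whole file through mmap() instead of allocating a buffer and read'ing into it, can be sketched roughly as follows. This is a minimal POSIX illustration and not Wget's actual code: the names `struct mapped_file`, `map_file`, and `unmap_file` are hypothetical and were chosen only for this example.

```c
#include <fcntl.h>
#include <stddef.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

/* Hypothetical type (not from Wget): a file mapped read-only into memory. */
struct mapped_file {
  const char *data;   /* start of the contents, or NULL if the file is empty */
  size_t length;      /* size of the file in bytes */
};

/* Map the file at PATH read-only and describe it in OUT.
   Returns 0 on success, -1 on failure.  */
int
map_file (const char *path, struct mapped_file *out)
{
  struct stat st;
  void *p;
  int fd = open (path, O_RDONLY);

  if (fd < 0)
    return -1;
  if (fstat (fd, &st) < 0)
    {
      close (fd);
      return -1;
    }

  out->data = NULL;
  out->length = (size_t) st.st_size;
  if (out->length == 0)
    {
      /* mmap rejects zero-length mappings, so report an empty file
         without mapping anything.  */
      close (fd);
      return 0;
    }

  p = mmap (NULL, out->length, PROT_READ, MAP_PRIVATE, fd, 0);
  close (fd);   /* the mapping remains valid after the descriptor is closed */
  if (p == MAP_FAILED)
    return -1;

  out->data = p;
  return 0;
}

/* Release a mapping obtained from map_file and reset the descriptor.  */
void
unmap_file (struct mapped_file *mf)
{
  if (mf->data != NULL)
    munmap ((void *) mf->data, mf->length);
  mf->data = NULL;
  mf->length = 0;
}
```

A caller would invoke `map_file`, scan `data[0]` through `data[length - 1]` (the mapping is not NUL-terminated), and then call `unmap_file`. The sketch assumes a regular file on a POSIX system; code meant to run elsewhere would likely need a fallback that allocates a buffer and uses read().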