hniksic
74fbb03b10
[svn] Use bool type for boolean variables and values.
2005-06-22 12:38:10 -07:00
hniksic
277e840a0f
[svn] Remove K&R support.
2005-06-19 15:34:58 -07:00
hniksic
4206f96614
[svn] Gettext-ize previously missed messages.
...
By Benno Schulenberg.
2005-06-15 13:26:37 -07:00
hniksic
1b49b07598
[svn] Mark the content from the <link src="..."> tag as expecting HTML.
...
Submitted by Andreas Beckmann.
2005-05-27 06:53:47 -07:00
hniksic
1fab70a664
[svn] Fix warnings reported by gcc -Wstrict-prototypes -Wmissing-prototypes.
2005-05-08 02:58:39 -07:00
hniksic
ef22bf610a
[svn] Large file support added. Published in <87psyr6jn7.fsf@xemacs.org>.
2005-02-23 14:21:04 -08:00
hniksic
3f84a5e00e
[svn] Squash embedded newlines in links.
2003-11-26 08:37:04 -08:00
hniksic
eb88464568
[svn] Download URLs from <object data="...">.
2003-11-08 17:33:33 -08:00
hniksic
c929628ba8
[svn] No need to initialize NEWEL->next to NULL, it's already NULL.
2003-11-07 04:08:05 -08:00
hniksic
095d89995e
[svn] Abort on xfree(NULL).
2003-11-02 13:12:49 -08:00
hniksic
e2e9b753e4
[svn] Retired the `boolean' type. Renamed FREE_MAYBE to xfree_null and moved the
...
definition from wget.h to xmalloc.h. Moved the DEFAULT_LOGFILE
define to log.h. Moved the INFINITE_RECURSION define to recur.h.
2003-11-02 11:56:37 -08:00
hniksic
5f0a2b3f08
[svn] Use new macros xnew, xnew0, xnew_array, and xnew0_array in various places.
2003-10-31 06:55:50 -08:00
hniksic
5535c9ff17
[svn] Expect links followed from Refreshes to contain URLs.
2003-10-13 07:32:25 -07:00
hniksic
1b3cdef574
[svn] Don't descend into HTML that was downloaded by following <img src=...>
...
and such.
2003-10-10 07:25:10 -07:00
hniksic
ffc2d0f653
[svn] Make the first argument to hash_table_new a minimal count of items before
...
regrow, not raw size, which is more useful.
2003-10-09 19:46:09 -07:00
hniksic
233ebb78de
[svn] Use hash table for tag lookup in html-url.c and html-parse.c.
2003-10-09 08:01:58 -07:00
hniksic
ae1d264fcc
[svn] Add FLAGS argument to map_html_tags.
2003-10-08 09:17:33 -07:00
hniksic
a9c3c58c9f
[svn] Switch to binary search for find_tag.
2003-10-08 09:00:10 -07:00
hniksic
95c647eb44
[svn] Split off non-URL related stuff from url.c to convert.c.
2003-09-21 15:47:14 -07:00
hniksic
7b5fb50cb1
[svn] Renamed wget.h XDIGIT-related macros to (hopefully) clearer names.
2003-09-19 07:08:37 -07:00
hniksic
b2be7522c7
[svn] Update the license to include the OpenSSL exception.
2002-05-17 19:16:36 -07:00
hniksic
9de4909bea
[svn] Update copyright years.
2002-04-11 18:23:23 -07:00
hniksic
8f93191f26
[svn] Convert URLs in <form action=...>.
...
Published in <sxsit6yf7jo.fsf@florida.arsdigita.de>.
2002-04-11 10:51:45 -07:00
hniksic
0e40fc9a3c
[svn] Don't crash on <meta http-equiv=refresh>.
...
Published in <sxshep1q3je.fsf@florida.arsdigita.de>.
2002-01-31 19:34:31 -08:00
hniksic
ca9319aaba
[svn] Treat the "shortcut icon" link as inline.
...
Published in <sxsadwgko2q.fsf@florida.arsdigita.de>.
2001-12-18 17:15:34 -08:00
hniksic
3ddcea34a4
[svn] Mark <embed href=...> as external.
2001-12-12 23:18:59 -08:00
hniksic
390adeea93
[svn] Use "incomplete" rather than "relative".
2001-12-12 10:32:17 -08:00
hniksic
d425985c37
[svn] Minor doc update.
2001-12-12 09:01:26 -08:00
hniksic
8817f4c1a4
[svn] Break up collect_tags_mapper into several functions.
...
Published in <sxsg06gla1h.fsf@florida.arsdigita.de>.
2001-12-12 07:43:01 -08:00
hniksic
dd84231c6a
[svn] Minor fixes prompted by `lint'.
...
Published in <sxsadwt2nkg.fsf@florida.arsdigita.de>.
2001-12-08 17:24:41 -08:00
hniksic
936b074bd9
[svn] Doc fix.
2001-12-04 16:02:49 -08:00
hniksic
f4d019a423
[svn] Correctly convert links in <meta http-equiv=Refresh content="...">.
...
Published in <sxsadx3wp49.fsf@florida.arsdigita.de>.
2001-11-30 20:18:51 -08:00
hniksic
a4db28e20f
[svn] Ignore -np when in -p mode.
...
Published in <sxsg06w2c52.fsf@florida.arsdigita.de>.
2001-11-30 13:17:53 -08:00
hniksic
df05e7ff10
[svn] Handle <base href=...> when converting links.
...
Published in <sxsadxaae3t.fsf@florida.arsdigita.de>.
2001-11-25 10:40:55 -08:00
hniksic
222e9465b7
[svn] Implemented breadth-first retrieval.
...
Published in <sxsherjczw2.fsf@florida.arsdigita.de>.
2001-11-24 19:10:34 -08:00
hniksic
f178e6c613
[svn] Clean up handling of schemes.
...
Published in <sxswv0n7h7s.fsf@florida.arsdigita.de>.
2001-11-18 16:12:05 -08:00
hniksic
3d9dda6485
[svn] Process attributes in order in which they appear in the tag.
...
Submitted by Ian Abbott in <3B868388.6538.14A7848@localhost>
based on analysis by Edward Sabol.
2001-11-16 11:44:42 -08:00
hniksic
0b056d1720
[svn] Update copyright notices.
2001-05-27 12:35:15 -07:00
hniksic
9ae0328c3d
[svn] Applied Roger Beeman's mktime_from_utc fix published in
...
<Pine.HPX.4.02.10104181128180.6232-100000@mail1.cisco.com>.
Also, minor doc fixes.
2001-04-24 17:50:22 -07:00
hniksic
61bb00adc0
[svn] Various url.c-related changes.
...
Published in <sxsvgo8nmub.fsf@florida.arsdigita.de>.
* retr.c (retrieve_url): Call uri_merge, not url_concat.
* html-url.c (collect_tags_mapper): Call uri_merge, not
url_concat.
* url.c (mkstruct): Use encode_string instead of xstrdup followed
by URL_CLEANSE.
(path_simplify_with_kludge): Deleted.
(contains_unsafe): Deleted.
(construct): Renamed to uri_merge_1.
(url_concat): Renamed to uri_merge.
* url.c (str_url): Use encode_string instead of the unnecessary
CLEANDUP.
(encode_string_maybe): New function, returns input string if no
encoding is needed.
(encode_string): Call encode_string_maybe to do the dirty work,
xstrdup if no work needed.
* wget.h (XDIGIT_TO_xchar): Define here.
* url.c (decode_string): Use new name.
(encode_string): Ditto.
* http.c (XDIGIT_TO_xchar): Rename HEXD2asc to XDIGIT_TO_xchar.
(dump_hash): Use new name.
* wget.h: Rename ASC2HEXD and HEXD2ASC to XCHAR_TO_XDIGIT and
XDIGIT_TO_XCHAR respectively.
2001-04-13 21:11:35 -07:00
hniksic
1a6058b1ec
[svn] Applied Philipp Thomas's safe-ctype patch. Published in
...
<20010330025159.U21662@jeffreys.suse.de>.
2001-03-30 14:36:59 -08:00
dan
040aae87b5
[svn] html-url.c: A bunch of fixup of `--page-requisites'-related comments to reflect
...
Hrvoje's changes to my code when transplanting it into this new file, to fix
spelling mistakes, to clarify, etc.
2001-01-09 18:54:52 -08:00
dan
bc5fd29baf
[svn] 2001-01-09 Dan Harkless <wget@harkless.org>
...
* html-url.c: Addition and clarification of comments related to -p.
* url.c (write_backup_file): Clarified a comment.
[Committed this fix separately.]
2001-01-09 18:28:24 -08:00
hniksic
1cddc05edb
[svn] Committed memory debugging stuff.
...
Published in <sxs1yw34pt4.fsf@florida.arsdigita.de>.
2000-11-22 14:15:45 -08:00
hniksic
2ffb47eabf
[svn] Committed <sxsbsv854j9.fsf@florida.arsdigita.de>.
2000-11-22 08:58:28 -08:00
hniksic
6e598c81e3
[svn] Committed a bunch of different tweaks of mine.
...
Published in <sxsr9463wrx.fsf@florida.arsdigita.de>.
2000-11-20 18:06:36 -08:00
hniksic
b0b1c815c1
[svn] A bunch of new features:
...
- use mmap() to read whole files in core instead of allocating memory
and read'ing it.
- use a new, more general, HTML parser (html-parse.c) and interface to
it from Wget (html-url.c).
- respect <meta name=robots content=nofollow> (easy with the new HTML
parser).
- use hash tables instead of linked lists in places where the lists
were used to facilitate mappings.
- rewrite the code in host.c to be more readable and faster (hash
tables instead of home-grown lists.)
- make convert_links properly convert partial URLs to complete ones
for those URLs that have *not* been downloaded.
- use HTTP persistent connections where available. very
simple-minded, caches the last connection to the server.
Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
2000-11-19 12:50:10 -08:00