moparisthebest/wget - wget - code.moparisthebest.com

mirror of https://github.com/moparisthebest/wget synced 2024-07-03 16:38:41 -04:00

Author	SHA1	Message	Date
hniksic	df05e7ff10	[svn] Handle <base href=...> when converting links. Published in <sxsadxaae3t.fsf@florida.arsdigita.de>.	2001-11-25 10:40:55 -08:00
hniksic	222e9465b7	[svn] Implemented breadth-first retrieval. Published in <sxsherjczw2.fsf@florida.arsdigita.de>.	2001-11-24 19:10:34 -08:00
hniksic	f178e6c613	[svn] Clean up handling of schemes. Published in <sxswv0n7h7s.fsf@florida.arsdigita.de>.	2001-11-18 16:12:05 -08:00
hniksic	3d9dda6485	[svn] Process attributes in order in which they appear in the tag. Submitted by Ian Abbott in <3B868388.6538.14A7848@localhost> based on analysis by Edward Sabol.	2001-11-16 11:44:42 -08:00
hniksic	0b056d1720	[svn] Update copyright notices.	2001-05-27 12:35:15 -07:00
hniksic	9ae0328c3d	[svn] Applied Roger Beeman's mktime_from_utc fix published in <Pine.HPX.4.02.10104181128180.6232-100000@mail1.cisco.com>. Also, minor doc fixes.	2001-04-24 17:50:22 -07:00
hniksic	61bb00adc0	[svn] Various url.c-related changes. Published in <sxsvgo8nmub.fsf@florida.arsdigita.de>. * retr.c (retrieve_url): Call uri_merge, not url_concat. * html-url.c (collect_tags_mapper): Call uri_merge, not url_concat. * url.c (mkstruct): Use encode_string instead of xstrdup followed by URL_CLEANSE. (path_simplify_with_kludge): Deleted. (contains_unsafe): Deleted. (construct): Renamed to uri_merge_1. (url_concat): Renamed to uri_merge. * url.c (str_url): Use encode_string instead of the unnecessary CLEANDUP. (encode_string_maybe): New function, returns input string if no encoding is needed. (encode_string): Call encode_string_maybe to do the dirty work, xstrdup if no work needed. * wget.h (XDIGIT_TO_xchar): Define here. * url.c (decode_string): Use new name. (encode_string): Ditto. * http.c (XDIGIT_TO_xchar): Rename HEXD2asc to XDIGIT_TO_xchar. (dump_hash): Use new name. * wget.h: Rename ASC2HEXD and HEXD2ASC to XCHAR_TO_XDIGIT and XDIGIT_TO_XCHAR respectively.	2001-04-13 21:11:35 -07:00
hniksic	1a6058b1ec	[svn] Applied Philipp Thomas's safe-ctype patch. Published in <20010330025159.U21662@jeffreys.suse.de>.	2001-03-30 14:36:59 -08:00
dan	040aae87b5	[svn] html-url.c: A bunch of fixup of `--page-requisites'-related comments to reflect Hrvoje's changes to my code when transplanting it into this new file, to fix spelling mistakes, to clarify, etc.	2001-01-09 18:54:52 -08:00
dan	bc5fd29baf	[svn] 2001-01-09 Dan Harkless <wget@harkless.org> * html-url.c: Addition and clarification of comments related to -p. * url.c (write_backup_file): Clarified a comment. [Committed this fix separately.]	2001-01-09 18:28:24 -08:00
hniksic	1cddc05edb	[svn] Committed memory debugging stuff. Published in <sxs1yw34pt4.fsf@florida.arsdigita.de>.	2000-11-22 14:15:45 -08:00
hniksic	2ffb47eabf	[svn] Committed <sxsbsv854j9.fsf@florida.arsdigita.de>.	2000-11-22 08:58:28 -08:00
hniksic	6e598c81e3	[svn] Committed a bunch of different tweaks of mine. Published in <sxsr9463wrx.fsf@florida.arsdigita.de>.	2000-11-20 18:06:36 -08:00
hniksic	b0b1c815c1	[svn] A bunch of new features: - use mmap() to read whole files in core instead of allocating memory and read'ing it. - use a new, more general, HTML parser (html-parse.c) and interface to it from Wget (html-url.c). - respect <meta name=robots content=nofollow> (easy with the new HTML parser). - use hash tables instead of linked lists in places where the lists were used to facilitate mappings. - rewrite the code in host.c to be more readable and faster (hash tables instead of home-grown lists.) - make convert_links properly convert partial URLs to complete ones for those URLs that have not been downloaded. - use HTTP persistent connections where available. very simple-minded, caches the last connection to the server. Published in <sxshf533d5r.fsf@florida.arsdigita.de>.	2000-11-19 12:50:10 -08:00

1 2

64 Commits