1
0
mirror of https://github.com/moparisthebest/wget synced 2024-07-03 16:38:41 -04:00
Commit Graph

82 Commits

Author SHA1 Message Date
hniksic
233ebb78de [svn] Use hash table for tag lookup in html-url.c and html-parse.c. 2003-10-09 08:01:58 -07:00
hniksic
ae1d264fcc [svn] Add FLAGS argument to map_html_tags. 2003-10-08 09:17:33 -07:00
hniksic
a9c3c58c9f [svn] Switch to binary search for find_tag. 2003-10-08 09:00:10 -07:00
hniksic
95c647eb44 [svn] Split off non-URL related stuff from url.c to convert.c. 2003-09-21 15:47:14 -07:00
hniksic
7b5fb50cb1 [svn] Renamed wget.h XDIGIT-related macros to (hopefully) clearer names. 2003-09-19 07:08:37 -07:00
hniksic
b2be7522c7 [svn] Update the license to include the OpenSSL exception. 2002-05-17 19:16:36 -07:00
hniksic
9de4909bea [svn] Update copyright years. 2002-04-11 18:23:23 -07:00
hniksic
8f93191f26 [svn] Convert URLs in <form action=...>.
Published in <sxsit6yf7jo.fsf@florida.arsdigita.de>.
2002-04-11 10:51:45 -07:00
hniksic
0e40fc9a3c [svn] Don't crash on <meta http-equiv=refresh>.
Published in <sxshep1q3je.fsf@florida.arsdigita.de>.
2002-01-31 19:34:31 -08:00
hniksic
ca9319aaba [svn] Treat the "shortcut icon" link as inline.
Published in <sxsadwgko2q.fsf@florida.arsdigita.de>.
2001-12-18 17:15:34 -08:00
hniksic
3ddcea34a4 [svn] Mark <embed href=...> as external. 2001-12-12 23:18:59 -08:00
hniksic
390adeea93 [svn] Use "incomplete" rather than "relative". 2001-12-12 10:32:17 -08:00
hniksic
d425985c37 [svn] Minor doc update. 2001-12-12 09:01:26 -08:00
hniksic
8817f4c1a4 [svn] Break up collect_tags_mapper into several functions.
Published in <sxsg06gla1h.fsf@florida.arsdigita.de>.
2001-12-12 07:43:01 -08:00
hniksic
dd84231c6a [svn] Minor fixes prompted by `lint'.
Published in <sxsadwt2nkg.fsf@florida.arsdigita.de>.
2001-12-08 17:24:41 -08:00
hniksic
936b074bd9 [svn] Doc fix. 2001-12-04 16:02:49 -08:00
hniksic
f4d019a423 [svn] Correctly convert links in <meta http-equiv=Refresh content="...">.
Published in <sxsadx3wp49.fsf@florida.arsdigita.de>.
2001-11-30 20:18:51 -08:00
hniksic
a4db28e20f [svn] Ignore -np when in -p mode.
Published in <sxsg06w2c52.fsf@florida.arsdigita.de>.
2001-11-30 13:17:53 -08:00
hniksic
df05e7ff10 [svn] Handle <base href=...> when converting links.
Published in <sxsadxaae3t.fsf@florida.arsdigita.de>.
2001-11-25 10:40:55 -08:00
hniksic
222e9465b7 [svn] Implemented breadth-first retrieval.
Published in <sxsherjczw2.fsf@florida.arsdigita.de>.
2001-11-24 19:10:34 -08:00
hniksic
f178e6c613 [svn] Clean up handling of schemes.
Published in <sxswv0n7h7s.fsf@florida.arsdigita.de>.
2001-11-18 16:12:05 -08:00
hniksic
3d9dda6485 [svn] Process attributes in order in which they appear in the tag.
Submitted by Ian Abbott in <3B868388.6538.14A7848@localhost>
based on analysis by Edward Sabol.
2001-11-16 11:44:42 -08:00
hniksic
0b056d1720 [svn] Update copyright notices. 2001-05-27 12:35:15 -07:00
hniksic
9ae0328c3d [svn] Applied Roger Beeman's mktime_from_utc fix published in
<Pine.HPX.4.02.10104181128180.6232-100000@mail1.cisco.com>.
Also, minor doc fixes.
2001-04-24 17:50:22 -07:00
hniksic
61bb00adc0 [svn] Various url.c-related changes.
Published in <sxsvgo8nmub.fsf@florida.arsdigita.de>.

* retr.c (retrieve_url): Call uri_merge, not url_concat.
* html-url.c (collect_tags_mapper): Call uri_merge, not
url_concat.
* url.c (mkstruct): Use encode_string instead of xstrdup followed
by URL_CLEANSE.
(path_simplify_with_kludge): Deleted.
(contains_unsafe): Deleted.
(construct): Renamed to uri_merge_1.
(url_concat): Renamed to uri_merge.
* url.c (str_url): Use encode_string instead of the unnecessary
CLEANDUP.
(encode_string_maybe): New function, returns input string if no
encoding is needed.
(encode_string): Call encode_string_maybe to do the dirty work,
xstrdup if no work needed.
* wget.h (XDIGIT_TO_xchar): Define here.
* url.c (decode_string): Use new name.
(encode_string): Ditto.
* http.c (XDIGIT_TO_xchar): Rename HEXD2asc to XDIGIT_TO_xchar.
(dump_hash): Use new name.
* wget.h: Rename ASC2HEXD and HEXD2ASC to XCHAR_TO_XDIGIT and
XDIGIT_TO_XCHAR respectively.
2001-04-13 21:11:35 -07:00
hniksic
1a6058b1ec [svn] Applied Philipp Thomas's safe-ctype patch. Published in
<20010330025159.U21662@jeffreys.suse.de>.
2001-03-30 14:36:59 -08:00
dan
040aae87b5 [svn] html-url.c: A bunch of fixup of `--page-requisites'-related comments to reflect
Hrvoje's changes to my code when transplanting it into this new file, to fix
spelling mistakes, to clarify, etc.
2001-01-09 18:54:52 -08:00
dan
bc5fd29baf [svn] 2001-01-09 Dan Harkless <wget@harkless.org>
* html-url.c: Addition and clarification of comments related to -p.

	* url.c (write_backup_file): Clarified a comment.
	  [Committed this fix separately.]
2001-01-09 18:28:24 -08:00
hniksic
1cddc05edb [svn] Committed memory debugging stuff.
Published in <sxs1yw34pt4.fsf@florida.arsdigita.de>.
2000-11-22 14:15:45 -08:00
hniksic
2ffb47eabf [svn] Committed <sxsbsv854j9.fsf@florida.arsdigita.de>. 2000-11-22 08:58:28 -08:00
hniksic
6e598c81e3 [svn] Committed a bunch of different tweaks of mine.
Published in <sxsr9463wrx.fsf@florida.arsdigita.de>.
2000-11-20 18:06:36 -08:00
hniksic
b0b1c815c1 [svn] A bunch of new features:
- use mmap() to read whole files in core instead of allocating memory
  and read'ing it.

- use a new, more general, HTML parser (html-parse.c) and interface to
  it from Wget (html-url.c).

- respect <meta name=robots content=nofollow> (easy with the new HTML
  parser).

- use hash tables instead of linked lists in places where the lists
  were used to facilitate mappings.

- rewrite the code in host.c to be more readable and faster (hash
  tables instead of home-grown lists.)

- make convert_links properly convert partial URLs to complete ones
  for those URLs that have *not* been downloaded.

- use HTTP persistent connections where available.  very
  simple-minded, caches the last connection to the server.

Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
2000-11-19 12:50:10 -08:00