1
0
mirror of https://github.com/moparisthebest/wget synced 2024-07-03 16:38:41 -04:00
Commit Graph

289 Commits

Author SHA1 Message Date
hniksic
649c599e0e [svn] Add a dummy Makefile.in. 2000-12-09 17:55:43 -08:00
hniksic
c9ccd27730 [svn] Applied Karl Eichwalder's patches.
<20001125153354.51087DD2DC@tux.gnu.franken.de>
<20001125165332.92862DD2DB@tux.gnu.franken.de>
2000-12-05 15:56:34 -08:00
hniksic
7828e81c79 [svn] Committed C. Frankel's SSL patch. 2000-12-05 15:09:41 -08:00
hniksic
916cb86723 [svn] Determine ALL_LINGUAS dynamically. 2000-11-23 13:41:37 -08:00
hniksic
b0b1c815c1 [svn] A bunch of new features:
- use mmap() to read whole files in core instead of allocating memory
  and read'ing it.

- use a new, more general, HTML parser (html-parse.c) and interface to
  it from Wget (html-url.c).

- respect <meta name=robots content=nofollow> (easy with the new HTML
  parser).

- use hash tables instead of linked lists in places where the lists
  were used to facilitate mappings.

- rewrite the code in host.c to be more readable and faster (hash
  tables instead of home-grown lists.)

- make convert_links properly convert partial URLs to complete ones
  for those URLs that have *not* been downloaded.

- use HTTP persistent connections where available.  very
  simple-minded, caches the last connection to the server.

Published in <sxshf533d5r.fsf@florida.arsdigita.de>.
2000-11-19 12:50:10 -08:00
hniksic
ccf31643ab [svn] vsnprintf() fixup. 2000-11-16 08:37:49 -08:00
hniksic
2097b8beaf [svn] Compile in vsnprintf.c under Windows.
Published by Herold Heiko in
<412FC0AFD62ED31191B40008C7E9A11A2AABF0@srvnt04.previnet.it>.
2000-11-16 01:52:45 -08:00
hniksic
268ac1fc7f [svn] Another contributed fix. 2000-11-01 16:21:53 -08:00
hniksic
29cdc8da20 [svn] Updated long_to_string(); enhanced opt.downloaded to use
64-bit types where available.
Published in <sxswvenqsmn.fsf@florida.arsdigita.de> and
<sxssnpbqshp.fsf@florida.arsdigita.de>.
2000-11-01 13:51:25 -08:00
dan
5f96643297 [svn] NEWS: Hrvoje pointed out that relative URL grokking deserves mention. 2000-10-30 20:24:14 -08:00
dan
24c465b5ad [svn] retr.c (retrieve_url): Manually applied T. Bharath
<TBharath@responsenetworks.com>'s patch to get wget to grok illegal relative URL
redirects.  Reformatted and re-commented it.
2000-10-27 20:18:20 -07:00
dan
4d925ebc24 [svn] NEWS: Forgot to update regarding new --bind-address option. 2000-10-24 15:41:32 -07:00
dan
71994021f3 [svn] TODO: Generalize --html-extension to something like --mime-extension. 2000-10-20 16:20:24 -07:00
dan
d9dd14a995 [svn] * AUTHORS: Added -E to thst list of my stuff.
* TODO: We need to check the HTTP spec w.r.t. simplification of absolute URLs.

* MAILING-LIST: I didn't realize <wget@sunsite.auc.dk> allowed posting by
  non-subscribers.  <bug-wget@gnu.org> soon to be an alias for it.

* NEWS: Always forget to update this file when making user-vis. changes.
2000-10-20 15:29:42 -07:00
dan
6dd2357558 [svn] TODO: -k needs to convert '?' to "%3F" in links to saved files containing the
'?' character (e.g. CGI output).
2000-10-20 14:44:26 -07:00
dan
b77a3437a8 [svn] Added Polish and Russian message translation files. 2000-10-20 00:20:55 -07:00
dan
da17e06a1e [svn] TODO: Make -I and -X allow an optional hostname before the directory name?
When simplifying paths, wget needs to stop at any '?' character.
2000-10-19 23:06:03 -07:00
dan
b3e2c0ff97 [svn] Implemented and documented new -E / --html-extension / html_extension option. 2000-10-19 22:55:46 -07:00
dan
de7c00c095 [svn] TODO: Add option to save local filenames without extra %-encoding. 2000-10-18 23:29:20 -07:00
dan
cbf018d0c0 [svn] --retr-symlinks was not previously documented properly. Based on my newfound
understanding of what its limitations are, added a TODO item.  Also made a minor
tweak in html.c to silence a warning.
2000-10-09 15:43:11 -07:00
dan
2358c437c5 [svn] TODO: Make wget follow (illegal) relative URL HTTP redirects. 2000-09-25 17:42:50 -07:00
dan
737daec8e6 [svn] TODO: Make wget return nonzero in situations like bad HTTP auth. 2000-09-25 15:09:25 -07:00
dan
86ac3fdcbd [svn] * MACHINES: Previously said to send updates to "me" (Hrvoje) -- now says to
email the mailing list or bug-wget@gnu.org.

* MAILING-LIST: Added mention of bug-wget@gnu.org.

* NEWS: Added --waitretry and --page-requisites.
2000-08-30 14:07:40 -07:00
dan
7931200609 [svn] * *.{gmo,po,pot}: Regenerated after modifying wget --help output.
* ftp.c (ftp_retrieve_list): Use new INFINITE_RECURSION #define.

* html.c: htmlfindurl() now takes final `dash_p_leaf_HTML' parameter.
Wrapped some > 80-column lines.  When -p is specified and we're at a
leaf node, do not traverse <A>, <AREA>, or <LINK> tags other than
<LINK REL="stylesheet">.

* html.h (htmlfindurl): Now takes final `dash_p_leaf_HTML' parameter.

* init.c: Added new -p / --page-requisites / page_requisites option.

* main.c (print_help): Clarified that -l inf and -l 0 both allow
infinite recursion.  Changed the unhelpful --mirrior description
to simply give the options it's equivalent to.  Added new -p option.
(main): Added some comments; handle new -p / --page-requisites.

* options.h (struct options): Added new page_requisites field.

* recur.c: Changed "URL-s" to "URLs" and "HTML-s" to "HTMLs".
Calculate and pass down new `dash_p_leaf_HTML' parameter to
get_urls_html().  Use new INFINITE_RECURSION #define.

* retr.c: Changed "URL-s" to "URLs".  get_urls_html() now takes
final `dash_p_leaf_HTML' parameter.

* url.c: get_urls_html() and htmlfindurl() now take final
`dash_p_leaf_HTML' parameter.

* url.h (get_urls_html): Now takes final `dash_p_leaf_HTML' parameter.

* wget.h: Added some comments and new INFINITE_RECURSION #define.

* wget.texi (Recursive Retrieval Options): Documented new -p option.
2000-08-30 04:26:21 -07:00
dan
93146c9432 [svn] wget works on FreeBSD 4.x as well as the previously-mentioned 2.x. Changed
MACHINES entry to have a * in the version location.
2000-08-25 15:36:34 -07:00
dan
9ec2f1d31d [svn] * MACHINES: Alphabetized, changed "architectures" to "OSes and architecures",
added missing company names, removed needless ^L, made AIX entry more general
  to reflect my testing, removed the non-factual "this version of", and fixed
  some grammatical errors.
2000-08-25 15:23:22 -07:00
dan
88c07d546e [svn] TODO: -k should convert "hostless absolute" URLs, like <A HREF="/index.html">.
However, Brian McMahon <bm@iucr.org> wants the old incorrect behavior to still
be available as an option, as he depends on it to allow mirrors of his site to
send CGI queries to his original site, but still get graphics off of the mirror
site.  Perhaps this would be better dealt with by adding an option to tell -k
not to convert certain URLs patterns?
2000-07-21 16:16:10 -07:00
dan
fe387ce432 [svn] TODO: Timestamps are sometimes not copied over on files retrieved by FTP. 2000-05-24 13:29:18 -07:00
dan
6d218bc4ab [svn] TODO: Wget does not currently handle "fragment identifiers" (the part of a URL
starting with the '#' character) properly.
2000-05-22 19:40:09 -07:00
dan
b05feb3ae2 [svn] Damir Dzeko <ddzeko@zesoi.fer.hr> did not document his new --referer option.
Did so (--help output and wget.texi).  Also tweaked --help output for --execute.
2000-05-22 19:29:38 -07:00
dan
7c9156e74a [svn] * AUTHORS: Added myself to this file, as Hrvoje got confirmation of my FSF
copyright assignment.
2000-05-22 16:36:30 -07:00
dan
f7c83b6ee3 [svn] TODO: Make `-k' check for files that were downloaded in the past and convert
links to them in newly-downloaded documents.
2000-05-17 19:19:59 -07:00
dan
e5b8bc39c0 [svn] Forgot to update the ChangeLog when I added my three entries to the TODO
yesterday.
2000-04-06 14:55:02 -07:00
dan
1c083869b0 [svn] Previously forgot to add a note about new --follow-tags and -G / --ignore-tags. 2000-04-04 19:02:08 -07:00
dan
4454f6ce0a [svn] * TODO: Removed done item: we now have an option (-G) that makes it easy to
download a single HTML document and all its constituents.

* po/*.{gmo,po,pot}: Regenerated after adding new options.

* po/hr.po: Hrvoje forgot '\n's on his translations of my altered messages,
causing msgfmt to balk and `make install' to fail.


* wget.texi (Recursive Retrieval Options): In -K description, added a link to
the discussion of interaction with -N.
(Recursive Accept/Reject Options): Did some alphabetizing and added descriptions
of new --follow-tags and -G / --ignore-tags options.
(Following Links): Changed "the loads of" to "loads of".
(Wgetrc Commands): Added descriptions of new follow_tags and ignore_tags
commands.


* html.c (idmatch): Implemented checking of my new --follow-tags and
--ignore-tags options.

* init.c (commands): Added comment reminding people adding new entries doing
allocation to add corresponding freeing in cleanup().
(commands): Added new followtags and ignoretags commands.
(cleanup): Free storage for new followtags and ignoretags.

* main.c: Use of "comma-separated list" was random -- normalized it.  Did some
alphabetization.  Added comments pointing out "Options without arguments" and
"Options accepting an argument" sections of long_options[].  Added new options
--follow-tags and -G / --ignore-tags.  Added comment that Damir's --referer is
currently undocumented.  Added comment that Heiko's --waitretry is partially
undocumented (mentioned in --help but not in wget.texi).  Moved improperly
sorted 24, 129, and 'G' cases.

* options.h (struct options): Added new fields follow_tags and ignore_tags.

* wget.h: Added "#define EQ 0" so we can say "strcmp(a, b) == EQ".
2000-03-10 22:48:06 -08:00
dan
4331c39c9a [svn] Implemented the item I formerly had in the TODO: When -K and -N are used
together, we compare local file X.orig (if extant) against server file X.
Previously -k and -N were worthless in combination because the local converted
files always differed from the server versions.
2000-03-01 22:33:48 -08:00
dan
e0a58713f7 [svn] Upped version number from 1.5.3. to 1.5.3+dev. Because the development source
is available via anonymous CVS and desirable features are being added, it's
quite possible for end-users to be getting their hands on development versions.
They may report bugs, so if we don't change the version number, we'll have to
continually followup the statement "I'm using version 1.5.3" with the question
"The FTP archive or the CVS source?"  Better to just make this development
version have a unique number.  Once we're ready to actually release the next
version, we can up the version from 1.5.3+dev to 1.5.4, or 1.6, or whatever it
turns out to be (depending on how much development gets done).

Also made minor updates (dates, email addresses) to wget.texi.
2000-02-29 16:50:52 -08:00
dan
e5408e7db8 [svn] Implemented new -K / --backup-converted / backup_converted = on option. 2000-02-29 16:17:23 -08:00
kwget
31d6616c48 [svn] Initial revision 1999-12-01 23:42:23 -08:00