Commit Graph

49 Commits

Author SHA1 Message Date
Tim Rühsen 100da11312 Fix writing WARC-Target-URI value
src/warc.c: Add function warc_write_header_uri(),
            Use it for creating WARC-Target-URI

Fixes #47281
2016-02-27 23:08:28 +01:00
Tim Rühsen 474935665e Remove redundant definition of _GNU_SOURCE
* src/warc.c: Remove definition of _GNU_SOURCE

_GNU_SOURCE is already defined in config.h
2015-08-07 13:24:14 +02:00
Tim Rühsen c579c7bf1e Check memory allocations in WARC code
* src/warc.c: Remove some memory allocations,
              use xmalloc instead of malloc

Reported-by: Bill Parker <wp02855@gmail.com>
2015-04-17 22:42:59 +02:00
Tim Rühsen 4dde3e200f Add more const usage to function params
* warc.c, warc.h: Add const specifier to several function args
2015-04-17 22:42:59 +02:00
Giuseppe Scrivano 16f1fb1d1f maint: update copyright year ranges to include 2015 2015-03-09 16:32:01 +01:00
Eli Zaretskii 33c5d979ce warc.c: native uuid generation on Windows
* warc.c (windows_uuid_str) [WINDOWS]: New function specific to
MS-Windows.
(warc_uuid_str) [WINDOWS]: If windows_uuid_str succeeds, use its
result; otherwise use the fallback method.
2015-02-23 23:36:02 +01:00
Tim Ruehsen f6b28575cc src/main.c, src/warc.c: Use gnulib's base_name() instead of basename()
Reported-by: Eli Zaretskii <eliz@gnu.org>
2014-12-25 12:07:42 +01:00
Tim Rühsen cfe7589397 gnulib: Use basename() from gnulib module 'dirname'
Avoid basename incompatibilities between POSIX and GNU implementations.
Also, libgen.h isn't needed any more which increases compatibility.
2014-12-21 17:34:14 +01:00
Jérémie Courrèges-Anglas b8c567a3ef Fix use of uuid libc functions (tiny change)
* Always attempt to detect uuid.h and uuid_create().
* Split libuuid and uuid.h implementations of warc_uuid_str(), since
  those APIs vary significantly.
* Correctly use the uuid.h functions
2014-12-17 09:24:31 +01:00
Tim Rühsen e4583ab364 Fix issues reported by static code analysis tool 'parfait'
Closes: #41235
Reported-by: Jiri Kukacka
2014-12-01 20:39:44 +01:00
Tim Ruehsen 4850e9c873 Replaced xfree_null() by xfree() and nullify argument after freeing. 2014-12-01 16:15:37 +01:00
Darshit Shah 3e609a1192 Replace all occurences of free() with xfree() 2014-11-27 11:11:34 +05:30
Tim Rühsen 0c18773308 Add size of buffer to warc_timestamp() 2014-11-24 10:41:08 +01:00
Tim Rühsen 5e2ecaf277 Fix C89 compile errors using ./configure --without-libuuid 2014-11-24 09:44:48 +01:00
Darshit Shah 1b8e54b808 Add extern declaration for program_arsgstring
In test.c, this patch changes the type of program_argstring to char *
from a const char *. This is because, we use a char * string in the rest
of the program and declare an extern accordingly. Removing the const
type helps in keeping the code cleaner at no extra cost.
2014-11-22 17:26:06 +05:30
Darshit Shah c6b750061a Add extern declaration for version.c strings 2014-11-22 17:26:06 +05:30
Tim Rühsen 1356e90a14 Trivial fixes for C89 compliancy 2014-11-20 09:56:57 +01:00
Tim Rühsen 2ece0cc425 Remove 'make check'compiler warnings 2014-11-17 11:28:20 +01:00
Giuseppe Scrivano e2c8436e17 warc: Avoid out-of-scope variable usage 2014-06-29 11:22:44 +02:00
Giuseppe Scrivano 601401da71 warc: Fix UUID generation on FreeBSD 2014-06-24 15:05:19 +02:00
Giuseppe Scrivano 8a919932a6 Do not include strings.h 2014-06-12 18:49:13 +02:00
Giuseppe Scrivano 087e17be1c Do not use exit() with a magic number 2014-06-12 18:48:48 +02:00
Darshit Shah 8624553a31 Whitespace and formatting changes.(Aesthetic only)
This commit makes lots of whitespace only changes. It has been ensured that this
commit does not make any changes to the functioning of the program. The only
changes that have been made are:
    * Remove trailing whitespaces
    * Convert tabs to spaces
    * Fix indentation issues in the code
    * Other aesthetic changes to the formatting of comments
2014-05-30 21:12:57 +05:30
Tim Ruehsen 38a7829dcb Fix compiler warnings 2014-05-12 12:18:50 +02:00
Giuseppe Scrivano 70f7cdf1af Remove some useless if statements 2013-12-29 11:46:04 +01:00
Steven M. Schweda 72b2c58983 warc: Fix some portability issues on VMS. 2013-07-13 13:36:55 +02:00
Bykov Aleksey 5ce9389384 Removing "Could not open temporary WARC manifest file." issue in Win builds 2013-06-04 20:36:42 +02:00
Gijs van Tulder 7df4b05f0b warc: Follow the guidelines for metadata records
Do not use the same UUID for the manifest and arguments records.
Write the manifest as a metadata record, not as a resource.
2013-04-14 14:01:01 +02:00
Gijs van Tulder edbeaa0c4e warc: correctly write the field length in the skip length field 2013-04-06 20:57:47 +02:00
Giuseppe Scrivano 3ebbd84518 warc: remove a magic number 2012-11-26 22:50:06 +01:00
Ángel González 2c1e0326a3 Fix compilation under gcc -std=c89 2012-11-25 15:01:03 +01:00
Gijs van Tulder 359dd16760 warc: fix format string for off_t in CDX function. 2012-11-24 16:06:32 +01:00
Giuseppe Scrivano 1e229375aa warc: fix format string for off_t 2012-11-24 12:27:16 +01:00
Hrvoje Niksic 85b98d0834 Use hash_table_get instead of hash_table_get_pair. 2012-08-26 11:48:07 +02:00
Tim Ruehsen 321b5dce85 * fix a few little dissonances 2012-06-04 22:05:40 +02:00
Gijs van Tulder 1d14c18d7f warc: Fix segfault if CDX record is not found. 2012-06-02 14:26:18 +02:00
Mike Frysinger 639a454528 warc: use the right type for the gzip stream 2012-05-26 14:05:56 +02:00
Giuseppe Scrivano 9cc514d21c Use the right type as result from readline. 2012-05-26 02:55:53 +02:00
Giuseppe Scrivano 8ac9c05fc0 warc: Cut long lines to 80 columns. 2012-05-18 11:28:49 +02:00
Tim Ruehsen 2e86829809 removed 'const' warnings.
* hash.h (hash_table_put): Make argument "value" const.
* hash.c (hash_table_put): Make argument value const.  Cast `value' to
void.
* http.c (request_set_header): Make argument `name' const.  Cast `value'
and `name' to void*.
(request_remove_header): Make argument `name' const.
* url.c (url_file_name): Make `index_filename' static.
* warc.h (warc_write_cdx_record): Make `url', `timestamp', `mime_type',
`payload_digest', `redirect_location', `warc_filename', response_uuid'
arguments const. Make `checksum' const.
* warc.c (warc_write_date_header): Make the `timestamp' argument const.
Make `extension' const.
(warc_write_cdx_record): Make `url', `timestamp', `mime_type',
`payload_digest', `redirect_location', `warc_filename', response_uuid'
arguments const. Make `checksum' const.
2012-05-14 23:20:10 +02:00
Tim Ruehsen 0aa3c5d33c Fix some compiler warnings. 2012-05-05 15:24:35 +02:00
Tim Ruehsen bd4f1e6042 Fix a memory leak. 2012-04-13 21:35:29 +02:00
Tim Ruehsen 2541e0b579 warc: make warc_uuid_str implementation depend on HAVE_LIBUUID 2012-04-01 13:59:46 +02:00
Gijs van Tulder 6a25955fe6 warc: support large files. 2012-02-25 11:58:21 +01:00
Steven Schubiger c60530b369 warc: add license header. 2012-02-17 17:05:26 +01:00
Gijs van Tulder 0a8a898fbe Fix a linker error if zlib is not found. 2012-01-11 15:27:06 +01:00
Gijs van Tulder 1316701791 Fix for gzip bug in WARC + zlib 1.2.4. 2011-11-20 18:28:19 +01:00
Steven Schweda eed850d938 warc: Fix a problem under OS X. 2011-11-05 11:52:51 +01:00
Gijs van Tulder e3820953b2 Add support for WARC files. 2011-11-04 22:25:00 +01:00