Commit Graph

3672 Commits

Author SHA1 Message Date
Nick Burch
3aef368b71 Apply patch from Jukka from bug #43670 to improve HDGF v11 Separator detection, and handle short strings better, hopefully solving the Negative length of ChunkHeader issue
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904052 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:05:13 +00:00
Nick Burch
2880d934f9 Improve error message, and fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904049 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:00:38 +00:00
Nick Burch
81755dc8d2 Add patch from Jukka from bug #48617 + test - Optionally allow the overriding of the Locale used by DataFormatter to control how the default number and date formats should look
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@903303 13f79535-47bb-0310-9956-ffa450edef68
2010-01-26 16:21:17 +00:00
Nick Burch
a2e6cafca9 New event based xssf text extractor (XSSFEventBasedExcelExtractor)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@903182 13f79535-47bb-0310-9956-ffa450edef68
2010-01-26 11:39:44 +00:00
Nick Burch
90700558cd The ReadOnlySharedStringsTable is re-usable for other event based things, so copy out to it's own class
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@903170 13f79535-47bb-0310-9956-ffa450edef68
2010-01-26 11:10:48 +00:00
Nick Burch
6e2c32e1c5 Most of support suggested by Phil Varner on the list - ExtractorFactory can now be told to prefer Event Based extractors (current Excel only) on a per-thread or overall basis
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@902927 13f79535-47bb-0310-9956-ffa450edef68
2010-01-25 19:02:13 +00:00
Yegor Kozlov
7854649016 avoid failures in XLSX2CSV when shared string table is missing, see Bugzilla 48544
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@902566 13f79535-47bb-0310-9956-ffa450edef68
2010-01-24 13:40:58 +00:00
Yegor Kozlov
d49e83fe92 properly close all IO streams created in OPCPackage, see Bugzilla 48571
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@902565 13f79535-47bb-0310-9956-ffa450edef68
2010-01-24 13:26:19 +00:00
Yegor Kozlov
76a0ce511b always copy all declared inner classes and interfaces when generating poi-ooxml-schemas, see Bugzilla 48572
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@902563 13f79535-47bb-0310-9956-ffa450edef68
2010-01-24 13:11:46 +00:00
Yegor Kozlov
3e5d8b9cad added mvn-install Ant target equivalent to 'mvn install' in maven-driven projects
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@902561 13f79535-47bb-0310-9956-ffa450edef68
2010-01-24 13:05:48 +00:00
Maxim Valyanskiy
ed3cae95f8 PowerPoint OLEShape: extract last version of embedded ole object
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@901215 13f79535-47bb-0310-9956-ffa450edef68
2010-01-20 14:33:58 +00:00
Nick Burch
8f421fc5b0 Add a note about the UnicodeString move from a few days ago
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900816 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 15:20:04 +00:00
Nick Burch
b631a6edee Add changelog note
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900747 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 12:06:41 +00:00
Nick Burch
55f1878628 Implement the ExtRst part of a UnicodeString
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900746 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 12:04:14 +00:00
Nick Burch
d39c43b4b2 If there's no more data but there ought to be for unicode strings, in a partly corrupt file, give a warning and substitute in empty strings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900745 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 12:03:27 +00:00
Nick Burch
bfa3409fc5 Add more documentation, and make the constructor public for testing use
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900744 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 12:02:10 +00:00
Nick Burch
3b7ac92989 Improve unhandled continue record message by saying what it follows
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900743 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 12:01:13 +00:00
Nick Burch
64ee32ec12 Fix a couple of warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900399 13f79535-47bb-0310-9956-ffa450edef68
2010-01-18 14:11:26 +00:00
Nick Burch
0e53beb61e New debugging class, useful for when figuring out how to split on continue records, where continue records lie etc
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900397 13f79535-47bb-0310-9956-ffa450edef68
2010-01-18 14:07:38 +00:00
Nick Burch
0111e33db5 Tidy up some of the IntMapper/UnicodeString bits with generics
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900376 13f79535-47bb-0310-9956-ffa450edef68
2010-01-18 12:56:18 +00:00
Nick Burch
6605995ac6 records.UnicodeString isn't actually a Record, just a common part that exists in many other records. As such, move it to records.common, which is where all the other record components live
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900362 13f79535-47bb-0310-9956-ffa450edef68
2010-01-18 12:18:00 +00:00
Maxim Valyanskiy
a52cd3e05a bug#48068: RecordFormatException: The content of an excel record cannot exceed 8224 bytes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@899608 13f79535-47bb-0310-9956-ffa450edef68
2010-01-15 12:25:15 +00:00
Yegor Kozlov
4be190d69d avoid creating unnecessary temp files, tighten up usage of temp files in unit tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@899364 13f79535-47bb-0310-9956-ffa450edef68
2010-01-14 19:23:13 +00:00
Maxim Valyanskiy
a9e8084bb7 revert previous commit
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@899129 13f79535-47bb-0310-9956-ffa450edef68
2010-01-14 09:22:39 +00:00
Maxim Valyanskiy
2e6ffb4f7e ExtractorFactory: save OOXML stream into temporary file before text extraction - this reduces memory usage and allows temporary file cleanup
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@899123 13f79535-47bb-0310-9956-ffa450edef68
2010-01-14 08:56:35 +00:00
Yegor Kozlov
8b377f2f67 Avoid creating temporary files when opening OPC packages from input stream
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898927 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 20:24:15 +00:00
Nick Burch
b88672b637 Add a command line wrapper around ExtractorFactory, mostly aimed at when debugging rather than production
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898810 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 15:52:53 +00:00
Nick Burch
4cb27e5095 Remove some generics warnings from the HPSF custom properties and sections parts
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898804 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 15:42:53 +00:00
Nick Burch
0d7ef148b5 Tweak the generics definition on workbook so that method signatures on hssf and xssf won't need to change
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898769 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 14:25:31 +00:00
Nick Burch
b5ccd57cd6 Tweak the iterator section to avoid casts as we have generics, and then ditch the hssf duplicate bit (everyone should be pointed at the ss.usermodel version)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898750 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 13:46:36 +00:00
Nick Burch
1362cf6ba7 Fix a few generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898748 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 13:42:15 +00:00
Nick Burch
545f2e1119 Improved how HSMF handles multiple recipients
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898295 13f79535-47bb-0310-9956-ffa450edef68
2010-01-12 12:02:18 +00:00
Maxim Valyanskiy
a85c56df35 add more powerpoint xml mime types
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898292 13f79535-47bb-0310-9956-ffa450edef68
2010-01-12 11:53:35 +00:00
Nick Burch
6e97a360a3 Add PublisherTextExtractor support to ExtractorFactory
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897887 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 14:55:43 +00:00
Nick Burch
55ae8cd33d Add status message for recent change from maxcom
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897886 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 14:54:16 +00:00
Maxim Valyanskiy
f1a1f4a6ff XSLF: text extraction from tables
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897875 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 14:27:54 +00:00
Nick Burch
5621bb0800 Make it possible to return null on missing chunks, rather than the exception
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897847 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 12:19:42 +00:00
Nick Burch
e605489dc0 Do a manual sync, as per the common skins README, to avoid a build warning
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897830 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 10:39:35 +00:00
Nick Burch
5ad8301c2a Add embeded (attachment) support to the outlook text extractor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897258 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:44:08 +00:00
Nick Burch
98cea49eb5 Rename the outlook extractor to be more consistent with other extractors
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897249 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:18:52 +00:00
Nick Burch
cefe4e1d28 Wire up the new HSMFTextExtactor to the ExtractorFactory
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897246 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:14:27 +00:00
Nick Burch
bd2f63c721 Add a text extractor to HSMF for simpler extraction of text from .msg files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897242 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:04:15 +00:00
Nick Burch
a6e7575999 Fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897239 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:02:35 +00:00
Nick Burch
7ae86fab09 More work on the recipient related chunks, including a helper method to do best-effort finding of the recipients email address
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897213 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 14:26:27 +00:00
Nick Burch
52695c1811 Quick bit of refactoring to save parsing the type and id twice
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897205 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:49:09 +00:00
Nick Burch
ff94e5c61b Support fetching the message date from the submission id
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897201 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:42:53 +00:00
Nick Burch
1ae7d3f871 Docs tweaks
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897200 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:42:17 +00:00
Nick Burch
58806414fc Tweak a few tests, and add in a few more chunk types
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897185 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 12:08:35 +00:00
Nick Burch
795ed3ce55 Complete chunk parser tests, and make more chunk groups available
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897172 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:37:37 +00:00
Nick Burch
0e368a23da Fix some chunk types, fix the directory descent, fix the Msg2txt example, and start on fixing core tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897167 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:14:58 +00:00