Commit Graph

3717 Commits

Author SHA1 Message Date
Nick Burch 6e2c32e1c5 Most of support suggested by Phil Varner on the list - ExtractorFactory can now be told to prefer Event Based extractors (current Excel only) on a per-thread or overall basis
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@902927 13f79535-47bb-0310-9956-ffa450edef68
2010-01-25 19:02:13 +00:00
Yegor Kozlov 7854649016 avoid failures in XLSX2CSV when shared string table is missing, see Bugzilla 48544
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@902566 13f79535-47bb-0310-9956-ffa450edef68
2010-01-24 13:40:58 +00:00
Yegor Kozlov d49e83fe92 properly close all IO streams created in OPCPackage, see Bugzilla 48571
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@902565 13f79535-47bb-0310-9956-ffa450edef68
2010-01-24 13:26:19 +00:00
Yegor Kozlov 76a0ce511b always copy all declared inner classes and interfaces when generating poi-ooxml-schemas, see Bugzilla 48572
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@902563 13f79535-47bb-0310-9956-ffa450edef68
2010-01-24 13:11:46 +00:00
Yegor Kozlov 3e5d8b9cad added mvn-install Ant target equivalent to 'mvn install' in maven-driven projects
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@902561 13f79535-47bb-0310-9956-ffa450edef68
2010-01-24 13:05:48 +00:00
Maxim Valyanskiy ed3cae95f8 PowerPoint OLEShape: extract last version of embedded ole object
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@901215 13f79535-47bb-0310-9956-ffa450edef68
2010-01-20 14:33:58 +00:00
Nick Burch 8f421fc5b0 Add a note about the UnicodeString move from a few days ago
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900816 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 15:20:04 +00:00
Nick Burch b631a6edee Add changelog note
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900747 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 12:06:41 +00:00
Nick Burch 55f1878628 Implement the ExtRst part of a UnicodeString
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900746 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 12:04:14 +00:00
Nick Burch d39c43b4b2 If there's no more data but there ought to be for unicode strings, in a partly corrupt file, give a warning and substitute in empty strings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900745 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 12:03:27 +00:00
Nick Burch bfa3409fc5 Add more documentation, and make the constructor public for testing use
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900744 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 12:02:10 +00:00
Nick Burch 3b7ac92989 Improve unhandled continue record message by saying what it follows
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900743 13f79535-47bb-0310-9956-ffa450edef68
2010-01-19 12:01:13 +00:00
Nick Burch 64ee32ec12 Fix a couple of warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900399 13f79535-47bb-0310-9956-ffa450edef68
2010-01-18 14:11:26 +00:00
Nick Burch 0e53beb61e New debugging class, useful for when figuring out how to split on continue records, where continue records lie etc
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900397 13f79535-47bb-0310-9956-ffa450edef68
2010-01-18 14:07:38 +00:00
Nick Burch 0111e33db5 Tidy up some of the IntMapper/UnicodeString bits with generics
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900376 13f79535-47bb-0310-9956-ffa450edef68
2010-01-18 12:56:18 +00:00
Nick Burch 6605995ac6 records.UnicodeString isn't actually a Record, just a common part that exists in many other records. As such, move it to records.common, which is where all the other record components live
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@900362 13f79535-47bb-0310-9956-ffa450edef68
2010-01-18 12:18:00 +00:00
Maxim Valyanskiy a52cd3e05a bug#48068: RecordFormatException: The content of an excel record cannot exceed 8224 bytes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@899608 13f79535-47bb-0310-9956-ffa450edef68
2010-01-15 12:25:15 +00:00
Yegor Kozlov 4be190d69d avoid creating unnecessary temp files, tighten up usage of temp files in unit tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@899364 13f79535-47bb-0310-9956-ffa450edef68
2010-01-14 19:23:13 +00:00
Maxim Valyanskiy a9e8084bb7 revert previous commit
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@899129 13f79535-47bb-0310-9956-ffa450edef68
2010-01-14 09:22:39 +00:00
Maxim Valyanskiy 2e6ffb4f7e ExtractorFactory: save OOXML stream into temporary file before text extraction - this reduces memory usage and allows temporary file cleanup
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@899123 13f79535-47bb-0310-9956-ffa450edef68
2010-01-14 08:56:35 +00:00
Yegor Kozlov 8b377f2f67 Avoid creating temporary files when opening OPC packages from input stream
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898927 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 20:24:15 +00:00
Nick Burch b88672b637 Add a command line wrapper around ExtractorFactory, mostly aimed at when debugging rather than production
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898810 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 15:52:53 +00:00
Nick Burch 4cb27e5095 Remove some generics warnings from the HPSF custom properties and sections parts
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898804 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 15:42:53 +00:00
Nick Burch 0d7ef148b5 Tweak the generics definition on workbook so that method signatures on hssf and xssf won't need to change
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898769 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 14:25:31 +00:00
Nick Burch b5ccd57cd6 Tweak the iterator section to avoid casts as we have generics, and then ditch the hssf duplicate bit (everyone should be pointed at the ss.usermodel version)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898750 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 13:46:36 +00:00
Nick Burch 1362cf6ba7 Fix a few generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898748 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 13:42:15 +00:00
Nick Burch 545f2e1119 Improved how HSMF handles multiple recipients
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898295 13f79535-47bb-0310-9956-ffa450edef68
2010-01-12 12:02:18 +00:00
Maxim Valyanskiy a85c56df35 add more powerpoint xml mime types
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898292 13f79535-47bb-0310-9956-ffa450edef68
2010-01-12 11:53:35 +00:00
Nick Burch 6e97a360a3 Add PublisherTextExtractor support to ExtractorFactory
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897887 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 14:55:43 +00:00
Nick Burch 55ae8cd33d Add status message for recent change from maxcom
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897886 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 14:54:16 +00:00
Maxim Valyanskiy f1a1f4a6ff XSLF: text extraction from tables
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897875 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 14:27:54 +00:00
Nick Burch 5621bb0800 Make it possible to return null on missing chunks, rather than the exception
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897847 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 12:19:42 +00:00
Nick Burch e605489dc0 Do a manual sync, as per the common skins README, to avoid a build warning
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897830 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 10:39:35 +00:00
Nick Burch 5ad8301c2a Add embeded (attachment) support to the outlook text extractor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897258 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:44:08 +00:00
Nick Burch 98cea49eb5 Rename the outlook extractor to be more consistent with other extractors
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897249 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:18:52 +00:00
Nick Burch cefe4e1d28 Wire up the new HSMFTextExtactor to the ExtractorFactory
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897246 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:14:27 +00:00
Nick Burch bd2f63c721 Add a text extractor to HSMF for simpler extraction of text from .msg files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897242 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:04:15 +00:00
Nick Burch a6e7575999 Fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897239 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:02:35 +00:00
Nick Burch 7ae86fab09 More work on the recipient related chunks, including a helper method to do best-effort finding of the recipients email address
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897213 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 14:26:27 +00:00
Nick Burch 52695c1811 Quick bit of refactoring to save parsing the type and id twice
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897205 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:49:09 +00:00
Nick Burch ff94e5c61b Support fetching the message date from the submission id
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897201 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:42:53 +00:00
Nick Burch 1ae7d3f871 Docs tweaks
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897200 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:42:17 +00:00
Nick Burch 58806414fc Tweak a few tests, and add in a few more chunk types
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897185 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 12:08:35 +00:00
Nick Burch 795ed3ce55 Complete chunk parser tests, and make more chunk groups available
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897172 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:37:37 +00:00
Nick Burch 0e368a23da Fix some chunk types, fix the directory descent, fix the Msg2txt example, and start on fixing core tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897167 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:14:58 +00:00
Nick Burch 6afb781730 Shuffle where some of the HSMF tests live to better match package names, and stub out a few more tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896923 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 16:47:09 +00:00
Nick Burch 2bb376f55b Start on major HSMF refactoring. Should compile, but not quite all tests pass as a little bit of work is left
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896914 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 16:15:20 +00:00
Nick Burch e5884f2f66 Add a couple more HSMF chunk types, and use Generics in a few places
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896868 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 12:56:39 +00:00
Nick Burch dd6750ceb9 Skip warnings we can't avoid
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896866 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 12:50:21 +00:00
Nick Burch 62e8d43b5b Initialise the link type of HSSFHyperLink, so that getType() on it works
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896049 13f79535-47bb-0310-9956-ffa450edef68
2010-01-05 14:08:51 +00:00