Commit Graph

635 Commits

Author SHA1 Message Date
Nick Burch
918f1a496d Fix an issue with the HSMF tests working on some machines but not others - Make poifs.filesystem.DirectoryNode preserve the original ordering of its files, which HSMF needs to be able
to correctly match up chunks

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@911878 13f79535-47bb-0310-9956-ffa450edef68
2010-02-19 17:55:32 +00:00
Nick Burch
943d3d19e1 Add a disabled test for bug #44501, which still remains, plus fix a generics warning
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904062 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:28:29 +00:00
Nick Burch
9bbf3ef4d0 Fix generics warnings, and fix up tests to handle the extra bit of text being extracted now
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904060 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:20:32 +00:00
Nick Burch
3aef368b71 Apply patch from Jukka from bug #43670 to improve HDGF v11 Separator detection, and handle short strings better, hopefully solving the Negative length of ChunkHeader issue
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904052 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:05:13 +00:00
Nick Burch
2880d934f9 Improve error message, and fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904049 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:00:38 +00:00
Maxim Valyanskiy
ed3cae95f8 PowerPoint OLEShape: extract last version of embedded ole object
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@901215 13f79535-47bb-0310-9956-ffa450edef68
2010-01-20 14:33:58 +00:00
Nick Burch
545f2e1119 Improved how HSMF handles multiple recipients
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898295 13f79535-47bb-0310-9956-ffa450edef68
2010-01-12 12:02:18 +00:00
Nick Burch
6e97a360a3 Add PublisherTextExtractor support to ExtractorFactory
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897887 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 14:55:43 +00:00
Nick Burch
5621bb0800 Make it possible to return null on missing chunks, rather than the exception
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897847 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 12:19:42 +00:00
Nick Burch
5ad8301c2a Add embeded (attachment) support to the outlook text extractor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897258 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:44:08 +00:00
Nick Burch
98cea49eb5 Rename the outlook extractor to be more consistent with other extractors
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897249 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:18:52 +00:00
Nick Burch
cefe4e1d28 Wire up the new HSMFTextExtactor to the ExtractorFactory
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897246 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:14:27 +00:00
Nick Burch
bd2f63c721 Add a text extractor to HSMF for simpler extraction of text from .msg files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897242 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:04:15 +00:00
Nick Burch
a6e7575999 Fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897239 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:02:35 +00:00
Nick Burch
7ae86fab09 More work on the recipient related chunks, including a helper method to do best-effort finding of the recipients email address
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897213 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 14:26:27 +00:00
Nick Burch
52695c1811 Quick bit of refactoring to save parsing the type and id twice
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897205 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:49:09 +00:00
Nick Burch
ff94e5c61b Support fetching the message date from the submission id
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897201 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:42:53 +00:00
Nick Burch
58806414fc Tweak a few tests, and add in a few more chunk types
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897185 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 12:08:35 +00:00
Nick Burch
795ed3ce55 Complete chunk parser tests, and make more chunk groups available
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897172 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:37:37 +00:00
Nick Burch
0e368a23da Fix some chunk types, fix the directory descent, fix the Msg2txt example, and start on fixing core tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897167 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:14:58 +00:00
Nick Burch
6afb781730 Shuffle where some of the HSMF tests live to better match package names, and stub out a few more tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896923 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 16:47:09 +00:00
Nick Burch
2bb376f55b Start on major HSMF refactoring. Should compile, but not quite all tests pass as a little bit of work is left
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896914 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 16:15:20 +00:00
Nick Burch
e5884f2f66 Add a couple more HSMF chunk types, and use Generics in a few places
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896868 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 12:56:39 +00:00
Yegor Kozlov
a64917fe88 removed svn:eol-style=native from binary files, uncommented related tests; see Bugzilla 48427
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@894173 13f79535-47bb-0310-9956-ffa450edef68
2009-12-28 13:45:10 +00:00
Yegor Kozlov
0b9ab8930e included poi-ooxml-schemas and poi-examples in the release cycle, also misc improvements in build.xml: merged scratchpad and main examples, improved cross-module classpath dependencies, removed obsolete targets
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@882927 13f79535-47bb-0310-9956-ffa450edef68
2009-11-21 15:30:54 +00:00
Yegor Kozlov
e8e2b1ff49 support for text extraction from PPT master slides, see Bugzilla 48161
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@835271 13f79535-47bb-0310-9956-ffa450edef68
2009-11-12 07:07:41 +00:00
Nick Burch
a9d6f34732 Patch from Trejkaz - fix bug #48038 - handle reading HWPF stylesheets from non zero offsets
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@832625 13f79535-47bb-0310-9956-ffa450edef68
2009-11-04 00:07:37 +00:00
Josh Micich
ed1b4794a0 Bugzilla 47962 - Fixed some potential NPEs. Avoided unnecessary creation of box instances. Applied patch with mods
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@823348 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 22:29:41 +00:00
Yegor Kozlov
2ad9b00e9f added Ant target to compile scratchpad examples, added example HSMF application that converts a .msg file to text and extracts attachments, see Bugzilla 47922
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@821500 13f79535-47bb-0310-9956-ffa450edef68
2009-10-04 10:25:53 +00:00
Yegor Kozlov
bd966f208b removed unnecessary svn:executable flag from files in SVN trunk, see bugzilla 47734
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@814335 13f79535-47bb-0310-9956-ffa450edef68
2009-09-13 16:05:28 +00:00
Yegor Kozlov
4841ed0d47 Fixed PPT parser to tolerate Comment2000 containers with missing comment text, see Bugzilla 44770
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@811814 13f79535-47bb-0310-9956-ffa450edef68
2009-09-06 11:35:01 +00:00
Yegor Kozlov
f4f823b94c Support for extraction of header / footer images in HWPF, see Bugzilla 47733
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@809659 13f79535-47bb-0310-9956-ffa450edef68
2009-08-31 16:57:03 +00:00
Yegor Kozlov
728f88bd92 moved all test data to a top-level directory, refactored all junit usages of .testdata.path to one place
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@806959 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 12:34:24 +00:00
Josh Micich
9ca061617c fixing compiler warnings - unused imports, declared exceptions not thrown
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@806789 13f79535-47bb-0310-9956-ffa450edef68
2009-08-22 05:50:32 +00:00
Yegor Kozlov
7e2a5dacee Avoid exception when reading ClipboardData packet in OLE property sets, see bugzilla 45583
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@806172 13f79535-47bb-0310-9956-ffa450edef68
2009-08-20 13:49:33 +00:00
Yegor Kozlov
3316b64495 Centralize logic for finding/opening sample files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@805928 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 18:51:44 +00:00
Josh Micich
9926c89006 Fixed compiler warnings - unnecessary throws declaration
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@805552 13f79535-47bb-0310-9956-ffa450edef68
2009-08-18 19:49:28 +00:00
Josh Micich
c304173145 Fixed compiler warnings - unnecessary else
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@805492 13f79535-47bb-0310-9956-ffa450edef68
2009-08-18 16:50:24 +00:00
Paolo Mottadelli
05c8f883e4 createHeader/Footer methods + tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@805422 13f79535-47bb-0310-9956-ffa450edef68
2009-08-18 14:08:09 +00:00
Josh Micich
c83d6bfa24 Fixing compiler warnings - unnecessary typecasts
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@805284 13f79535-47bb-0310-9956-ffa450edef68
2009-08-18 05:29:53 +00:00
Josh Micich
995d8fe73f Fixed unused imports compiler warnings (and some others)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@805262 13f79535-47bb-0310-9956-ffa450edef68
2009-08-18 02:41:45 +00:00
Yegor Kozlov
307f443c33 WPFWordExtractor does not extract inserted/deleted text, see Bugzilla 45597
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@797740 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 10:41:53 +00:00
Paolo Mottadelli
15e434d684 POIXMLproperties: created, creator, modified, lastprinted, identifier, revision + test
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@796239 13f79535-47bb-0310-9956-ffa450edef68
2009-07-21 10:23:05 +00:00
Paolo Mottadelli
76752c5db7 POIXMLproperties: core properties improvement + test
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@795923 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 17:10:43 +00:00
Yegor Kozlov
77987258b8 fixed WordExtractor to avoid ArrayIndexOutOfBoundsException when encountering empty footnote block
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@795333 13f79535-47bb-0310-9956-ffa450edef68
2009-07-18 10:03:01 +00:00
Yegor Kozlov
656114c69a Support for extraction of endnotes from docx files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@795329 13f79535-47bb-0310-9956-ffa450edef68
2009-07-18 09:28:38 +00:00
Yegor Kozlov
fa31a65d14 Support for extraction of footnotes from docx files, see Bugzilla 45556
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@795328 13f79535-47bb-0310-9956-ffa450edef68
2009-07-18 09:09:59 +00:00
Yegor Kozlov
3304fa1887 Fixed ExtractorFactory to support .xltx and .dotx files, see Bugzilla 47517
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@795327 13f79535-47bb-0310-9956-ffa450edef68
2009-07-18 09:09:11 +00:00
Yegor Kozlov
9c5c51ad29 Support for getting OLE object data in PowerPointExtractor, see Bugzilla 47456
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@791241 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 13:15:41 +00:00
Yegor Kozlov
12806cceaa Support fo text extraction of footnotes, endnotes and comments in HWPF, see Bugzilla 47400
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@788949 13f79535-47bb-0310-9956-ffa450edef68
2009-06-27 10:39:51 +00:00