Commit Graph

528 Commits

Author SHA1 Message Date
Sergey Vladimirov
05c7df2042 introduce DocumentPart enum and simplify fields access API
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1148280 13f79535-47bb-0310-9956-ffa450edef68
2011-07-19 11:49:48 +00:00
Sergey Vladimirov
92e5199a95 compact HTML output of WordToHtmlConverter
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1148269 13f79535-47bb-0310-9956-ffa450edef68
2011-07-19 10:51:23 +00:00
Sergey Vladimirov
8e4853b55e fix test case name for TestExcelToHtmlConverterSuite
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1148039 13f79535-47bb-0310-9956-ffa450edef68
2011-07-18 20:27:11 +00:00
Sergey Vladimirov
9f7257440b fix Bug 51524 -- PapBinTable constructor is slow (regression)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1148002 13f79535-47bb-0310-9956-ffa450edef68
2011-07-18 18:44:03 +00:00
Sergey Vladimirov
6815f67967 introduce picture descriptor structure (internal), now Picture class extends it;
add test case with correctly cropped picture info

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1147450 13f79535-47bb-0310-9956-ffa450edef68
2011-07-16 16:19:49 +00:00
Sergey Vladimirov
66adcb6351 update AllHWPFTests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1147400 13f79535-47bb-0310-9956-ffa450edef68
2011-07-16 10:19:13 +00:00
Sergey Vladimirov
8a5f656a67 update AllHWPFTests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1147399 13f79535-47bb-0310-9956-ffa450edef68
2011-07-16 10:17:35 +00:00
Sergey Vladimirov
3568e18a60 fix fields processing, improve test case
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1147082 13f79535-47bb-0310-9956-ffa450edef68
2011-07-15 10:52:11 +00:00
Sergey Vladimirov
0a631075ec update TestRangeProperties
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145542 13f79535-47bb-0310-9956-ffa450edef68
2011-07-12 11:04:43 +00:00
Sergey Vladimirov
920e0ee72f convert unicode string constants to \uxxx-s
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145495 13f79535-47bb-0310-9956-ffa450edef68
2011-07-12 08:57:05 +00:00
Sergey Vladimirov
70fd79b2ec add more detailed message about missing substring
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145494 13f79535-47bb-0310-9956-ffa450edef68
2011-07-12 08:52:12 +00:00
Sergey Vladimirov
75090630ff add initial support for fast-saved files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145410 13f79535-47bb-0310-9956-ffa450edef68
2011-07-12 00:40:27 +00:00
Sergey Vladimirov
098cc9fc4f rewrite PAPX / CHPX loading, allowing to read complex files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145342 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 20:49:41 +00:00
Sergey Vladimirov
4a0d32fa98 rename MBD001D0B89.doc to Bug48075.doc
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145288 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 18:45:46 +00:00
Sergey Vladimirov
8430800f64 SEPX uses chars as boundaries coordinates, don't do double conversions
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145286 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 18:41:53 +00:00
Sergey Vladimirov
26c1fa750d push boundaries checks down, removing deprecation warnings, remove (unused) cpMin (Word XP) hack
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145075 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 08:57:42 +00:00
Sergey Vladimirov
365134f1bd remove unused parameters
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144923 13f79535-47bb-0310-9956-ffa450edef68
2011-07-10 18:04:17 +00:00
Sergey Vladimirov
6b96874866 shortcut for test case to allow loading bad structure of CHP
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144691 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 16:10:57 +00:00
Sergey Vladimirov
b95ff1310d make sure at least English text is extracted
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144684 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 15:36:05 +00:00
Sergey Vladimirov
f23c8a021a temporary(?) workaround for Bug 49933 - Word 6/95 documents with sections cause ArrayIndexOutOfBoundsException
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144683 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 15:33:41 +00:00
Sergey Vladimirov
c2d4b6babe sort PAPX and SEPX after loading
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144679 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 14:38:42 +00:00
Sergey Vladimirov
d07d8ec7f7 fix the same problem as with PAPX -- CHPX shall be stored based on text positions, not on bytes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144638 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 11:08:45 +00:00
Sergey Vladimirov
a496e6c0fa one of problems linked to 47286 seems to be found -- some CHPX were not saved
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144352 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 15:08:02 +00:00
Sergey Vladimirov
026ef97694 add test case to check if email from Bug47286.doc is present in HTML
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144341 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 14:37:42 +00:00
Sergey Vladimirov
63c571ca1f add user-friendly way to access field properties if char is a beginning of field
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144336 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 14:32:20 +00:00
Sergey Vladimirov
65f24a120b save document fields, but Bug47286 looks better, but still not the save
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144275 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 11:54:51 +00:00
Sergey Vladimirov
2abc7aa436 update TestFieldsTables
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144251 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 10:54:19 +00:00
Sergey Vladimirov
b5de84932e fix test for 47286
now it will fail if bug is fixed

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144235 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 10:04:03 +00:00
Sergey Vladimirov
18b0247ee2 Test case shall not fail
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144202 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 09:06:33 +00:00
Sergey Vladimirov
eb1dd34245 fix TestPAPBinTable
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144177 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 06:53:14 +00:00
Sergey Vladimirov
beabf33a84 already fixed 48065 - Problems with save output of HWPF (losing formatting)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143809 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 13:13:04 +00:00
Sergey Vladimirov
dd2005da09 fix 47563 - Exception when working with table
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143802 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 12:52:57 +00:00
Sergey Vladimirov
d6d163fd96 fix 47287 - StringIndexOutOfBoundsException in CharacterRun.replaceText()
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143786 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 12:11:53 +00:00
Sergey Vladimirov
f19392c371 Bug 46817 - Regression: Text from some table cells missing is fixed
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143772 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 11:41:53 +00:00
Sergey Vladimirov
9a5143cf4f fail() shall throw Error, not AssertionError (which is ignored)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143771 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 11:39:30 +00:00
Sergey Vladimirov
65787fc1b9 update test: according to file dump, last row char is 85, so end offset is 86, not 87.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143725 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 09:25:34 +00:00
Sergey Vladimirov
25f3636438 correctly detect cells of inner tables, do not include last "fake" cell in row
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143707 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 08:24:55 +00:00
Sergey Vladimirov
f0c8d0520c fix package: org.apache.poi.hssf.usermodel.converter -> org.apache.poi.hssf.converter
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143689 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 07:33:20 +00:00
Sergey Vladimirov
67ff6e3513 output document properties to html and pdf
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143314 13f79535-47bb-0310-9956-ffa450edef68
2011-07-06 09:37:32 +00:00
Sergey Vladimirov
afa392e8c8 add test case for lost cell properties
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143077 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 14:35:36 +00:00
Sergey Vladimirov
8ac3172759 rewrite table bounds detection for Word 97, including inner table support
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143070 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 14:13:27 +00:00
Sergey Vladimirov
320a3ab737 fix range detection for documents with section across document parts.
Bug46817.doc is the example of document where section contains all document parts - including main part and additional from textbox. Previous implemenetation didn't correctly calculate last paragraph for Section (intersection of SEPX and main document range). Test case is added.

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143014 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 12:02:02 +00:00
Sergey Vladimirov
0e6e9f649c manually test Bug33519.doc
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142882 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 03:04:18 +00:00
Sergey Vladimirov
efd149c834 fix inner tables handling by Range.getTable() method
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142877 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 02:17:54 +00:00
Sergey Vladimirov
65881d12ea fix test suite name (for better IDEs compatibility)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142875 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:53:06 +00:00
Sergey Vladimirov
02672056d8 simplify table converting; fix bug that prevents table from AIOOB-Tap.doc to be converted
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142872 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:44:29 +00:00
Sergey Vladimirov
9538734d31 Test correct processing of "sprmPItap" (0x6649) and "sprmPFInTable" (0x2416)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142867 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:19:31 +00:00
Sergey Vladimirov
bc0abec2c4 add additional TableProperties; implement TableAutoformatLookSpecifier
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142827 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 22:21:10 +00:00
Sergey Vladimirov
ffee3c2c50 add Excel-to-HTML converter (with test suite)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142780 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:49:13 +00:00
Sergey Vladimirov
70d42bbd9e rename extractor -> converter and move to converter package
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142767 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:14:44 +00:00
Sergey Vladimirov
4fbd693851 add Word-to-HTML extractor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142765 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:08:06 +00:00
Yegor Kozlov
e4cd5a5b00 cleanup sxxsf temp files after tests, remove non-ascii characters from source code, polished test output
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1139518 13f79535-47bb-0310-9956-ffa450edef68
2011-06-25 10:56:46 +00:00
Nick Burch
23077dfb49 Add a unit test to show that bug #49648 no longer occurs
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1139245 13f79535-47bb-0310-9956-ffa450edef68
2011-06-24 11:18:49 +00:00
Yegor Kozlov
bc467bb8c1 resolved old bugzilla issues, added unit tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1139204 13f79535-47bb-0310-9956-ffa450edef68
2011-06-24 08:46:37 +00:00
Yegor Kozlov
aac4cf50a9 Bug 51351 - more progress with WordToFoExtractor: fix ListEntryNoListTable and MBD001D0B89 tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1138836 13f79535-47bb-0310-9956-ffa450edef68
2011-06-23 11:28:31 +00:00
Yegor Kozlov
6e90e3bc06 bug 51351: more progress with WordToFoExtractor: support for hyperlinks, common fields and code cleanup
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1137673 13f79535-47bb-0310-9956-ffa450edef68
2011-06-20 15:56:28 +00:00
Yegor Kozlov
2761d5a900 junit test for sprm, added missing line from r1137143
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1137538 13f79535-47bb-0310-9956-ffa450edef68
2011-06-20 08:01:23 +00:00
Yegor Kozlov
07253cbc8b Bugzilla 51335: Parse picture goal and crop sizes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1134663 13f79535-47bb-0310-9956-ffa450edef68
2011-06-11 14:58:50 +00:00
Nick Burch
9f40e3df71 Improve HSMF MAPIMessage access to the HTML and RTF versions of the message body (where available)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087782 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 16:01:29 +00:00
Nick Burch
002d0a5995 Add new method to HSMF of MAPIMessage.has7BitEncodingStrings() to make it easier to decide when encoding guessing is needed
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087746 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 15:20:07 +00:00
Nick Burch
a0b9fb350b Update OutlookTextExtractor to request 7 bit encoding guessing
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087734 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 15:02:14 +00:00
Nick Burch
43d5e715d3 Improve HSMF encoding guessing for 7 bit fields, and allow HSMF access to the HTML body contents in MAPIMessage
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087726 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 14:51:45 +00:00
Nick Burch
e43a1777ed Fix HPBF generics warnings, and add a NPOIFS check to the HPBF tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1085495 13f79535-47bb-0310-9956-ffa450edef68
2011-03-25 18:03:18 +00:00
Nick Burch
535e69dea4 Fix generics warnings, and make the NPOIFS and deprecated constructor parts of Publisher clearer and consistent with other parts
Also fix a test filename


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1085443 13f79535-47bb-0310-9956-ffa450edef68
2011-03-25 16:06:11 +00:00
Nick Burch
a638f0a9ad Strip trailing padding from HMEF compressed rtf when decoding
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1081414 13f79535-47bb-0310-9956-ffa450edef68
2011-03-14 15:11:03 +00:00
Yegor Kozlov
865efc9a8c support for getting HWPFDocument fields, see Bugzilla 50313
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1081291 13f79535-47bb-0310-9956-ffa450edef68
2011-03-14 09:10:12 +00:00
Nick Burch
20eea4a872 Refactor the HMEF contents checks to use a superclass, and stub out the Rtf Message body tests (disabled as there looks to be a padding issue still to solve)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1078304 13f79535-47bb-0310-9956-ffa450edef68
2011-03-05 15:37:24 +00:00
Nick Burch
0df8c11c4b Two more differences between the LZW in HDGF and HMEF:
* Little Endian vs Big Endian storage of the code position
 * Initial dictionary position is the end of pre-fill, if there is one, rather than always being position 0


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1078300 13f79535-47bb-0310-9956-ffa450edef68
2011-03-05 15:25:39 +00:00
Nick Burch
f050cf6eeb Setting the locale doesn't seem to set the timezone, so do that explicitly to ensure the tests behave for everyone
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1077874 13f79535-47bb-0310-9956-ffa450edef68
2011-03-04 11:06:59 +00:00
Nick Burch
bbfa85f807 Tweak the attribute getter names on Attachments, and add MAPI+TNEF Attribute unit tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076685 13f79535-47bb-0310-9956-ffa450edef68
2011-03-03 17:06:37 +00:00
Nick Burch
efe5a63429 Add a TNEF extractor class, which can be run from the command line to extract out the message body and attachments
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076607 13f79535-47bb-0310-9956-ffa450edef68
2011-03-03 13:01:07 +00:00
Nick Burch
aa1963cd3b Improve HMEF handling of typed attributes (Strings and Dates), for both TNEF and MAPI attributes, and use this to allow easier access to common file parts. Then use this in the attachment unit tests.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076603 13f79535-47bb-0310-9956-ffa450edef68
2011-03-03 12:41:39 +00:00
Nick Burch
145de166a1 Switch HMEF attributes to a factory scheme for creation, then add subtypes for cleaner code. Also adds a few more tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076310 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 17:52:12 +00:00
Nick Burch
20f6e280a5 Refactor the TNEF Attributes to better match the MAPI ones, and move both into their own package
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076251 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 15:25:35 +00:00
Nick Burch
649df30b76 Start on unit tests for HMEF. Quite a bit is still stubbed out, and it shows that the LZW isn't quite right yet (so tests disabled)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1075955 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 18:12:44 +00:00
Maxim Valyanskiy
40dbadc49e hwpf: ignore null-reference to parent stylesheet (bug#50688)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1065535 13f79535-47bb-0310-9956-ffa450edef68
2011-01-31 09:27:44 +00:00
Nick Burch
9d2b9caa33 Add support for custom MAPI Properties (0x8000 and above, plus unknown lower ones)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058262 13f79535-47bb-0310-9956-ffa450edef68
2011-01-12 18:14:49 +00:00
Nick Burch
4358a18407 MAPIProperty is a better fit name for HSMF, so change the name before it gets too widely used
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058116 13f79535-47bb-0310-9956-ffa450edef68
2011-01-12 12:45:51 +00:00
Nick Burch
babc3f173e More converting of HSMF code to use strongly type attributes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1057712 13f79535-47bb-0310-9956-ffa450edef68
2011-01-11 16:11:23 +00:00
Nick Burch
d9197ba3a4 Start converting HSMF code to use the new strongly type MAPIAttribute class for lookups, and add a dev class for listing the defined attributes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1057698 13f79535-47bb-0310-9956-ffa450edef68
2011-01-11 15:35:29 +00:00
Nick Burch
2d583f4e3f Tweak HSLF and HWPF to work well with NPOIFS, and add unit tests for this
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1054191 13f79535-47bb-0310-9956-ffa450edef68
2011-01-01 05:42:36 +00:00
Nick Burch
f2b541e026 Since a DirectoryNode has a reference to the underlying POIFSFileSystem, tidy up the POIDocument constructor to not need both passing in
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1053521 13f79535-47bb-0310-9956-ffa450edef68
2010-12-29 03:19:46 +00:00
Nick Burch
d2583d83d5 Refactor the common LZW decompression code out into utils
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1051377 13f79535-47bb-0310-9956-ffa450edef68
2010-12-21 05:18:34 +00:00
Nick Burch
cbf86ed0bc Improve the HDGF LZW code.
Some tweaks to the decompression, and more tests, but mostly work on the compression side. We can now compress small streams properly, and these round-trip fine. However, some longer streams don't compress correctly, and more work on that is still needed (see the disabled unit test)


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1049805 13f79535-47bb-0310-9956-ffa450edef68
2010-12-16 07:41:41 +00:00
Nick Burch
e0ec632d69 Slightly expand tests, inspired by bug #50060, but ultimately show there is a deeper issue outstanding
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1026399 13f79535-47bb-0310-9956-ffa450edef68
2010-10-22 16:41:02 +00:00
Yegor Kozlov
1715a0deaf removed svn:executable from project files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1022458 13f79535-47bb-0310-9956-ffa450edef68
2010-10-14 10:34:59 +00:00
Yegor Kozlov
76470fed4e avoid NPE in ListLevel.getNumberText() when numberText is null, see Bugzilla 50075
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1022456 13f79535-47bb-0310-9956-ffa450edef68
2010-10-14 10:30:29 +00:00
Yegor Kozlov
d6307ad1aa support for BorderCode in HWPF, see Bugzilla 49919
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1005447 13f79535-47bb-0310-9956-ffa450edef68
2010-10-07 13:55:46 +00:00
Yegor Kozlov
b53e0ba59c support for processing of symbols in HWPF, see Bugzilla 49908
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1005443 13f79535-47bb-0310-9956-ffa450edef68
2010-10-07 13:40:58 +00:00
Nick Burch
a018a145bf Test that shows we handle word floating and fixed pictures properly
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1002123 13f79535-47bb-0310-9956-ffa450edef68
2010-09-28 11:46:22 +00:00
Maxim Valyanskiy
6d03a80070 revert previous commit because it causes problems with other old .docs
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1001726 13f79535-47bb-0310-9956-ffa450edef68
2010-09-27 13:08:57 +00:00
Maxim Valyanskiy
286b682cdc hwpt: add more 2 bytes to OldSectionTable to solve ArrayIndexOutOfBoundsException
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1001712 13f79535-47bb-0310-9956-ffa450edef68
2010-09-27 12:50:36 +00:00
Nick Burch
cc419a3a85 Work inspired by bug #48018 - get HWPF lists more consistent in read vs write, and preserve order as apparently that matters. Includes a fair number of list related unit tests, but not for everything
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998943 13f79535-47bb-0310-9956-ffa450edef68
2010-09-20 14:26:49 +00:00
Nick Burch
ca0fc0ec1e Apply patch from bug #49820 - Fix HWPF paragraph levels, so that outline levels can be properly fetched
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998897 13f79535-47bb-0310-9956-ffa450edef68
2010-09-20 11:45:53 +00:00
Nick Burch
50b44decce Add test that shows that bug #45877 has already been fixed
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998634 13f79535-47bb-0310-9956-ffa450edef68
2010-09-19 11:52:20 +00:00
Nick Burch
8043f38bd7 More fixes for bug #49933, workaround the fact that some word6/word95 SEPX entries are compressed differently, and we don't have the specs for how they're stored
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998621 13f79535-47bb-0310-9956-ffa450edef68
2010-09-19 09:59:10 +00:00
Nick Burch
100095bcb9 Fix bug #49936 - Handle HWPF documents with problematic HeaderStories better
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998146 13f79535-47bb-0310-9956-ffa450edef68
2010-09-17 14:14:19 +00:00
Nick Burch
c07e529c03 Fix support for sections in old word 6 / word 95 files
Improve unit testing for HWPFOldDocument
Sprm fix also improves some HWPFDocument files too!


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998131 13f79535-47bb-0310-9956-ffa450edef68
2010-09-17 13:46:11 +00:00
Nick Burch
0e504bd9ca Add a few more test asserts
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@997751 13f79535-47bb-0310-9956-ffa450edef68
2010-09-16 13:45:54 +00:00
Nick Burch
9647ab8976 Add some more paragraph property documentation and tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@997413 13f79535-47bb-0310-9956-ffa450edef68
2010-09-15 17:38:50 +00:00
Nick Burch
216bcfea03 Tweak hssf and hwpf image mime types for wmf/emf to match those used by XSSF/XWPF
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995943 13f79535-47bb-0310-9956-ffa450edef68
2010-09-10 19:15:16 +00:00
Nick Burch
96b0aea310 Improve documentation of some of the HWPF picture stuff, and add unit tests for images of embeded documents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995807 13f79535-47bb-0310-9956-ffa450edef68
2010-09-10 14:37:45 +00:00
Nick Burch
2b9a63d38d Add getMimeType() method to HWPF Picture, and fix some generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995464 13f79535-47bb-0310-9956-ffa450edef68
2010-09-09 15:30:55 +00:00
Nick Burch
dac67e887e Fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995445 13f79535-47bb-0310-9956-ffa450edef68
2010-09-09 14:28:14 +00:00
Nick Burch
514e6be1fe Fix bug #47990 - Support for .msg attachments within a MAPIMessage .msg
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@982331 13f79535-47bb-0310-9956-ffa450edef68
2010-08-04 17:08:39 +00:00
Maxim Valyanskiy
e52778847c hwpf: append any character data before paragraphs to first paragraph
(workaround for bug#48075)


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@982238 13f79535-47bb-0310-9956-ffa450edef68
2010-08-04 12:43:58 +00:00
Nick Burch
ce0fea767f Fix bug #49441 - Allow overriding and guessing of HSMF non-unicode string encodings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@981947 13f79535-47bb-0310-9956-ffa450edef68
2010-08-03 16:06:21 +00:00
Maxim Valyanskiy
c21d04a68b wpf: More CharactedRun mapping fixes for fast-saved documents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@979618 13f79535-47bb-0310-9956-ffa450edef68
2010-07-27 09:38:54 +00:00
Maxim Valyanskiy
8fab18e37a More fixes for auto-saved documents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@979286 13f79535-47bb-0310-9956-ffa450edef68
2010-07-26 14:04:27 +00:00
Nick Burch
bea0801c24 Patch from Jens Gotze from bug #49581 - Ability to add, modify and remove series from HSSF Charts
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@964855 13f79535-47bb-0310-9956-ffa450edef68
2010-07-16 16:46:42 +00:00
Maxim Valyanskiy
8661c09c35 HWPF: test auto-saved document
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@961342 13f79535-47bb-0310-9956-ffa450edef68
2010-07-07 10:41:17 +00:00
Nick Burch
256e73d16d More Word 6 / Word 95 Support
HWPFOldDocument now processes a few more table sections, and so we can fake up some
 basic Ranges. This allows us to do paragraph level text extraction


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960102 13f79535-47bb-0310-9956-ffa450edef68
2010-07-02 20:59:30 +00:00
Nick Burch
30848a80aa Basic text extraction support for old Word 6 and Word 95 documents via some HWPF extensions
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@959346 13f79535-47bb-0310-9956-ffa450edef68
2010-06-30 15:13:10 +00:00
Nick Burch
ad33151624 Better handling of Outlook messages in HSMF when there's no recipient email address
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@954476 13f79535-47bb-0310-9956-ffa450edef68
2010-06-14 13:47:22 +00:00
Nick Burch
05ddf6a51e Fix for bug #48245 - tweak HWPF table cell detection to work across more files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@953694 13f79535-47bb-0310-9956-ffa450edef68
2010-06-11 13:29:44 +00:00
Nick Burch
bf4e6ff464 Add additional RevisionMarkAuthorTable test
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@953343 13f79535-47bb-0310-9956-ffa450edef68
2010-06-10 15:02:05 +00:00
Nick Burch
d29d1d7d9b Apply with tweaks the patch from bug #45269 - improve replaceText on HWPF ranges
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951498 13f79535-47bb-0310-9956-ffa450edef68
2010-06-04 17:19:31 +00:00
Nick Burch
45c4b6bf8f Tweak @link reference to avoid compiler issues
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951055 13f79535-47bb-0310-9956-ffa450edef68
2010-06-03 16:23:40 +00:00
Nick Burch
f9fa636e6d Remove un-used imports
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951053 13f79535-47bb-0310-9956-ffa450edef68
2010-06-03 16:21:41 +00:00
Nick Burch
65d7431a9f Parse the HSMF headers chunk if present, and use it to find Dates in text extraction if needed
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951034 13f79535-47bb-0310-9956-ffa450edef68
2010-06-03 15:33:54 +00:00
Nick Burch
cee16bc83b List attachment names in the output of OutlookTextExtractor (to get attachment contents, use ExtractorFactory as normal)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@950595 13f79535-47bb-0310-9956-ffa450edef68
2010-06-02 15:24:11 +00:00
Yegor Kozlov
6ee427ddf9 fixed construction of the DIB picture header, see Bugzilla 43161
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@949483 13f79535-47bb-0310-9956-ffa450edef68
2010-05-30 06:56:32 +00:00
Nick Burch
6666c539da Add a simple testcase for the new RevisionMarkAuthorTable.java
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@948445 13f79535-47bb-0310-9956-ffa450edef68
2010-05-26 14:22:49 +00:00
Yegor Kozlov
fe048df54e Fixed locale-sensitive formatters in PackagePropertiesPart, see Bugzilla 49138
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@935896 13f79535-47bb-0310-9956-ffa450edef68
2010-04-20 12:57:27 +00:00
Yegor Kozlov
639bf94c6f propagate parent to parent-aware records decoded from Escher, also ensure that TextShape and EscherTextboxWrapper hold the same cached sets of records, see Bugzilla 48916
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@930525 13f79535-47bb-0310-9956-ffa450edef68
2010-04-03 14:44:39 +00:00
Nick Burch
918f1a496d Fix an issue with the HSMF tests working on some machines but not others - Make poifs.filesystem.DirectoryNode preserve the original ordering of its files, which HSMF needs to be able
to correctly match up chunks

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@911878 13f79535-47bb-0310-9956-ffa450edef68
2010-02-19 17:55:32 +00:00
Nick Burch
943d3d19e1 Add a disabled test for bug #44501, which still remains, plus fix a generics warning
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904062 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:28:29 +00:00
Nick Burch
9bbf3ef4d0 Fix generics warnings, and fix up tests to handle the extra bit of text being extracted now
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904060 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:20:32 +00:00
Nick Burch
3aef368b71 Apply patch from Jukka from bug #43670 to improve HDGF v11 Separator detection, and handle short strings better, hopefully solving the Negative length of ChunkHeader issue
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904052 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:05:13 +00:00
Nick Burch
545f2e1119 Improved how HSMF handles multiple recipients
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898295 13f79535-47bb-0310-9956-ffa450edef68
2010-01-12 12:02:18 +00:00
Nick Burch
5621bb0800 Make it possible to return null on missing chunks, rather than the exception
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897847 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 12:19:42 +00:00
Nick Burch
98cea49eb5 Rename the outlook extractor to be more consistent with other extractors
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897249 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:18:52 +00:00
Nick Burch
bd2f63c721 Add a text extractor to HSMF for simpler extraction of text from .msg files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897242 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:04:15 +00:00
Nick Burch
7ae86fab09 More work on the recipient related chunks, including a helper method to do best-effort finding of the recipients email address
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897213 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 14:26:27 +00:00
Nick Burch
52695c1811 Quick bit of refactoring to save parsing the type and id twice
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897205 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:49:09 +00:00
Nick Burch
ff94e5c61b Support fetching the message date from the submission id
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897201 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:42:53 +00:00
Nick Burch
58806414fc Tweak a few tests, and add in a few more chunk types
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897185 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 12:08:35 +00:00
Nick Burch
795ed3ce55 Complete chunk parser tests, and make more chunk groups available
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897172 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:37:37 +00:00
Nick Burch
0e368a23da Fix some chunk types, fix the directory descent, fix the Msg2txt example, and start on fixing core tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897167 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:14:58 +00:00
Nick Burch
6afb781730 Shuffle where some of the HSMF tests live to better match package names, and stub out a few more tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896923 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 16:47:09 +00:00
Nick Burch
2bb376f55b Start on major HSMF refactoring. Should compile, but not quite all tests pass as a little bit of work is left
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896914 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 16:15:20 +00:00
Yegor Kozlov
a64917fe88 removed svn:eol-style=native from binary files, uncommented related tests; see Bugzilla 48427
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@894173 13f79535-47bb-0310-9956-ffa450edef68
2009-12-28 13:45:10 +00:00
Yegor Kozlov
e8e2b1ff49 support for text extraction from PPT master slides, see Bugzilla 48161
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@835271 13f79535-47bb-0310-9956-ffa450edef68
2009-11-12 07:07:41 +00:00
Nick Burch
a9d6f34732 Patch from Trejkaz - fix bug #48038 - handle reading HWPF stylesheets from non zero offsets
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@832625 13f79535-47bb-0310-9956-ffa450edef68
2009-11-04 00:07:37 +00:00
Josh Micich
ed1b4794a0 Bugzilla 47962 - Fixed some potential NPEs. Avoided unnecessary creation of box instances. Applied patch with mods
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@823348 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 22:29:41 +00:00
Yegor Kozlov
bd966f208b removed unnecessary svn:executable flag from files in SVN trunk, see bugzilla 47734
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@814335 13f79535-47bb-0310-9956-ffa450edef68
2009-09-13 16:05:28 +00:00
Yegor Kozlov
4841ed0d47 Fixed PPT parser to tolerate Comment2000 containers with missing comment text, see Bugzilla 44770
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@811814 13f79535-47bb-0310-9956-ffa450edef68
2009-09-06 11:35:01 +00:00
Yegor Kozlov
f4f823b94c Support for extraction of header / footer images in HWPF, see Bugzilla 47733
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@809659 13f79535-47bb-0310-9956-ffa450edef68
2009-08-31 16:57:03 +00:00
Yegor Kozlov
728f88bd92 moved all test data to a top-level directory, refactored all junit usages of .testdata.path to one place
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@806959 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 12:34:24 +00:00
Josh Micich
9ca061617c fixing compiler warnings - unused imports, declared exceptions not thrown
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@806789 13f79535-47bb-0310-9956-ffa450edef68
2009-08-22 05:50:32 +00:00