Sergey Vladimirov
05c7df2042
introduce DocumentPart enum and simplify fields access API
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1148280 13f79535-47bb-0310-9956-ffa450edef68
2011-07-19 11:49:48 +00:00
Sergey Vladimirov
92e5199a95
compact HTML output of WordToHtmlConverter
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1148269 13f79535-47bb-0310-9956-ffa450edef68
2011-07-19 10:51:23 +00:00
Sergey Vladimirov
8e4853b55e
fix test case name for TestExcelToHtmlConverterSuite
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1148039 13f79535-47bb-0310-9956-ffa450edef68
2011-07-18 20:27:11 +00:00
Sergey Vladimirov
9f7257440b
fix Bug 51524 -- PapBinTable constructor is slow (regression)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1148002 13f79535-47bb-0310-9956-ffa450edef68
2011-07-18 18:44:03 +00:00
Sergey Vladimirov
6815f67967
introduce picture descriptor structure (internal), now Picture class extends it;
...
add test case with correctly cropped picture info
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1147450 13f79535-47bb-0310-9956-ffa450edef68
2011-07-16 16:19:49 +00:00
Sergey Vladimirov
66adcb6351
update AllHWPFTests
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1147400 13f79535-47bb-0310-9956-ffa450edef68
2011-07-16 10:19:13 +00:00
Sergey Vladimirov
8a5f656a67
update AllHWPFTests
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1147399 13f79535-47bb-0310-9956-ffa450edef68
2011-07-16 10:17:35 +00:00
Sergey Vladimirov
3568e18a60
fix fields processing, improve test case
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1147082 13f79535-47bb-0310-9956-ffa450edef68
2011-07-15 10:52:11 +00:00
Sergey Vladimirov
0a631075ec
update TestRangeProperties
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145542 13f79535-47bb-0310-9956-ffa450edef68
2011-07-12 11:04:43 +00:00
Sergey Vladimirov
920e0ee72f
convert unicode string constants to \uxxx-s
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145495 13f79535-47bb-0310-9956-ffa450edef68
2011-07-12 08:57:05 +00:00
Sergey Vladimirov
70fd79b2ec
add more detailed message about missing substring
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145494 13f79535-47bb-0310-9956-ffa450edef68
2011-07-12 08:52:12 +00:00
Sergey Vladimirov
75090630ff
add initial support for fast-saved files
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145410 13f79535-47bb-0310-9956-ffa450edef68
2011-07-12 00:40:27 +00:00
Sergey Vladimirov
098cc9fc4f
rewrite PAPX / CHPX loading, allowing complex files to be read
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145342 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 20:49:41 +00:00
Sergey Vladimirov
4a0d32fa98
rename MBD001D0B89.doc to Bug48075.doc
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145288 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 18:45:46 +00:00
Sergey Vladimirov
8430800f64
SEPX uses chars as boundary coordinates, don't do double conversions
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145286 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 18:41:53 +00:00
Sergey Vladimirov
26c1fa750d
push boundaries checks down, removing deprecation warnings, remove (unused) cpMin (Word XP) hack
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1145075 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 08:57:42 +00:00
Sergey Vladimirov
365134f1bd
remove unused parameters
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144923 13f79535-47bb-0310-9956-ffa450edef68
2011-07-10 18:04:17 +00:00
Sergey Vladimirov
6b96874866
shortcut for test case to allow loading bad structure of CHP
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144691 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 16:10:57 +00:00
Sergey Vladimirov
b95ff1310d
make sure at least English text is extracted
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144684 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 15:36:05 +00:00
Sergey Vladimirov
f23c8a021a
temporary(?) workaround for Bug 49933 - Word 6/95 documents with sections cause ArrayIndexOutOfBoundsException
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144683 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 15:33:41 +00:00
Sergey Vladimirov
c2d4b6babe
sort PAPX and SEPX after loading
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144679 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 14:38:42 +00:00
Sergey Vladimirov
d07d8ec7f7
fix the same problem as with PAPX -- CHPX shall be stored based on text positions, not on bytes
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144638 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 11:08:45 +00:00
Sergey Vladimirov
a496e6c0fa
one of the problems linked to 47286 seems to be found -- some CHPX were not saved
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144352 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 15:08:02 +00:00
Sergey Vladimirov
026ef97694
add test case to check if email from Bug47286.doc is present in HTML
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144341 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 14:37:42 +00:00
Sergey Vladimirov
63c571ca1f
add user-friendly way to access field properties if char is a beginning of field
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144336 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 14:32:20 +00:00
Sergey Vladimirov
65f24a120b
save document fields; Bug47286 looks better, but still not the same
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144275 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 11:54:51 +00:00
Sergey Vladimirov
2abc7aa436
update TestFieldsTables
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144251 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 10:54:19 +00:00
Sergey Vladimirov
b5de84932e
fix test for 47286
...
now it will fail if bug is fixed
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144235 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 10:04:03 +00:00
Sergey Vladimirov
18b0247ee2
Test case shall not fail
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144202 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 09:06:33 +00:00
Sergey Vladimirov
eb1dd34245
fix TestPAPBinTable
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144177 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 06:53:14 +00:00
Sergey Vladimirov
beabf33a84
already fixed 48065 - Problems with save output of HWPF (losing formatting)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143809 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 13:13:04 +00:00
Sergey Vladimirov
dd2005da09
fix 47563 - Exception when working with table
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143802 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 12:52:57 +00:00
Sergey Vladimirov
d6d163fd96
fix 47287 - StringIndexOutOfBoundsException in CharacterRun.replaceText()
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143786 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 12:11:53 +00:00
Sergey Vladimirov
f19392c371
Bug 46817 - Regression: Text from some table cells missing is fixed
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143772 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 11:41:53 +00:00
Sergey Vladimirov
9a5143cf4f
fail() shall throw Error, not AssertionError (which is ignored)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143771 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 11:39:30 +00:00
Sergey Vladimirov
65787fc1b9
update test: according to file dump, last row char is 85, so end offset is 86, not 87.
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143725 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 09:25:34 +00:00
Sergey Vladimirov
25f3636438
correctly detect cells of inner tables, do not include last "fake" cell in row
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143707 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 08:24:55 +00:00
Sergey Vladimirov
f0c8d0520c
fix package: org.apache.poi.hssf.usermodel.converter -> org.apache.poi.hssf.converter
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143689 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 07:33:20 +00:00
Sergey Vladimirov
67ff6e3513
output document properties to html and pdf
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143314 13f79535-47bb-0310-9956-ffa450edef68
2011-07-06 09:37:32 +00:00
Sergey Vladimirov
afa392e8c8
add test case for lost cell properties
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143077 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 14:35:36 +00:00
Sergey Vladimirov
8ac3172759
rewrite table bounds detection for Word 97, including inner table support
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143070 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 14:13:27 +00:00
Sergey Vladimirov
320a3ab737
fix range detection for documents with section across document parts.
...
Bug46817.doc is an example of a document where a section contains all document parts, including the main part and the additional part from a textbox. The previous implementation didn't correctly calculate the last paragraph for a Section (the intersection of SEPX and the main document range). Test case is added.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143014 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 12:02:02 +00:00
Sergey Vladimirov
0e6e9f649c
manually test Bug33519.doc
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142882 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 03:04:18 +00:00
Sergey Vladimirov
efd149c834
fix inner tables handling by Range.getTable() method
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142877 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 02:17:54 +00:00
Sergey Vladimirov
65881d12ea
fix test suite name (for better IDE compatibility)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142875 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:53:06 +00:00
Sergey Vladimirov
02672056d8
simplify table converting; fix bug that prevents table from AIOOB-Tap.doc from being converted
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142872 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:44:29 +00:00
Sergey Vladimirov
9538734d31
Test correct processing of "sprmPItap" (0x6649) and "sprmPFInTable" (0x2416)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142867 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:19:31 +00:00
Sergey Vladimirov
bc0abec2c4
add additional TableProperties; implement TableAutoformatLookSpecifier
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142827 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 22:21:10 +00:00
Sergey Vladimirov
ffee3c2c50
add Excel-to-HTML converter (with test suite)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142780 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:49:13 +00:00
Sergey Vladimirov
70d42bbd9e
rename extractor -> converter and move to converter package
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142767 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:14:44 +00:00
Sergey Vladimirov
4fbd693851
add Word-to-HTML extractor
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142765 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:08:06 +00:00
Yegor Kozlov
e4cd5a5b00
cleanup sxssf temp files after tests, remove non-ascii characters from source code, polished test output
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1139518 13f79535-47bb-0310-9956-ffa450edef68
2011-06-25 10:56:46 +00:00
Nick Burch
23077dfb49
Add a unit test to show that bug #49648 no longer occurs
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1139245 13f79535-47bb-0310-9956-ffa450edef68
2011-06-24 11:18:49 +00:00
Yegor Kozlov
bc467bb8c1
resolved old bugzilla issues, added unit tests
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1139204 13f79535-47bb-0310-9956-ffa450edef68
2011-06-24 08:46:37 +00:00
Yegor Kozlov
aac4cf50a9
Bug 51351 - more progress with WordToFoExtractor: fix ListEntryNoListTable and MBD001D0B89 tests
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1138836 13f79535-47bb-0310-9956-ffa450edef68
2011-06-23 11:28:31 +00:00
Yegor Kozlov
6e90e3bc06
bug 51351: more progress with WordToFoExtractor: support for hyperlinks, common fields and code cleanup
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1137673 13f79535-47bb-0310-9956-ffa450edef68
2011-06-20 15:56:28 +00:00
Yegor Kozlov
2761d5a900
junit test for sprm, added missing line from r1137143
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1137538 13f79535-47bb-0310-9956-ffa450edef68
2011-06-20 08:01:23 +00:00
Yegor Kozlov
07253cbc8b
Bugzilla 51335: Parse picture goal and crop sizes
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1134663 13f79535-47bb-0310-9956-ffa450edef68
2011-06-11 14:58:50 +00:00
Nick Burch
9f40e3df71
Improve HSMF MAPIMessage access to the HTML and RTF versions of the message body (where available)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087782 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 16:01:29 +00:00
Nick Burch
002d0a5995
Add new method to HSMF of MAPIMessage.has7BitEncodingStrings() to make it easier to decide when encoding guessing is needed
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087746 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 15:20:07 +00:00
Nick Burch
a0b9fb350b
Update OutlookTextExtractor to request 7 bit encoding guessing
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087734 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 15:02:14 +00:00
Nick Burch
43d5e715d3
Improve HSMF encoding guessing for 7 bit fields, and allow HSMF access to the HTML body contents in MAPIMessage
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087726 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 14:51:45 +00:00
Nick Burch
e43a1777ed
Fix HPBF generics warnings, and add a NPOIFS check to the HPBF tests
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1085495 13f79535-47bb-0310-9956-ffa450edef68
2011-03-25 18:03:18 +00:00
Nick Burch
535e69dea4
Fix generics warnings, and make the NPOIFS and deprecated constructor parts of Publisher clearer and consistent with other parts
...
Also fix a test filename
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1085443 13f79535-47bb-0310-9956-ffa450edef68
2011-03-25 16:06:11 +00:00
Nick Burch
a638f0a9ad
Strip trailing padding from HMEF compressed rtf when decoding
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1081414 13f79535-47bb-0310-9956-ffa450edef68
2011-03-14 15:11:03 +00:00
Yegor Kozlov
865efc9a8c
support for getting HWPFDocument fields, see Bugzilla 50313
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1081291 13f79535-47bb-0310-9956-ffa450edef68
2011-03-14 09:10:12 +00:00
Nick Burch
20eea4a872
Refactor the HMEF contents checks to use a superclass, and stub out the Rtf Message body tests (disabled as there looks to be a padding issue still to solve)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1078304 13f79535-47bb-0310-9956-ffa450edef68
2011-03-05 15:37:24 +00:00
Nick Burch
0df8c11c4b
Two more differences between the LZW in HDGF and HMEF:
...
* Little Endian vs Big Endian storage of the code position
* Initial dictionary position is the end of pre-fill, if there is one, rather than always being position 0
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1078300 13f79535-47bb-0310-9956-ffa450edef68
2011-03-05 15:25:39 +00:00
Nick Burch
f050cf6eeb
Setting the locale doesn't seem to set the timezone, so do that explicitly to ensure the tests behave for everyone
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1077874 13f79535-47bb-0310-9956-ffa450edef68
2011-03-04 11:06:59 +00:00
Nick Burch
bbfa85f807
Tweak the attribute getter names on Attachments, and add MAPI+TNEF Attribute unit tests
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076685 13f79535-47bb-0310-9956-ffa450edef68
2011-03-03 17:06:37 +00:00
Nick Burch
efe5a63429
Add a TNEF extractor class, which can be run from the command line to extract out the message body and attachments
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076607 13f79535-47bb-0310-9956-ffa450edef68
2011-03-03 13:01:07 +00:00
Nick Burch
aa1963cd3b
Improve HMEF handling of typed attributes (Strings and Dates), for both TNEF and MAPI attributes, and use this to allow easier access to common file parts. Then use this in the attachment unit tests.
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076603 13f79535-47bb-0310-9956-ffa450edef68
2011-03-03 12:41:39 +00:00
Nick Burch
145de166a1
Switch HMEF attributes to a factory scheme for creation, then add subtypes for cleaner code. Also adds a few more tests
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076310 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 17:52:12 +00:00
Nick Burch
20f6e280a5
Refactor the TNEF Attributes to better match the MAPI ones, and move both into their own package
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076251 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 15:25:35 +00:00
Nick Burch
649df30b76
Start on unit tests for HMEF. Quite a bit is still stubbed out, and it shows that the LZW isn't quite right yet (so tests disabled)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1075955 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 18:12:44 +00:00
Maxim Valyanskiy
40dbadc49e
hwpf: ignore null-reference to parent stylesheet (bug#50688)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1065535 13f79535-47bb-0310-9956-ffa450edef68
2011-01-31 09:27:44 +00:00
Nick Burch
9d2b9caa33
Add support for custom MAPI Properties (0x8000 and above, plus unknown lower ones)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058262 13f79535-47bb-0310-9956-ffa450edef68
2011-01-12 18:14:49 +00:00
Nick Burch
4358a18407
MAPIProperty is a better fit name for HSMF, so change the name before it gets too widely used
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058116 13f79535-47bb-0310-9956-ffa450edef68
2011-01-12 12:45:51 +00:00
Nick Burch
babc3f173e
More converting of HSMF code to use strongly typed attributes
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1057712 13f79535-47bb-0310-9956-ffa450edef68
2011-01-11 16:11:23 +00:00
Nick Burch
d9197ba3a4
Start converting HSMF code to use the new strongly typed MAPIAttribute class for lookups, and add a dev class for listing the defined attributes
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1057698 13f79535-47bb-0310-9956-ffa450edef68
2011-01-11 15:35:29 +00:00
Nick Burch
2d583f4e3f
Tweak HSLF and HWPF to work well with NPOIFS, and add unit tests for this
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1054191 13f79535-47bb-0310-9956-ffa450edef68
2011-01-01 05:42:36 +00:00
Nick Burch
f2b541e026
Since a DirectoryNode has a reference to the underlying POIFSFileSystem, tidy up the POIDocument constructor to not need both passing in
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1053521 13f79535-47bb-0310-9956-ffa450edef68
2010-12-29 03:19:46 +00:00
Nick Burch
d2583d83d5
Refactor the common LZW decompression code out into utils
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1051377 13f79535-47bb-0310-9956-ffa450edef68
2010-12-21 05:18:34 +00:00
Nick Burch
cbf86ed0bc
Improve the HDGF LZW code.
...
Some tweaks to the decompression, and more tests, but mostly work on the compression side. We can now compress small streams properly, and these round-trip fine. However, some longer streams don't compress correctly, and more work on that is still needed (see the disabled unit test)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1049805 13f79535-47bb-0310-9956-ffa450edef68
2010-12-16 07:41:41 +00:00
Nick Burch
e0ec632d69
Slightly expand tests, inspired by bug #50060, but ultimately show there is a deeper issue outstanding
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1026399 13f79535-47bb-0310-9956-ffa450edef68
2010-10-22 16:41:02 +00:00
Yegor Kozlov
1715a0deaf
removed svn:executable from project files
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1022458 13f79535-47bb-0310-9956-ffa450edef68
2010-10-14 10:34:59 +00:00
Yegor Kozlov
76470fed4e
avoid NPE in ListLevel.getNumberText() when numberText is null, see Bugzilla 50075
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1022456 13f79535-47bb-0310-9956-ffa450edef68
2010-10-14 10:30:29 +00:00
Yegor Kozlov
d6307ad1aa
support for BorderCode in HWPF, see Bugzilla 49919
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1005447 13f79535-47bb-0310-9956-ffa450edef68
2010-10-07 13:55:46 +00:00
Yegor Kozlov
b53e0ba59c
support for processing of symbols in HWPF, see Bugzilla 49908
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1005443 13f79535-47bb-0310-9956-ffa450edef68
2010-10-07 13:40:58 +00:00
Nick Burch
a018a145bf
Test that shows we handle word floating and fixed pictures properly
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1002123 13f79535-47bb-0310-9956-ffa450edef68
2010-09-28 11:46:22 +00:00
Maxim Valyanskiy
6d03a80070
revert previous commit because it causes problems with other old .docs
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1001726 13f79535-47bb-0310-9956-ffa450edef68
2010-09-27 13:08:57 +00:00
Maxim Valyanskiy
286b682cdc
hwpf: add 2 more bytes to OldSectionTable to solve ArrayIndexOutOfBoundsException
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1001712 13f79535-47bb-0310-9956-ffa450edef68
2010-09-27 12:50:36 +00:00
Nick Burch
cc419a3a85
Work inspired by bug #48018 - get HWPF lists more consistent in read vs write, and preserve order as apparently that matters. Includes a fair number of list related unit tests, but not for everything
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998943 13f79535-47bb-0310-9956-ffa450edef68
2010-09-20 14:26:49 +00:00
Nick Burch
ca0fc0ec1e
Apply patch from bug #49820 - Fix HWPF paragraph levels, so that outline levels can be properly fetched
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998897 13f79535-47bb-0310-9956-ffa450edef68
2010-09-20 11:45:53 +00:00
Nick Burch
50b44decce
Add test that shows that bug #45877 has already been fixed
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998634 13f79535-47bb-0310-9956-ffa450edef68
2010-09-19 11:52:20 +00:00
Nick Burch
8043f38bd7
More fixes for bug #49933, workaround the fact that some word6/word95 SEPX entries are compressed differently, and we don't have the specs for how they're stored
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998621 13f79535-47bb-0310-9956-ffa450edef68
2010-09-19 09:59:10 +00:00
Nick Burch
100095bcb9
Fix bug #49936 - Handle HWPF documents with problematic HeaderStories better
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998146 13f79535-47bb-0310-9956-ffa450edef68
2010-09-17 14:14:19 +00:00
Nick Burch
c07e529c03
Fix support for sections in old word 6 / word 95 files
...
Improve unit testing for HWPFOldDocument
Sprm fix also improves some HWPFDocument files too!
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998131 13f79535-47bb-0310-9956-ffa450edef68
2010-09-17 13:46:11 +00:00
Nick Burch
0e504bd9ca
Add a few more test asserts
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@997751 13f79535-47bb-0310-9956-ffa450edef68
2010-09-16 13:45:54 +00:00
Nick Burch
9647ab8976
Add some more paragraph property documentation and tests
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@997413 13f79535-47bb-0310-9956-ffa450edef68
2010-09-15 17:38:50 +00:00
Nick Burch
216bcfea03
Tweak hssf and hwpf image mime types for wmf/emf to match those used by XSSF/XWPF
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995943 13f79535-47bb-0310-9956-ffa450edef68
2010-09-10 19:15:16 +00:00
Nick Burch
96b0aea310
Improve documentation of some of the HWPF picture stuff, and add unit tests for images of embedded documents
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995807 13f79535-47bb-0310-9956-ffa450edef68
2010-09-10 14:37:45 +00:00
Nick Burch
2b9a63d38d
Add getMimeType() method to HWPF Picture, and fix some generics warnings
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995464 13f79535-47bb-0310-9956-ffa450edef68
2010-09-09 15:30:55 +00:00
Nick Burch
dac67e887e
Fix generics warnings
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995445 13f79535-47bb-0310-9956-ffa450edef68
2010-09-09 14:28:14 +00:00
Nick Burch
514e6be1fe
Fix bug #47990 - Support for .msg attachments within a MAPIMessage .msg
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@982331 13f79535-47bb-0310-9956-ffa450edef68
2010-08-04 17:08:39 +00:00
Maxim Valyanskiy
e52778847c
hwpf: append any character data before paragraphs to first paragraph
...
(workaround for bug#48075)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@982238 13f79535-47bb-0310-9956-ffa450edef68
2010-08-04 12:43:58 +00:00
Nick Burch
ce0fea767f
Fix bug #49441 - Allow overriding and guessing of HSMF non-unicode string encodings
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@981947 13f79535-47bb-0310-9956-ffa450edef68
2010-08-03 16:06:21 +00:00
Maxim Valyanskiy
c21d04a68b
hwpf: More CharacterRun mapping fixes for fast-saved documents
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@979618 13f79535-47bb-0310-9956-ffa450edef68
2010-07-27 09:38:54 +00:00
Maxim Valyanskiy
8fab18e37a
More fixes for auto-saved documents
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@979286 13f79535-47bb-0310-9956-ffa450edef68
2010-07-26 14:04:27 +00:00
Nick Burch
bea0801c24
Patch from Jens Gotze from bug #49581 - Ability to add, modify and remove series from HSSF Charts
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@964855 13f79535-47bb-0310-9956-ffa450edef68
2010-07-16 16:46:42 +00:00
Maxim Valyanskiy
8661c09c35
HWPF: test auto-saved document
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@961342 13f79535-47bb-0310-9956-ffa450edef68
2010-07-07 10:41:17 +00:00
Nick Burch
256e73d16d
More Word 6 / Word 95 Support
...
HWPFOldDocument now processes a few more table sections, and so we can fake up some
basic Ranges. This allows us to do paragraph level text extraction
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960102 13f79535-47bb-0310-9956-ffa450edef68
2010-07-02 20:59:30 +00:00
Nick Burch
30848a80aa
Basic text extraction support for old Word 6 and Word 95 documents via some HWPF extensions
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@959346 13f79535-47bb-0310-9956-ffa450edef68
2010-06-30 15:13:10 +00:00
Nick Burch
ad33151624
Better handling of Outlook messages in HSMF when there's no recipient email address
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@954476 13f79535-47bb-0310-9956-ffa450edef68
2010-06-14 13:47:22 +00:00
Nick Burch
05ddf6a51e
Fix for bug #48245 - tweak HWPF table cell detection to work across more files
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@953694 13f79535-47bb-0310-9956-ffa450edef68
2010-06-11 13:29:44 +00:00
Nick Burch
bf4e6ff464
Add additional RevisionMarkAuthorTable test
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@953343 13f79535-47bb-0310-9956-ffa450edef68
2010-06-10 15:02:05 +00:00
Nick Burch
d29d1d7d9b
Apply with tweaks the patch from bug #45269 - improve replaceText on HWPF ranges
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951498 13f79535-47bb-0310-9956-ffa450edef68
2010-06-04 17:19:31 +00:00
Nick Burch
45c4b6bf8f
Tweak @link reference to avoid compiler issues
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951055 13f79535-47bb-0310-9956-ffa450edef68
2010-06-03 16:23:40 +00:00
Nick Burch
f9fa636e6d
Remove un-used imports
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951053 13f79535-47bb-0310-9956-ffa450edef68
2010-06-03 16:21:41 +00:00
Nick Burch
65d7431a9f
Parse the HSMF headers chunk if present, and use it to find Dates in text extraction if needed
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951034 13f79535-47bb-0310-9956-ffa450edef68
2010-06-03 15:33:54 +00:00
Nick Burch
cee16bc83b
List attachment names in the output of OutlookTextExtractor (to get attachment contents, use ExtractorFactory as normal)
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@950595 13f79535-47bb-0310-9956-ffa450edef68
2010-06-02 15:24:11 +00:00
Yegor Kozlov
6ee427ddf9
fixed construction of the DIB picture header, see Bugzilla 43161
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@949483 13f79535-47bb-0310-9956-ffa450edef68
2010-05-30 06:56:32 +00:00
Nick Burch
6666c539da
Add a simple testcase for the new RevisionMarkAuthorTable.java
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@948445 13f79535-47bb-0310-9956-ffa450edef68
2010-05-26 14:22:49 +00:00
Yegor Kozlov
fe048df54e
Fixed locale-sensitive formatters in PackagePropertiesPart, see Bugzilla 49138
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@935896 13f79535-47bb-0310-9956-ffa450edef68
2010-04-20 12:57:27 +00:00
Yegor Kozlov
639bf94c6f
propagate parent to parent-aware records decoded from Escher, also ensure that TextShape and EscherTextboxWrapper hold the same cached sets of records, see Bugzilla 48916
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@930525 13f79535-47bb-0310-9956-ffa450edef68
2010-04-03 14:44:39 +00:00
Nick Burch
918f1a496d
Fix an issue with the HSMF tests working on some machines but not others - Make poifs.filesystem.DirectoryNode preserve the original ordering of its files, which HSMF needs to be able to correctly match up chunks
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@911878 13f79535-47bb-0310-9956-ffa450edef68
2010-02-19 17:55:32 +00:00
Nick Burch
943d3d19e1
Add a disabled test for bug #44501, which still remains, plus fix a generics warning
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904062 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:28:29 +00:00
Nick Burch
9bbf3ef4d0
Fix generics warnings, and fix up tests to handle the extra bit of text being extracted now
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904060 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:20:32 +00:00
Nick Burch
3aef368b71
Apply patch from Jukka from bug #43670 to improve HDGF v11 Separator detection, and handle short strings better, hopefully solving the Negative length of ChunkHeader issue
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904052 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:05:13 +00:00
Nick Burch
545f2e1119
Improved how HSMF handles multiple recipients
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898295 13f79535-47bb-0310-9956-ffa450edef68
2010-01-12 12:02:18 +00:00
Nick Burch
5621bb0800
Make it possible to return null on missing chunks, rather than the exception
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897847 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 12:19:42 +00:00
Nick Burch
98cea49eb5
Rename the outlook extractor to be more consistent with other extractors
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897249 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:18:52 +00:00
Nick Burch
bd2f63c721
Add a text extractor to HSMF for simpler extraction of text from .msg files
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897242 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:04:15 +00:00
Nick Burch
7ae86fab09
More work on the recipient related chunks, including a helper method to do best-effort finding of the recipients email address
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897213 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 14:26:27 +00:00
Nick Burch
52695c1811
Quick bit of refactoring to save parsing the type and id twice
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897205 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:49:09 +00:00
Nick Burch
ff94e5c61b
Support fetching the message date from the submission id
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897201 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:42:53 +00:00
Nick Burch
58806414fc
Tweak a few tests, and add in a few more chunk types
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897185 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 12:08:35 +00:00
Nick Burch
795ed3ce55
Complete chunk parser tests, and make more chunk groups available
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897172 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:37:37 +00:00
Nick Burch
0e368a23da
Fix some chunk types, fix the directory descent, fix the Msg2txt example, and start on fixing core tests
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897167 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:14:58 +00:00
Nick Burch
6afb781730
Shuffle where some of the HSMF tests live to better match package names, and stub out a few more tests
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896923 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 16:47:09 +00:00
Nick Burch
2bb376f55b
Start on major HSMF refactoring. Should compile, but not quite all tests pass as a little bit of work is left
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896914 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 16:15:20 +00:00
Yegor Kozlov
a64917fe88
removed svn:eol-style=native from binary files, uncommented related tests; see Bugzilla 48427
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@894173 13f79535-47bb-0310-9956-ffa450edef68
2009-12-28 13:45:10 +00:00
Yegor Kozlov
e8e2b1ff49
support for text extraction from PPT master slides, see Bugzilla 48161
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@835271 13f79535-47bb-0310-9956-ffa450edef68
2009-11-12 07:07:41 +00:00
Nick Burch
a9d6f34732
Patch from Trejkaz - fix bug #48038 - handle reading HWPF stylesheets from non zero offsets
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@832625 13f79535-47bb-0310-9956-ffa450edef68
2009-11-04 00:07:37 +00:00
Josh Micich
ed1b4794a0
Bugzilla 47962 - Fixed some potential NPEs. Avoided unnecessary creation of box instances. Applied patch with mods
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@823348 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 22:29:41 +00:00
Yegor Kozlov
bd966f208b
removed unnecessary svn:executable flag from files in SVN trunk, see bugzilla 47734
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@814335 13f79535-47bb-0310-9956-ffa450edef68
2009-09-13 16:05:28 +00:00
Yegor Kozlov
4841ed0d47
Fixed PPT parser to tolerate Comment2000 containers with missing comment text, see Bugzilla 44770
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@811814 13f79535-47bb-0310-9956-ffa450edef68
2009-09-06 11:35:01 +00:00
Yegor Kozlov
f4f823b94c
Support for extraction of header / footer images in HWPF, see Bugzilla 47733
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@809659 13f79535-47bb-0310-9956-ffa450edef68
2009-08-31 16:57:03 +00:00
Yegor Kozlov
728f88bd92
moved all test data to a top-level directory, refactored all junit usages of .testdata.path to one place
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@806959 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 12:34:24 +00:00
Josh Micich
9ca061617c
fixing compiler warnings - unused imports, declared exceptions not thrown
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@806789 13f79535-47bb-0310-9956-ffa450edef68
2009-08-22 05:50:32 +00:00