Commit Graph

1049 Commits

Author SHA1 Message Date
Sergey Vladimirov
fce856c75c fix comparator
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144366 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 15:51:46 +00:00
Sergey Vladimirov
6845dab8b3 organize imports
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144363 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 15:50:06 +00:00
Sergey Vladimirov
338cf7bc4d additional check and workaround for wrong order (fix TestWordExtractorBugs.testBug50688)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144361 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 15:41:40 +00:00
Sergey Vladimirov
a496e6c0fa one of problems linked to 47286 seems to be found -- some CHPX were not saved
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144352 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 15:08:02 +00:00
Sergey Vladimirov
026ef97694 add test case to check if email from Bug47286.doc is present in HTML
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144341 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 14:37:42 +00:00
Sergey Vladimirov
dcf5427b8a simplify work with fields
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144337 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 14:33:01 +00:00
Sergey Vladimirov
63c571ca1f add user-friendly way to access field properties if char is a beginning of field
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144336 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 14:32:20 +00:00
Sergey Vladimirov
34b6794422 sort CHPX on load (sometimes out of order)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144333 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 14:31:25 +00:00
Sergey Vladimirov
189f4c0a09 add toString method to GenericPropertyNode
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144283 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 12:18:48 +00:00
Sergey Vladimirov
5c8219c777 simplify PlexOfField by using GenericPropertyNode
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144282 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 12:18:36 +00:00
Sergey Vladimirov
65f24a120b save document fields, but Bug47286 looks better, but still not the save
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144275 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 11:54:51 +00:00
Sergey Vladimirov
2abc7aa436 update TestFieldsTables
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144251 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 10:54:19 +00:00
Sergey Vladimirov
2d46ded6b7 more correct definition of FLD, preserving reserved bits
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144249 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 10:53:58 +00:00
Sergey Vladimirov
b5de84932e fix test for 47286
now it will fail if bug is fixed

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144235 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 10:04:03 +00:00
Sergey Vladimirov
1f37d4db1a add documentation
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144233 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 10:01:16 +00:00
Sergey Vladimirov
33cb2db6dd make getSize() static
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144229 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 09:55:25 +00:00
Sergey Vladimirov
d1c0ea5578 refactor FieldDescriptor to be based on autogenerated code
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144224 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 09:49:47 +00:00
Sergey Vladimirov
18b0247ee2 Test case shall not fail
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144202 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 09:06:33 +00:00
Sergey Vladimirov
11d065a01a fix compiler warning ("unmappable character for encoding ASCII")
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144197 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 08:54:13 +00:00
Sergey Vladimirov
eb1dd34245 fix TestPAPBinTable
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1144177 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 06:53:14 +00:00
Sergey Vladimirov
296583c971 dump text run properties
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143895 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 15:59:20 +00:00
Sergey Vladimirov
8f6a4fd533 CharacterSprmUncompressor extends SprmUncompressor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143894 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 15:58:17 +00:00
Sergey Vladimirov
0bf2eb9d6c update CHP type
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143838 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 14:15:03 +00:00
Sergey Vladimirov
a69d691581 dump text runs
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143837 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 14:14:37 +00:00
Sergey Vladimirov
78a951143f support Word95 files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143822 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 13:43:29 +00:00
Sergey Vladimirov
beabf33a84 already fixed 48065 - Problems with save output of HWPF (losing formatting)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143809 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 13:13:04 +00:00
Sergey Vladimirov
dd2005da09 fix 47563 - Exception when working with table
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143802 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 12:52:57 +00:00
Sergey Vladimirov
d6d163fd96 fix 47287 - StringIndexOutOfBoundsException in CharacterRun.replaceText()
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143786 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 12:11:53 +00:00
Sergey Vladimirov
f19392c371 Bug 46817 - Regression: Text from some table cells missing is fixed
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143772 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 11:41:53 +00:00
Sergey Vladimirov
9a5143cf4f fail() shall throw Error, not AssertionError (which is ignored)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143771 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 11:39:30 +00:00
Sergey Vladimirov
3c2f15b9ff add content of main text box to document text
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143769 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 11:38:59 +00:00
Sergey Vladimirov
814aed073e uncomment lines, that were temporary commented for debug purposes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143760 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 11:09:47 +00:00
Sergey Vladimirov
2dc223b9e9 allow bad properties parsing
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143758 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 11:04:59 +00:00
Sergey Vladimirov
4abb3c6e51 add / update documentation
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143756 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 10:56:46 +00:00
Sergey Vladimirov
e325f0d084 fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143754 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 10:41:42 +00:00
Sergey Vladimirov
bbbea8860d fix test case for 45473: calculate PAPX boundaries basing on char positions, not on previously read byte positions (they are outdated); fix boundaries checks (again)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143753 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 10:39:27 +00:00
Sergey Vladimirov
d1eb54b129 fix generics warning
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143740 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 09:59:51 +00:00
Sergey Vladimirov
92e9201b77 add ability to dump text pieces
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143734 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 09:51:42 +00:00
Sergey Vladimirov
4c9a35ee4d add toString() methods to PieceDescriptor and TextPiece; add hashCode() to PieceDescriptor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143733 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 09:51:21 +00:00
Sergey Vladimirov
65787fc1b9 update test: according to file dump, last row char is 85, so end offset is 86, not 87.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143725 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 09:25:34 +00:00
Sergey Vladimirov
4c670a3efe fix: too many PAPX were dropped due to too strict boundary checks
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143720 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 09:16:26 +00:00
Sergey Vladimirov
d7596a864f always output overall range
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143710 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 08:37:03 +00:00
Sergey Vladimirov
0de832ba9d add getOverallRange() method definition to HWPFDocumentCore
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143709 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 08:35:51 +00:00
Sergey Vladimirov
25f3636438 correctly detect cells of inner tables, do not include last "fake" cell in row
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143707 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 08:24:55 +00:00
Sergey Vladimirov
b3def16e9a add javadocs
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143706 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 08:24:22 +00:00
Sergey Vladimirov
8979236b05 print paragraph number
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143704 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 08:14:45 +00:00
Sergey Vladimirov
e465a37e16 update PAPAbstractType, adding field descriptions
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143701 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 08:05:38 +00:00
Sergey Vladimirov
dde8c709a7 initialize more TAPAbstractType fields to prevent NPE
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143694 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 07:39:53 +00:00
Sergey Vladimirov
f0c8d0520c fix package: org.apache.poi.hssf.usermodel.converter -> org.apache.poi.hssf.converter
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143689 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 07:33:20 +00:00
Sergey Vladimirov
755cea3d76 add empty rdf:about to rdf:Description
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143331 13f79535-47bb-0310-9956-ffa450edef68
2011-07-06 10:36:30 +00:00
Sergey Vladimirov
67ff6e3513 output document properties to html and pdf
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143314 13f79535-47bb-0310-9956-ffa450edef68
2011-07-06 09:37:32 +00:00
Sergey Vladimirov
fc3c8fcf10 fix lost cell properties (and row shift)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143079 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 14:35:54 +00:00
Sergey Vladimirov
afa392e8c8 add test case for lost cell properties
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143077 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 14:35:36 +00:00
Sergey Vladimirov
8ac3172759 rewrite table bounds detection for Word 97, including inner table support
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143070 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 14:13:27 +00:00
Sergey Vladimirov
fa46737e44 ignore PAPX if they references non-existing text parts
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143069 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 14:12:16 +00:00
Sergey Vladimirov
320a3ab737 fix range detection for documents with section across document parts.
Bug46817.doc is the example of document where section contains all document parts - including main part and additional from textbox. Previous implemenetation didn't correctly calculate last paragraph for Section (intersection of SEPX and main document range). Test case is added.

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143014 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 12:02:02 +00:00
Sergey Vladimirov
d24bfcea93 handle empty strings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143012 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 11:58:50 +00:00
Sergey Vladimirov
34436d6ec7 add Section.toString() for debug purposes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143011 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 11:58:24 +00:00
Sergey Vladimirov
e1e99605de add SEPX.toString() for debug purposes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142979 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 11:02:30 +00:00
Sergey Vladimirov
8bbbf6996f correct range calculation for non-complex file
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142977 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 10:58:04 +00:00
Sergey Vladimirov
0e6e9f649c manually test Bug33519.doc
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142882 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 03:04:18 +00:00
Sergey Vladimirov
2cbeb04781 sometimes there is a lag between text pieces; for example
[0; 2), [2; 4), [6,8)
previous implementation correctly calculates next(2)=>2, but incorrectly next(4)=>4 (need to be 6).

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142881 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 03:03:53 +00:00
Sergey Vladimirov
9b69954212 add additional sanity checks
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142880 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 03:02:12 +00:00
Sergey Vladimirov
efd149c834 fix inner tables handling by Range.getTable() method
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142877 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 02:17:54 +00:00
Sergey Vladimirov
65881d12ea fix test suite name (for better IDEs compatibility)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142875 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:53:06 +00:00
Sergey Vladimirov
efb7a720aa add isEmpty() method to TALS and handle it by SPRM 0x740a compressor to preserve old behavior:
do not add 0x740a SPRM if TALS is empty

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142874 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:51:15 +00:00
Sergey Vladimirov
02672056d8 simplify table converting; fix bug that prevents table from AIOOB-Tap.doc to be converted
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142872 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:44:29 +00:00
Sergey Vladimirov
d2e2970e07 allow user to check table level from Table class (not only from Paragraph), so table processing code is simplified
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142871 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:40:39 +00:00
Sergey Vladimirov
9538734d31 Test correct processing of "sprmPItap" (0x6649) and "sprmPFInTable" (0x2416)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142867 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:19:31 +00:00
Sergey Vladimirov
6abfa5742a rename PAP structure fields according to specification
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142864 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 01:11:42 +00:00
Sergey Vladimirov
f5c03d6194 add additional options and features to HWPFLister, enabling PAPX and paragraphs (including SPRMs) information dump
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142861 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 00:48:53 +00:00
Sergey Vladimirov
68aef63742 add or improve toString() methods of several classes for debug purposes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142860 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 00:48:00 +00:00
Sergey Vladimirov
b8ff8ac945 add SprmOperation.toString() for debug purposes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142859 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 00:46:42 +00:00
Sergey Vladimirov
e41354dcd5 fix warning about cast
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142836 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 22:36:52 +00:00
Sergey Vladimirov
fbbc482d44 fix warning about generics
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142835 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 22:36:27 +00:00
Sergey Vladimirov
bc0abec2c4 add additional TableProperties; implement TableAutoformatLookSpecifier
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142827 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 22:21:10 +00:00
Sergey Vladimirov
ffee3c2c50 add Excel-to-HTML converter (with test suite)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142780 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:49:13 +00:00
Sergey Vladimirov
679c2b403e remove unused method
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142771 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:20:20 +00:00
Sergey Vladimirov
70d42bbd9e rename extractor -> converter and move to converter package
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142767 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:14:44 +00:00
Sergey Vladimirov
4fbd693851 add Word-to-HTML extractor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142765 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:08:06 +00:00
Sergey Vladimirov
2b97a034fe warn and skip wrong type SPRM
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142764 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:06:20 +00:00
Sergey Vladimirov
89a5e4c11c extract private getCharacterRun method with CHPX as parameter (used with reflection in workaround in AbstractWordUtils)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142763 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:05:40 +00:00
Sergey Vladimirov
1c0d6a5d09 workaround for NPE caused by old Word format
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142762 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:04:26 +00:00
Sergey Vladimirov
ed7c3eeb07 add toString() to BorderCode
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142761 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:04:08 +00:00
Sergey Vladimirov
a57c397517 workaround for NPE caused by old Word format
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1142760 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 19:03:45 +00:00
Maxim Valyanskiy
e1b483d6ac hwpf: ignore ArrayIndexOutOfBoundsException in TableSprmUncompressor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1141490 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 11:46:27 +00:00
Yegor Kozlov
e4cd5a5b00 cleanup sxxsf temp files after tests, remove non-ascii characters from source code, polished test output
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1139518 13f79535-47bb-0310-9956-ffa450edef68
2011-06-25 10:56:46 +00:00
Nick Burch
23077dfb49 Add a unit test to show that bug #49648 no longer occurs
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1139245 13f79535-47bb-0310-9956-ffa450edef68
2011-06-24 11:18:49 +00:00
Yegor Kozlov
bc467bb8c1 resolved old bugzilla issues, added unit tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1139204 13f79535-47bb-0310-9956-ffa450edef68
2011-06-24 08:46:37 +00:00
Yegor Kozlov
aac4cf50a9 Bug 51351 - more progress with WordToFoExtractor: fix ListEntryNoListTable and MBD001D0B89 tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1138836 13f79535-47bb-0310-9956-ffa450edef68
2011-06-23 11:28:31 +00:00
Yegor Kozlov
d45f22ad8d Bug 51424 - Ignore exceptions in ParagraphSprmUncompressor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1138834 13f79535-47bb-0310-9956-ffa450edef68
2011-06-23 11:21:05 +00:00
Yegor Kozlov
6e90e3bc06 bug 51351: more progress with WordToFoExtractor: support for hyperlinks, common fields and code cleanup
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1137673 13f79535-47bb-0310-9956-ffa450edef68
2011-06-20 15:56:28 +00:00
Yegor Kozlov
2761d5a900 junit test for sprm, added missing line from r1137143
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1137538 13f79535-47bb-0310-9956-ffa450edef68
2011-06-20 08:01:23 +00:00
Yegor Kozlov
545cde3374 Support for sprmPJc paragraph SPRM and handling of additional properties in ParagraphSprmCompressor, see Bugzilla 51389 and 51390
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1137143 13f79535-47bb-0310-9956-ffa450edef68
2011-06-18 08:57:10 +00:00
Yegor Kozlov
cbb3176c72 more progress with WordToFoExtractor, see Bugzilla 51351
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1136001 13f79535-47bb-0310-9956-ffa450edef68
2011-06-15 11:41:22 +00:00
Yegor Kozlov
3c34797e0a Bugzilla 51351: Word to XSL-FO converter
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1135414 13f79535-47bb-0310-9956-ffa450edef68
2011-06-14 08:53:00 +00:00
Nick Burch
6f3f1a479e Fix deprecated warning
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1135118 13f79535-47bb-0310-9956-ffa450edef68
2011-06-13 14:46:21 +00:00
Yegor Kozlov
07253cbc8b Bugzilla 51335: Parse picture goal and crop sizes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1134663 13f79535-47bb-0310-9956-ffa450edef68
2011-06-11 14:58:50 +00:00
Yegor Kozlov
65cfe469b0 Bug 51305: Add sprmTCellPaddingDefault support; Update Table Cell Descriptor, make it non-from-file;
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1132616 13f79535-47bb-0310-9956-ffa450edef68
2011-06-06 12:43:17 +00:00
Yegor Kozlov
866a7bf272 cleaned javadocs warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1132549 13f79535-47bb-0310-9956-ffa450edef68
2011-06-06 08:45:06 +00:00
Nick Burch
f3b6cc03d4 Patch from Sergey from bug #51292 - Additional HWPF Table Cell descriptor values
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1129690 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 14:21:32 +00:00
Maxim Valyanskiy
98860bdcec hwpf: ignore invalid style reference
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1095664 13f79535-47bb-0310-9956-ffa450edef68
2011-04-21 11:49:03 +00:00
Nick Burch
aad03b8cd3 Tweak 7 bit guessing code
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087788 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 16:27:03 +00:00
Nick Burch
9f40e3df71 Improve HSMF MAPIMessage access to the HTML and RTF versions of the message body (where available)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087782 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 16:01:29 +00:00
Nick Burch
002d0a5995 Add new method to HSMF of MAPIMessage.has7BitEncodingStrings() to make it easier to decide when encoding guessing is needed
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087746 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 15:20:07 +00:00
Nick Burch
a0b9fb350b Update OutlookTextExtractor to request 7 bit encoding guessing
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087734 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 15:02:14 +00:00
Nick Burch
43d5e715d3 Improve HSMF encoding guessing for 7 bit fields, and allow HSMF access to the HTML body contents in MAPIMessage
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1087726 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 14:51:45 +00:00
Nick Burch
2a295bf8ea More NPOIFS constructors, and document recent fixes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1085501 13f79535-47bb-0310-9956-ffa450edef68
2011-03-25 18:17:45 +00:00
Nick Burch
e43a1777ed Fix HPBF generics warnings, and add a NPOIFS check to the HPBF tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1085495 13f79535-47bb-0310-9956-ffa450edef68
2011-03-25 18:03:18 +00:00
Nick Burch
64185cd51c More NPOIFS Constructor updates
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1085447 13f79535-47bb-0310-9956-ffa450edef68
2011-03-25 16:21:09 +00:00
Nick Burch
535e69dea4 Fix generics warnings, and make the NPOIFS and deprecated constructor parts of Publisher clearer and consistent with other parts
Also fix a test filename


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1085443 13f79535-47bb-0310-9956-ffa450edef68
2011-03-25 16:06:11 +00:00
Nick Burch
a638f0a9ad Strip trailing padding from HMEF compressed rtf when decoding
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1081414 13f79535-47bb-0310-9956-ffa450edef68
2011-03-14 15:11:03 +00:00
Yegor Kozlov
865efc9a8c support for getting HWPFDocument fields, see Bugzilla 50313
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1081291 13f79535-47bb-0310-9956-ffa450edef68
2011-03-14 09:10:12 +00:00
Nick Burch
20eea4a872 Refactor the HMEF contents checks to use a superclass, and stub out the Rtf Message body tests (disabled as there looks to be a padding issue still to solve)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1078304 13f79535-47bb-0310-9956-ffa450edef68
2011-03-05 15:37:24 +00:00
Nick Burch
0df8c11c4b Two more differences between the LZW in HDGF and HMEF:
* Little Endian vs Big Endian storage of the code position
 * Initial dictionary position is the end of pre-fill, if there is one, rather than always being position 0


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1078300 13f79535-47bb-0310-9956-ffa450edef68
2011-03-05 15:25:39 +00:00
Nick Burch
f050cf6eeb Setting the locale doesn't seem to set the timezone, so do that explicitly to ensure the tests behave for everyone
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1077874 13f79535-47bb-0310-9956-ffa450edef68
2011-03-04 11:06:59 +00:00
Nick Burch
8e1411c194 Fix a couple of license headers, and tweak the rat settings so it runs clean
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1077865 13f79535-47bb-0310-9956-ffa450edef68
2011-03-04 11:00:13 +00:00
Nick Burch
bbfa85f807 Tweak the attribute getter names on Attachments, and add MAPI+TNEF Attribute unit tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076685 13f79535-47bb-0310-9956-ffa450edef68
2011-03-03 17:06:37 +00:00
Nick Burch
efe5a63429 Add a TNEF extractor class, which can be run from the command line to extract out the message body and attachments
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076607 13f79535-47bb-0310-9956-ffa450edef68
2011-03-03 13:01:07 +00:00
Nick Burch
aa1963cd3b Improve HMEF handling of typed attributes (Strings and Dates), for both TNEF and MAPI attributes, and use this to allow easier access to common file parts. Then use this in the attachment unit tests.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076603 13f79535-47bb-0310-9956-ffa450edef68
2011-03-03 12:41:39 +00:00
Nick Burch
145de166a1 Switch HMEF attributes to a factory scheme for creation, then add subtypes for cleaner code. Also adds a few more tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076310 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 17:52:12 +00:00
Nick Burch
20f6e280a5 Refactor the TNEF Attributes to better match the MAPI ones, and move both into their own package
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1076251 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 15:25:35 +00:00
Nick Burch
649df30b76 Start on unit tests for HMEF. Quite a bit is still stubbed out, and it shows that the LZW isn't quite right yet (so tests disabled)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1075955 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 18:12:44 +00:00
Nick Burch
3937b2f310 Switch Picture processing in HSLFSlideShow to be lazy-loading, to speed things up if you're only interested in text stuff
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1073883 13f79535-47bb-0310-9956-ffa450edef68
2011-02-23 19:01:24 +00:00
Maxim Valyanskiy
e325b96167 hwpf: ignore invalid style reference
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1065538 13f79535-47bb-0310-9956-ffa450edef68
2011-01-31 09:55:10 +00:00
Maxim Valyanskiy
40dbadc49e hwpf: ignore null-reference to parent stylesheet (bug#50688)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1065535 13f79535-47bb-0310-9956-ffa450edef68
2011-01-31 09:27:44 +00:00
Maxim Valyanskiy
9269c0850c hwpf: ignore incorrect ListFormatOverride reference that causes IndexOutOfBoundException
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1064110 13f79535-47bb-0310-9956-ffa450edef68
2011-01-27 12:42:12 +00:00
Nick Burch
89df31493c Start on TNEF RTF attribute decompression, but not quite finished yet
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058555 13f79535-47bb-0310-9956-ffa450edef68
2011-01-13 13:53:02 +00:00
Nick Burch
9d2b9caa33 Add support for custom MAPI Properties (0x8000 and above, plus unknown lower ones)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058262 13f79535-47bb-0310-9956-ffa450edef68
2011-01-12 18:14:49 +00:00
Nick Burch
1d5f5c311a HMEF dumping and properties tweaks
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058243 13f79535-47bb-0310-9956-ffa450edef68
2011-01-12 17:22:40 +00:00
Nick Burch
fd16797edd Start to decode the MAPI Properties in the TNEF stream for HMEF
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058226 13f79535-47bb-0310-9956-ffa450edef68
2011-01-12 16:45:02 +00:00
Maxim Valyanskiy
5f669d2a4d hwpf: fix ArrayIndexOutOfBoundsException in Range.getCharacterRun() (TIKA-577)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058176 13f79535-47bb-0310-9956-ffa450edef68
2011-01-12 15:16:50 +00:00
Nick Burch
4358a18407 MAPIProperty is a better fit name for HSMF, so change the name before it gets too widely used
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058116 13f79535-47bb-0310-9956-ffa450edef68
2011-01-12 12:45:51 +00:00
Nick Burch
112b2b584e Copy the MAPI type information over completely to HSMF
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1058114 13f79535-47bb-0310-9956-ffa450edef68
2011-01-12 12:43:25 +00:00
Nick Burch
babc3f173e More converting of HSMF code to use strongly type attributes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1057712 13f79535-47bb-0310-9956-ffa450edef68
2011-01-11 16:11:23 +00:00
Nick Burch
5fa971d0bf Add a HSMF dev tool for listing the chunks with their types and names
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1057708 13f79535-47bb-0310-9956-ffa450edef68
2011-01-11 16:01:42 +00:00
Nick Burch
d9197ba3a4 Start converting HSMF code to use the new strongly type MAPIAttribute class for lookups, and add a dev class for listing the defined attributes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1057698 13f79535-47bb-0310-9956-ffa450edef68
2011-01-11 15:35:29 +00:00
Nick Burch
7978095929 Create a new HSMF MAPI Attributes class to hold the attribute list in a strongly typed way, based on the MSDN published properties list
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1057665 13f79535-47bb-0310-9956-ffa450edef68
2011-01-11 14:42:40 +00:00
Nick Burch
2d583f4e3f Tweak HSLF and HWPF to work well with NPOIFS, and add unit tests for this
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1054191 13f79535-47bb-0310-9956-ffa450edef68
2011-01-01 05:42:36 +00:00
Nick Burch
f2b541e026 Since a DirectoryNode has a reference to the underlying POIFSFileSystem, tidy up the POIDocument constructor to not need both passing in
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1053521 13f79535-47bb-0310-9956-ffa450edef68
2010-12-29 03:19:46 +00:00
Nick Burch
c3c4e4f2f7 Add initial low level code for HMEF, an implementation of TNEF (aka winmail.dat). Currently processes core attributes, but not embedded MAPI ones, and still needs to do more for extraction
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1051756 13f79535-47bb-0310-9956-ffa450edef68
2010-12-22 05:04:19 +00:00
Nick Burch
d2583d83d5 Refactor the common LZW decompression code out into utils
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1051377 13f79535-47bb-0310-9956-ffa450edef68
2010-12-21 05:18:34 +00:00
Nick Burch
59130c6193 Add a couple more chunk types
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1051006 13f79535-47bb-0310-9956-ffa450edef68
2010-12-20 07:18:31 +00:00
Nick Burch
cbf86ed0bc Improve the HDGF LZW code.
Some tweaks to the decompression, and more tests, but mostly work on the compression side. We can now compress small streams properly, and these round-trip fine. However, some longer streams don't compress correctly, and more work on that is still needed (see the disabled unit test)


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1049805 13f79535-47bb-0310-9956-ffa450edef68
2010-12-16 07:41:41 +00:00
Yegor Kozlov
9d56b17ace moved ptg classes to org.apache.poi.ss.formula.ptg
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1038691 13f79535-47bb-0310-9956-ffa450edef68
2010-11-24 16:54:26 +00:00
Maxim Valyanskiy
1c64221e5a hwpf: Ignore null argument in PicturesTable.hasPicture
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1037670 13f79535-47bb-0310-9956-ffa450edef68
2010-11-22 11:10:01 +00:00
Maxim Valyanskiy
381b6641b6 hwpf: IndexOutOfBoundsException in Range.findRange when list contains only nulls
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1036850 13f79535-47bb-0310-9956-ffa450edef68
2010-11-19 14:09:13 +00:00
Nick Burch
e0ec632d69 Slightly expand tests, inspired by bug #50060, but ultimately show there is a deeper issue outstanding
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1026399 13f79535-47bb-0310-9956-ffa450edef68
2010-10-22 16:41:02 +00:00
Nick Burch
d2d0c131c0 Fix more HSLF generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1024420 13f79535-47bb-0310-9956-ffa450edef68
2010-10-19 21:25:41 +00:00
Nick Burch
392a97fef4 Fix more HSLF generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1024411 13f79535-47bb-0310-9956-ffa450edef68
2010-10-19 21:01:54 +00:00
Nick Burch
2ece9ad04b Fix more HSLF generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1024390 13f79535-47bb-0310-9956-ffa450edef68
2010-10-19 20:12:19 +00:00
Nick Burch
a954289390 Fix some HSLF generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1024368 13f79535-47bb-0310-9956-ffa450edef68
2010-10-19 19:22:24 +00:00
Nick Burch
40c968c111 Add a couple of methods to help make debugging problems in hwpf easier
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1024302 13f79535-47bb-0310-9956-ffa450edef68
2010-10-19 16:05:16 +00:00
Yegor Kozlov
1715a0deaf removed svn:executable from project files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1022458 13f79535-47bb-0310-9956-ffa450edef68
2010-10-14 10:34:59 +00:00
Yegor Kozlov
76470fed4e avoid NPE in ListLevel.getNumberText() when numberText is null, see Bugzilla 50075
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1022456 13f79535-47bb-0310-9956-ffa450edef68
2010-10-14 10:30:29 +00:00
Yegor Kozlov
19be0174ee fixed unmappable characters for ASCII
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1006161 13f79535-47bb-0310-9956-ffa450edef68
2010-10-09 13:47:53 +00:00
Yegor Kozlov
d6307ad1aa support for BorderCode in HWPF, see Bugzilla 49919
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1005447 13f79535-47bb-0310-9956-ffa450edef68
2010-10-07 13:55:46 +00:00
Yegor Kozlov
b53e0ba59c support for processing of symbols in HWPF, see Bugzilla 49908
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1005443 13f79535-47bb-0310-9956-ffa450edef68
2010-10-07 13:40:58 +00:00
Yegor Kozlov
adefc926c1 cleaned javadoc warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1004934 13f79535-47bb-0310-9956-ffa450edef68
2010-10-06 08:55:30 +00:00
Nick Burch
a018a145bf Test that shows we handle word floating and fixed pictures properly
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1002123 13f79535-47bb-0310-9956-ffa450edef68
2010-09-28 11:46:22 +00:00
Maxim Valyanskiy
6d03a80070 revert previous commit because it causes problems with other old .docs
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1001726 13f79535-47bb-0310-9956-ffa450edef68
2010-09-27 13:08:57 +00:00
Maxim Valyanskiy
286b682cdc hwpt: add more 2 bytes to OldSectionTable to solve ArrayIndexOutOfBoundsException
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1001712 13f79535-47bb-0310-9956-ffa450edef68
2010-09-27 12:50:36 +00:00
Nick Burch
cc419a3a85 Work inspired by bug #48018 - get HWPF lists more consistent in read vs write, and preserve order as apparently that matters. Includes a fair number of list related unit tests, but not for everything
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998943 13f79535-47bb-0310-9956-ffa450edef68
2010-09-20 14:26:49 +00:00
Nick Burch
ca0fc0ec1e Apply patch from bug #49820 - Fix HWPF paragraph levels, so that outline levels can be properly fetched
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998897 13f79535-47bb-0310-9956-ffa450edef68
2010-09-20 11:45:53 +00:00
Nick Burch
50b44decce Add test that shows that bug #45877 has already been fixed
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998634 13f79535-47bb-0310-9956-ffa450edef68
2010-09-19 11:52:20 +00:00
Nick Burch
1e2b747f8f Hopefully fix bug #47271 - Avoid infinite loops on broken HWPF documents with a corrupt CHP style with a parent of itself
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998625 13f79535-47bb-0310-9956-ffa450edef68
2010-09-19 10:16:06 +00:00
Nick Burch
8043f38bd7 More fixes for bug #49933, workaround the fact that some word6/word95 SEPX entries are compressed differently, and we don't have the specs for how they're stored
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998621 13f79535-47bb-0310-9956-ffa450edef68
2010-09-19 09:59:10 +00:00
Nick Burch
100095bcb9 Fix bug #49936 - Handle HWPF documents with problematic HeaderStories better
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998146 13f79535-47bb-0310-9956-ffa450edef68
2010-09-17 14:14:19 +00:00
Nick Burch
c07e529c03 Fix support for sections in old word 6 / word 95 files
Improve unit testing for HWPFOldDocument
Sprm fix also improves some HWPFDocument files too!


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998131 13f79535-47bb-0310-9956-ffa450edef68
2010-09-17 13:46:11 +00:00
Nick Burch
be03b86543 Fix generics warning
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@998128 13f79535-47bb-0310-9956-ffa450edef68
2010-09-17 13:43:07 +00:00
Nick Burch
0e504bd9ca Add a few more test asserts
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@997751 13f79535-47bb-0310-9956-ffa450edef68
2010-09-16 13:45:54 +00:00
Nick Burch
9647ab8976 Add some more paragraph property documentation and tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@997413 13f79535-47bb-0310-9956-ffa450edef68
2010-09-15 17:38:50 +00:00
Nick Burch
216bcfea03 Tweak hssf and hwpf image mime types for wmf/emf to match those used by XSSF/XWPF
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995943 13f79535-47bb-0310-9956-ffa450edef68
2010-09-10 19:15:16 +00:00
Nick Burch
96b0aea310 Improve documentation of some of the HWPF picture stuff, and add unit tests for images of embeded documents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995807 13f79535-47bb-0310-9956-ffa450edef68
2010-09-10 14:37:45 +00:00
Nick Burch
2b9a63d38d Add getMimeType() method to HWPF Picture, and fix some generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995464 13f79535-47bb-0310-9956-ffa450edef68
2010-09-09 15:30:55 +00:00
Nick Burch
dac67e887e Fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@995445 13f79535-47bb-0310-9956-ffa450edef68
2010-09-09 14:28:14 +00:00
Yegor Kozlov
7ddd3d1715 added a getter for _iStartAt in ListFormatOverrideLevel
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@992611 13f79535-47bb-0310-9956-ffa450edef68
2010-09-04 14:15:55 +00:00
Nick Burch
514e6be1fe Fix bug #47990 - Support for .msg attachments within a MAPIMessage .msg
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@982331 13f79535-47bb-0310-9956-ffa450edef68
2010-08-04 17:08:39 +00:00
Maxim Valyanskiy
e52778847c hwpf: append any character data before paragraphs to first paragraph
(workaround for bug#48075)


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@982238 13f79535-47bb-0310-9956-ffa450edef68
2010-08-04 12:43:58 +00:00
Nick Burch
ce0fea767f Fix bug #49441 - Allow overriding and guessing of HSMF non-unicode string encodings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@981947 13f79535-47bb-0310-9956-ffa450edef68
2010-08-03 16:06:21 +00:00
Maxim Valyanskiy
c21d04a68b wpf: More CharactedRun mapping fixes for fast-saved documents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@979618 13f79535-47bb-0310-9956-ffa450edef68
2010-07-27 09:38:54 +00:00
Maxim Valyanskiy
8fab18e37a More fixes for auto-saved documents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@979286 13f79535-47bb-0310-9956-ffa450edef68
2010-07-26 14:04:27 +00:00
Nick Burch
bea0801c24 Patch from Jens Gotze from bug #49581 - Ability to add, modify and remove series from HSSF Charts
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@964855 13f79535-47bb-0310-9956-ffa450edef68
2010-07-16 16:46:42 +00:00
Maxim Valyanskiy
8c763967ab hwpf: Range.findRange fix for out-of-range searches
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@962520 13f79535-47bb-0310-9956-ffa450edef68
2010-07-09 13:30:43 +00:00
Maxim Valyanskiy
7aba97ba6e hwpf.usermodel.Range: use generics & remove redundant type casts
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@961347 13f79535-47bb-0310-9956-ffa450edef68
2010-07-07 10:57:14 +00:00
Maxim Valyanskiy
de525d07cd HWPFDocument: minor cleanup
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@961343 13f79535-47bb-0310-9956-ffa450edef68
2010-07-07 10:42:23 +00:00
Maxim Valyanskiy
8661c09c35 HWPF: test auto-saved document
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@961342 13f79535-47bb-0310-9956-ffa450edef68
2010-07-07 10:41:17 +00:00
Maxim Valyanskiy
22730f9a12 HWPF: better fix for TextPieceTable.getCharIndex()
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960922 13f79535-47bb-0310-9956-ffa450edef68
2010-07-06 15:45:36 +00:00
Maxim Valyanskiy
78b0c18ade HWPF: Improve reading of auto-saved ("complex") document
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960587 13f79535-47bb-0310-9956-ffa450edef68
2010-07-05 12:56:02 +00:00
Nick Burch
256e73d16d More Word 6 / Word 95 Support
HWPFOldDocument now processes a few more table sections, and so we can fake up some
 basic Ranges. This allows us to do paragraph level text extraction


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960102 13f79535-47bb-0310-9956-ffa450edef68
2010-07-02 20:59:30 +00:00
Nick Burch
01ec911b74 Fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960094 13f79535-47bb-0310-9956-ffa450edef68
2010-07-02 20:01:42 +00:00
Nick Burch
6ee6d9095f Enable Word6Extractor in ExtractorFactory
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@959360 13f79535-47bb-0310-9956-ffa450edef68
2010-06-30 16:08:10 +00:00
Nick Burch
30848a80aa Basic text extraction support for old Word 6 and Word 95 documents via some HWPF extensions
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@959346 13f79535-47bb-0310-9956-ffa450edef68
2010-06-30 15:13:10 +00:00
Nick Burch
0910eb1ab5 Fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@959335 13f79535-47bb-0310-9956-ffa450edef68
2010-06-30 14:41:03 +00:00
Nick Burch
ad33151624 Better handling of Outlook messages in HSMF when there's no recipient email address
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@954476 13f79535-47bb-0310-9956-ffa450edef68
2010-06-14 13:47:22 +00:00
Nick Burch
05ddf6a51e Fix for bug #48245 - tweak HWPF table cell detection to work across more files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@953694 13f79535-47bb-0310-9956-ffa450edef68
2010-06-11 13:29:44 +00:00
Nick Burch
bf4e6ff464 Add additional RevisionMarkAuthorTable test
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@953343 13f79535-47bb-0310-9956-ffa450edef68
2010-06-10 15:02:05 +00:00
Yegor Kozlov
8c4341facf cleaned javadoc warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951920 13f79535-47bb-0310-9956-ffa450edef68
2010-06-06 18:19:08 +00:00
Nick Burch
d29d1d7d9b Apply with tweaks the patch from bug #45269 - improve replaceText on HWPF ranges
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951498 13f79535-47bb-0310-9956-ffa450edef68
2010-06-04 17:19:31 +00:00
Nick Burch
45c4b6bf8f Tweak @link reference to avoid compiler issues
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951055 13f79535-47bb-0310-9956-ffa450edef68
2010-06-03 16:23:40 +00:00
Nick Burch
f9fa636e6d Remove un-used imports
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951053 13f79535-47bb-0310-9956-ffa450edef68
2010-06-03 16:21:41 +00:00
Nick Burch
65d7431a9f Parse the HSMF headers chunk if present, and use it to find Dates in text extraction if needed
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@951034 13f79535-47bb-0310-9956-ffa450edef68
2010-06-03 15:33:54 +00:00
Nick Burch
cee16bc83b List attachment names in the output of OutlookTextExtractor (to get attachment contents, use ExtractorFactory as normal)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@950595 13f79535-47bb-0310-9956-ffa450edef68
2010-06-02 15:24:11 +00:00
Yegor Kozlov
6ee427ddf9 fixed construction of the DIB picture header, see Bugzilla 43161
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@949483 13f79535-47bb-0310-9956-ffa450edef68
2010-05-30 06:56:32 +00:00
Yegor Kozlov
55c924c5d2 removed deprecation warnings to keep javac quiet
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@949434 13f79535-47bb-0310-9956-ffa450edef68
2010-05-29 18:31:04 +00:00
Nick Burch
0df94e6be8 Apply patch from bug #48924 - Allow access of the HWPF DateAndTime underlying date values
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@948455 13f79535-47bb-0310-9956-ffa450edef68
2010-05-26 14:40:25 +00:00
Nick Burch
6666c539da Add a simple testcase for the new RevisionMarkAuthorTable.java
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@948445 13f79535-47bb-0310-9956-ffa450edef68
2010-05-26 14:22:49 +00:00
Nick Burch
9798e24fd2 Apply patch from bug #48926 - Initial support for the HWPF revision marks authors list
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@948435 13f79535-47bb-0310-9956-ffa450edef68
2010-05-26 14:17:15 +00:00
Nick Burch
4c1d86e5de Apply patches from Peter Kutak from bugs 49334 and 49242 - HSSFChart improvements by tracking more records
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@948080 13f79535-47bb-0310-9956-ffa450edef68
2010-05-25 15:59:26 +00:00
Nick Burch
a3899a57d8 Resolve bug #49139 - don't assume that the block size is always 512 bytes. Instead of hard coding this value in, pass around the new POIFSBigBlockSize object that holds the size and various helper subsizes. Should now be possible to open 4k block files without error.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@937834 13f79535-47bb-0310-9956-ffa450edef68
2010-04-25 17:35:56 +00:00
Yegor Kozlov
fe048df54e Fixed locale-sensitive formatters in PackagePropertiesPart, see Bugzilla 49138
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@935896 13f79535-47bb-0310-9956-ffa450edef68
2010-04-20 12:57:27 +00:00
Nick Burch
f1371227be Remove old .cvsignore files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@933963 13f79535-47bb-0310-9956-ffa450edef68
2010-04-14 14:11:13 +00:00
Maxim Valyanskiy
fc53ead4ca bugfix: ClassCastException it PicturesTable.getAllPictures():
UnknownEscherRecord cannot be cast to EscherBlipRecord


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@931111 13f79535-47bb-0310-9956-ffa450edef68
2010-04-06 12:12:03 +00:00
Yegor Kozlov
639bf94c6f propagate parent to parent-aware records decoded from Escher, also ensure that TextShape and EscherTextboxWrapper hold the same cached sets of records, see Bugzilla 48916
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@930525 13f79535-47bb-0310-9956-ffa450edef68
2010-04-03 14:44:39 +00:00
Nick Burch
918f1a496d Fix an issue with the HSMF tests working on some machines but not others - Make poifs.filesystem.DirectoryNode preserve the original ordering of its files, which HSMF needs to be able
to correctly match up chunks

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@911878 13f79535-47bb-0310-9956-ffa450edef68
2010-02-19 17:55:32 +00:00
Nick Burch
943d3d19e1 Add a disabled test for bug #44501, which still remains, plus fix a generics warning
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904062 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:28:29 +00:00
Nick Burch
9bbf3ef4d0 Fix generics warnings, and fix up tests to handle the extra bit of text being extracted now
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904060 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:20:32 +00:00
Nick Burch
3aef368b71 Apply patch from Jukka from bug #43670 to improve HDGF v11 Separator detection, and handle short strings better, hopefully solving the Negative length of ChunkHeader issue
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904052 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:05:13 +00:00
Nick Burch
2880d934f9 Improve error message, and fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@904049 13f79535-47bb-0310-9956-ffa450edef68
2010-01-28 12:00:38 +00:00
Maxim Valyanskiy
ed3cae95f8 PowerPoint OLEShape: extract last version of embedded ole object
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@901215 13f79535-47bb-0310-9956-ffa450edef68
2010-01-20 14:33:58 +00:00
Nick Burch
545f2e1119 Improved how HSMF handles multiple recipients
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@898295 13f79535-47bb-0310-9956-ffa450edef68
2010-01-12 12:02:18 +00:00
Nick Burch
6e97a360a3 Add PublisherTextExtractor support to ExtractorFactory
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897887 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 14:55:43 +00:00
Nick Burch
5621bb0800 Make it possible to return null on missing chunks, rather than the exception
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897847 13f79535-47bb-0310-9956-ffa450edef68
2010-01-11 12:19:42 +00:00
Nick Burch
5ad8301c2a Add embeded (attachment) support to the outlook text extractor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897258 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:44:08 +00:00
Nick Burch
98cea49eb5 Rename the outlook extractor to be more consistent with other extractors
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897249 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:18:52 +00:00
Nick Burch
cefe4e1d28 Wire up the new HSMFTextExtactor to the ExtractorFactory
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897246 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:14:27 +00:00
Nick Burch
bd2f63c721 Add a text extractor to HSMF for simpler extraction of text from .msg files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897242 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:04:15 +00:00
Nick Burch
a6e7575999 Fix generics warnings
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897239 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 16:02:35 +00:00
Nick Burch
7ae86fab09 More work on the recipient related chunks, including a helper method to do best-effort finding of the recipients email address
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897213 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 14:26:27 +00:00
Nick Burch
52695c1811 Quick bit of refactoring to save parsing the type and id twice
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897205 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:49:09 +00:00
Nick Burch
ff94e5c61b Support fetching the message date from the submission id
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897201 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 13:42:53 +00:00
Nick Burch
58806414fc Tweak a few tests, and add in a few more chunk types
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897185 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 12:08:35 +00:00
Nick Burch
795ed3ce55 Complete chunk parser tests, and make more chunk groups available
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897172 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:37:37 +00:00
Nick Burch
0e368a23da Fix some chunk types, fix the directory descent, fix the Msg2txt example, and start on fixing core tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@897167 13f79535-47bb-0310-9956-ffa450edef68
2010-01-08 11:14:58 +00:00
Nick Burch
6afb781730 Shuffle where some of the HSMF tests live to better match package names, and stub out a few more tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896923 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 16:47:09 +00:00
Nick Burch
2bb376f55b Start on major HSMF refactoring. Should compile, but not quite all tests pass as a little bit of work is left
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896914 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 16:15:20 +00:00
Nick Burch
e5884f2f66 Add a couple more HSMF chunk types, and use Generics in a few places
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@896868 13f79535-47bb-0310-9956-ffa450edef68
2010-01-07 12:56:39 +00:00
Yegor Kozlov
a64917fe88 removed svn:eol-style=native from binary files, uncommented related tests; see Bugzilla 48427
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@894173 13f79535-47bb-0310-9956-ffa450edef68
2009-12-28 13:45:10 +00:00
Yegor Kozlov
0b9ab8930e included poi-ooxml-schemas and poi-examples in the release cycle, also misc improvements in build.xml: merged scratchpad and main examples, improved cross-module classpath dependencies, removed obsolete targets
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@882927 13f79535-47bb-0310-9956-ffa450edef68
2009-11-21 15:30:54 +00:00
Yegor Kozlov
e8e2b1ff49 support for text extraction from PPT master slides, see Bugzilla 48161
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@835271 13f79535-47bb-0310-9956-ffa450edef68
2009-11-12 07:07:41 +00:00
Nick Burch
a9d6f34732 Patch from Trejkaz - fix bug #48038 - handle reading HWPF stylesheets from non zero offsets
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@832625 13f79535-47bb-0310-9956-ffa450edef68
2009-11-04 00:07:37 +00:00
Josh Micich
ed1b4794a0 Bugzilla 47962 - Fixed some potential NPEs. Avoided unnecessary creation of box instances. Applied patch with mods
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@823348 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 22:29:41 +00:00
Yegor Kozlov
2ad9b00e9f added Ant target to compile scratchpad examples, added example HSMF application that converts a .msg file to text and extracts attachments, see Bugzilla 47922
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@821500 13f79535-47bb-0310-9956-ffa450edef68
2009-10-04 10:25:53 +00:00
Yegor Kozlov
bd966f208b removed unnecessary svn:executable flag from files in SVN trunk, see bugzilla 47734
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@814335 13f79535-47bb-0310-9956-ffa450edef68
2009-09-13 16:05:28 +00:00
Yegor Kozlov
4841ed0d47 Fixed PPT parser to tolerate Comment2000 containers with missing comment text, see Bugzilla 44770
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@811814 13f79535-47bb-0310-9956-ffa450edef68
2009-09-06 11:35:01 +00:00
Yegor Kozlov
f4f823b94c Support for extraction of header / footer images in HWPF, see Bugzilla 47733
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@809659 13f79535-47bb-0310-9956-ffa450edef68
2009-08-31 16:57:03 +00:00
Yegor Kozlov
728f88bd92 moved all test data to a top-level directory, refactored all junit usages of .testdata.path to one place
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@806959 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 12:34:24 +00:00
Josh Micich
9ca061617c fixing compiler warnings - unused imports, declared exceptions not thrown
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@806789 13f79535-47bb-0310-9956-ffa450edef68
2009-08-22 05:50:32 +00:00
Yegor Kozlov
7e2a5dacee Avoid exception when reading ClipboardData packet in OLE property sets, see bugzilla 45583
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@806172 13f79535-47bb-0310-9956-ffa450edef68
2009-08-20 13:49:33 +00:00
Yegor Kozlov
3316b64495 Centralize logic for finding/opening sample files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@805928 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 18:51:44 +00:00