Commit Graph

148 Commits

Author SHA1 Message Date
Dominik Stadler
9eb8fedf5d Fix bug 61787, which was introduced by bug 58067: Change how deleted content is detected to not incorrectly see too much text as deleted.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1819405 13f79535-47bb-0310-9956-ffa450edef68
2017-12-28 08:45:51 +00:00
Dominik Stadler
eb8fac4f77 Bug 61911: Avoid IndexOutOfBounds access when reading pictures
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1819403 13f79535-47bb-0310-9956-ffa450edef68
2017-12-28 08:45:36 +00:00
Javen O'Neal
a1b8f3ff14 bug 61745: add support for charts in XWPF/docx. Thanks to Sandeep Tiwari for the patch
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1815047 13f79535-47bb-0310-9956-ffa450edef68
2017-11-12 21:48:41 +00:00
Dominik Stadler
c2d09c0217 Bug 61586: Add test which shows that it works as expected
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1811355 13f79535-47bb-0310-9956-ffa450edef68
2017-10-06 17:41:20 +00:00
Tim Allison
4852228f3f 60279 -- back off to brute-force search for macro content if offset information is not correct
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1808301 13f79535-47bb-0310-9956-ffa450edef68
2017-09-14 02:22:55 +00:00
Tim Allison
3b32a00aa7 61470 -- add extraction of content within ruby elements; allow users to concatenate or not concatenate phonetic strings. Default is to concatenate.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1806712 13f79535-47bb-0310-9956-ffa450edef68
2017-08-30 16:29:52 +00:00
PJ Fanning
9f28e36438 [Bug-61354] fix issue with extracting text from Word docs. This closes #66
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1803250 13f79535-47bb-0310-9956-ffa450edef68
2017-07-28 07:42:23 +00:00
PJ Fanning
e046241c30 [Bug-61268] avoid NegativeArraySizeException parsing doc file
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1801395 13f79535-47bb-0310-9956-ffa450edef68
2017-07-09 22:32:27 +00:00
Nick Burch
05c7c16308 #61266 Test for old unsupported MS Write WRI files, and give a more helpful exception if found, plus unit tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1801376 13f79535-47bb-0310-9956-ffa450edef68
2017-07-09 16:26:33 +00:00
Dominik Stadler
938bb91c51 Bug 60374: Add test for ArrayIndexOutOfBounds on old Word Documents due to Non-Unicode
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1798200 13f79535-47bb-0310-9956-ffa450edef68
2017-06-09 12:47:03 +00:00
Andreas Beeker
2b4f944883 #61162 - En-/decryption support for HWPF
Decryption for Binary RC4 and CryptoAPI (... XOR is missing)

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1797837 13f79535-47bb-0310-9956-ffa450edef68
2017-06-06 22:21:11 +00:00
Tim Allison
af51ea4c10 bug 50955 -- word 6.0 charset fix
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790061 13f79535-47bb-0310-9956-ffa450edef68
2017-04-04 02:06:46 +00:00
Dominik Stadler
531226ac84 Add docm-file which is actually a binary doc to check handling of wrong-format-exceptions
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1787555 13f79535-47bb-0310-9956-ffa450edef68
2017-03-18 13:56:04 +00:00
Dominik Stadler
f431473f29 Revert previous changes enough to restore the case in the commoncrawl-corpus which is failing now, add a unit-test for this case, enable one disabled unit-test
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1781822 13f79535-47bb-0310-9956-ffa450edef68
2017-02-05 22:34:40 +00:00
Dominik Stadler
41e52ffea7 Finally enable test for bug 44937; we got the requested missing test-documents a long time ago already.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1780456 13f79535-47bb-0310-9956-ffa450edef68
2017-01-26 19:59:05 +00:00
Tim Allison
f6388c2fff Bug 60570 - Add rudimentary EMF read-only capability
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1779493 13f79535-47bb-0310-9956-ffa450edef68
2017-01-19 16:22:29 +00:00
Mark Murphy
07893ec6a0 60329: Avoid NPE when styleid is null
Task-Url: https://bz.apache.org/bugzilla/show_bug.cgi?id=60329

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1772138 13f79535-47bb-0310-9956-ffa450edef68
2016-12-01 02:21:56 +00:00
Tim Allison
0232c71e33 POI-60341, add test document (ugh, mea culpa), turn on test.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1768505 13f79535-47bb-0310-9956-ffa450edef68
2016-11-07 12:10:46 +00:00
Mark Murphy
70724609b1 60337: XWPFTableRow.isRepeatHeader throws NullPointerException, setRepeatHeader does not overwrite old value
Task-Url: https://bz.apache.org/bugzilla/show_bug.cgi?id=60337


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1768153 13f79535-47bb-0310-9956-ffa450edef68
2016-11-05 06:12:24 +00:00
Nick Burch
6f5a44c139 More unit testing for XWPF list numbering complex cases, and some TODOs on improving it, inspired by users@ discussions
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1768005 13f79535-47bb-0310-9956-ffa450edef68
2016-11-04 10:55:31 +00:00
Tim Allison
2633894b39 60293 -- Handle illegal "Odd" header/footer in XWPF
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1767353 13f79535-47bb-0310-9956-ffa450edef68
2016-10-31 19:02:06 +00:00
Javen O'Neal
12ca2205b0 bug 57843: add failing unit test: Word 6.0 (1993) fails with ArrayIndexOutOfBoundsException
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1761873 13f79535-47bb-0310-9956-ffa450edef68
2016-09-22 09:24:48 +00:00
Javen O'Neal
3adf725ba4 bug 60158: add failing test cases for AIOOBE on VBAMacroReader
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1761652 13f79535-47bb-0310-9956-ffa450edef68
2016-09-21 01:03:12 +00:00
Dominik Stadler
a118ec77d4 Add commented reproducer for bug 57603
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1753120 13f79535-47bb-0310-9956-ffa450edef68
2016-07-17 21:17:45 +00:00
Dominik Stadler
6b1caa4e2c Bug 59739: For now fix the regression in FileInformationBlock which was introduced after 3.15-beta1 so that the documents can be loaded again pending a full fix as discussed in the bug.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1750864 13f79535-47bb-0310-9956-ffa450edef68
2016-06-30 21:06:04 +00:00
Dominik Stadler
6b6aa56455 Avoid NPE in XWPFTableCell, taken from https://github.com/prasad-babu/poi/tree/WORKING_BRANCH
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1746625 13f79535-47bb-0310-9956-ffa450edef68
2016-06-02 20:09:44 +00:00
Dominik Stadler
aee2959a64 Bug 59378: Try to reproduce, but could not
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1746623 13f79535-47bb-0310-9956-ffa450edef68
2016-06-02 20:09:25 +00:00
Javen O'Neal
0d40050e20 bug 52949: add Word, PowerPoint, and Visio (HDGF) files with macros to test macro extraction
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1738502 13f79535-47bb-0310-9956-ffa450edef68
2016-04-11 03:02:18 +00:00
Dominik Stadler
992adb9c16 Revert set of inadvertently committed files/changes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1737029 13f79535-47bb-0310-9956-ffa450edef68
2016-03-29 18:11:44 +00:00
Dominik Stadler
f5332a3a40 Bug 58915: Add note about memory consumption and update changelog
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1737025 13f79535-47bb-0310-9956-ffa450edef68
2016-03-29 17:30:55 +00:00
Andreas Beeker
9f47700711 #59058 - OOM when parsing docx after OPCPackage.open with File but not with InputStream (TIKA-1866)
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1734184 13f79535-47bb-0310-9956-ffa450edef68
2016-03-09 01:25:02 +00:00
Tim Allison
bc716a2796 POI 59030 fix NPE in XWPFTableCell's getVerticalAlignment via Prasad Babu
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1731257 13f79535-47bb-0310-9956-ffa450edef68
2016-02-19 15:46:26 +00:00
Dominik Stadler
677b19afbd GitHub PR 27: Add method to check for any protection in XWPFDocument, closes #27
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1730471 13f79535-47bb-0310-9956-ffa450edef68
2016-02-15 09:26:51 +00:00
Dominik Stadler
f94bb63332 add an ignored test for bug 58804
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1726681 13f79535-47bb-0310-9956-ffa450edef68
2016-01-25 20:21:30 +00:00
Dominik Stadler
ff8592b190 Fix bug 58067: XWPF: don't return deleted text when document is in review-mode
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1722715 13f79535-47bb-0310-9956-ffa450edef68
2016-01-03 13:28:01 +00:00
Dominik Stadler
4803a0c355 bug 58618: XWPFParagraph insertNewRun and removeRun work incorrectly for runs after hyperlink/field runs

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1722503 13f79535-47bb-0310-9956-ffa450edef68
2016-01-01 16:28:01 +00:00
Tim Allison
b8f9c0cad1 POI-57889 -- actually trigger inclusion of CTNumLvl with document contributed by Moritz Dorka on TIKA-1315
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1682310 13f79535-47bb-0310-9956-ffa450edef68
2015-05-28 19:08:24 +00:00
Tim Allison
8bfc6056c8 POI-57889 prevent NPE on some documents with XWPFParagraph's getNumFmt() and add some other classes to enable calculation of paragraph numbers
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1677723 13f79535-47bb-0310-9956-ffa450edef68
2015-05-05 01:39:16 +00:00
Dominik Stadler
58145060c2 Fix possible ArrayIndexOutOfBoundsException seen with some word documents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1674954 13f79535-47bb-0310-9956-ffa450edef68
2015-04-20 18:16:30 +00:00
Dominik Stadler
7c9ead8ad7 Add test-document from bug 53379 to verify in integration tests that text extraction does not fail any more
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1668484 13f79535-47bb-0310-9956-ffa450edef68
2015-03-22 21:48:07 +00:00
Dominik Stadler
ca9f22c289 Integration tests: Expect exception for old word documents and still run the text extraction for them. Also execute HPSFPropertiesExtractor where possible
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1668483 13f79535-47bb-0310-9956-ffa450edef68
2015-03-22 21:47:44 +00:00
Dominik Stadler
e6d6cbcbd1 Add test-document to verify that bug 51921 is fixed already
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1668482 13f79535-47bb-0310-9956-ffa450edef68
2015-03-22 21:47:19 +00:00
Dominik Stadler
f9961331ff Bug 47304: use fixed encoding when extracting text in WordDocument
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1668367 13f79535-47bb-0310-9956-ffa450edef68
2015-03-22 13:33:43 +00:00
Dominik Stadler
97ce24a440 Verify that document from bug 56392 works
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1666003 13f79535-47bb-0310-9956-ffa450edef68
2015-03-11 20:39:18 +00:00
Nick Burch
b6a88f09c8 Patch from rojotek from github-18 - Handle documents with a picture-only header
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1661908 13f79535-47bb-0310-9956-ffa450edef68
2015-02-24 12:09:30 +00:00
Dominik Stadler
5a20499263 Bug 53453: Apply patch to add methods to set margins in sections of HWPF documents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1649176 13f79535-47bb-0310-9956-ffa450edef68
2015-01-03 09:34:07 +00:00
Dominik Stadler
d909e0998c Add verifying test-case for bug 52420
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1649147 13f79535-47bb-0310-9956-ffa450edef68
2015-01-02 22:38:52 +00:00
Dominik Stadler
fb9295ffb9 Bug 57312: Add check for null value of underline w:val
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1646729 13f79535-47bb-0310-9956-ffa450edef68
2014-12-19 14:29:50 +00:00
Dominik Stadler
5f4a2a3b44 github-7 - Form check box extraction with XWPFWordExtractor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1636990 13f79535-47bb-0310-9956-ffa450edef68
2014-11-05 22:26:00 +00:00
Nick Burch
a6978501b4 Test file from Jan Vanhoecke for bug #56880 - Non-extended character Pascal strings are not supported
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1630541 13f79535-47bb-0310-9956-ffa450edef68
2014-10-09 17:53:54 +00:00