poi/test-data/document
2016-12-01 02:21:56 +00:00
45690.docm
47304.doc Bug 47304: use fixed encoding when extracting text in WordDocument 2015-03-22 13:33:43 +00:00
51921-Word-Crash067.doc Add test-document to verify that bug 51921 is fixed already 2015-03-22 21:47:19 +00:00
51921-Word-Crash067.docx Add test-document to verify that bug 51921 is fixed already 2015-03-22 21:47:19 +00:00
52117.doc Integration tests: Expect exception for old word documents and still run the text extraction for them. Also add executing HPSFPropertiesExtractor where possible 2015-03-22 21:47:44 +00:00
52288.docx Test that shows that bug #52288 is already fixed 2011-12-06 03:33:11 +00:00
52420.doc Add verifying test-case for bug 52420 2015-01-02 22:38:52 +00:00
52449.docx Fix bug #52449 - Support writing XWPF documents with glossaries (plus fix some indenting) 2012-01-11 14:02:40 +00:00
53379.doc Add test-document from bug 53379 to verify in integration tests that text extraction does not fail any more 2015-03-22 21:48:07 +00:00
53446.doc Bugzilla 53446 - Fixed some problems extracting PNGs 2012-08-04 05:30:19 +00:00
55733.docx Fix bug #55733 - XWPFWordExtractor need 2013-11-01 19:43:46 +00:00
56392.docx Verify that document from bug 56392 works 2015-03-11 20:39:18 +00:00
56880.doc Test file from Jan Vanhoecke for bug #56880 - Non-extended character Pascal strings are not supported 2014-10-09 17:53:54 +00:00
57312.docx Bug 57312: Add check for null value of underline w:val 2014-12-19 14:29:50 +00:00
57603-seven_columns.doc Add commented reproducer for bug 57603 2016-07-17 21:17:45 +00:00
57843.doc bug 57843: add failing unit test: Word 6.0 (1993) fails with ArrayIndexOutOfBoundsException 2016-09-22 09:24:48 +00:00
58067.docx Fix bug 58067: XWPF: don't return deleted text when document is in review-mode 2016-01-03 13:28:01 +00:00
58618.docx bug 58618: XWPFParagraph insertNewRun and removeRun work incorrectly for 2016-01-01 16:28:01 +00:00
58804_1.doc add an ignored test for bug 58804 2016-01-25 20:21:30 +00:00
58804.doc add an ignored test for bug 58804 2016-01-25 20:21:30 +00:00
59030.docx POI 59030 fix NPE in XWPFTableCell's getVerticalAlignment via Prasad Babu 2016-02-19 15:46:26 +00:00
59378.docx Bug 59378: Try to reproduce, but could not 2016-06-02 20:09:25 +00:00
60158.docm bug 60158: add failing test cases for AIOOBE on VBAMacroReader 2016-09-21 01:03:12 +00:00
60293.docx 60293 -- Handle illegal "Odd" header/footer in XWPF 2016-10-31 19:02:06 +00:00
60329.docx 60329: Avoid NPE when styleid is null 2016-12-01 02:21:56 +00:00
abstract1.jpg removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
abstract2.jpg removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
abstract3.jpg removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
abstract4.jpg removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
AIOOB-Tap.doc
au.edu.utas.www___data_assets_word_doc_0003_154335_International-Travel-Approval-Request-Form.doc Fix possible ArrayIndexOutOfBoundsException seen with some word documents 2015-04-20 18:16:30 +00:00
bookmarks.docx
bug53475-password-is-pass.docx Another test file from Andreas Beeker from bug #53475 2013-11-12 11:18:59 +00:00
bug53475-password-is-solrcell.docx Patch from Andreas Beeker from bug #53475 - CSPName may not always be present on OOXML encrypted documents, plus test 2013-11-07 21:29:14 +00:00
bug56075-changeTracking_off.docx Bug 56075 - Add Change Tracking support to XWPF 2014-03-13 00:16:56 +00:00
bug56075-changeTracking_on.docx Bug 56075 - Add Change Tracking support to XWPF 2014-03-13 00:16:56 +00:00
bug56076.docx Bug 56076 - Add document protection with password support to XWPF 2014-02-21 23:19:57 +00:00
bug57031.docx #59058 - OOM when parsing docx after OPCPackage.open with File but not with InputStream (TIKA-1866) 2016-03-09 01:25:02 +00:00
bug59058.docx #59058 - OOM when parsing docx after OPCPackage.open with File but not with InputStream (TIKA-1866) 2016-03-09 01:25:02 +00:00
Bug28627.doc
Bug33519.doc resolved old bugzilla issues, added unit tests 2011-06-24 08:46:37 +00:00
Bug34898.doc resolved old bugzilla issues, added unit tests 2011-06-24 08:46:37 +00:00
Bug41898.doc rename emf_2003_image.doc to Bug41898.doc and move test to TestBugs class 2011-08-09 12:50:15 +00:00
Bug44292.doc
Bug44431.doc resolved old bugzilla issues, added unit tests 2011-06-24 08:46:37 +00:00
Bug44603.doc
Bug45269.doc
Bug45473.doc resolved old bugzilla issues, added unit tests 2011-06-24 08:46:37 +00:00
Bug45877.doc Add test that shows that bug #45877 has already been fixed 2010-09-19 11:52:20 +00:00
Bug46220.doc resolved old bugzilla issues, added unit tests 2011-06-24 08:46:37 +00:00
Bug46610_1.doc
Bug46610_2.doc
Bug46610_3.doc
Bug46817.doc resolved old bugzilla issues, added unit tests 2011-06-24 08:46:37 +00:00
Bug47286.doc Test case shall not fail 2011-07-08 09:06:33 +00:00
Bug47287.doc resolved old bugzilla issues, added unit tests 2011-06-24 08:46:37 +00:00
Bug47731.doc new test case for 47731 issue 2011-07-24 18:55:57 +00:00
Bug47742-text.txt removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
Bug47742.doc resolved old bugzilla issues, added unit tests 2011-06-24 08:46:37 +00:00
Bug47958.doc resolved old bugzilla issues, added unit tests 2011-06-24 08:46:37 +00:00
Bug48065.doc already fixed 48065 - Problems with save output of HWPF (losing formatting) 2011-07-07 13:13:04 +00:00
Bug48075.doc rename MBD001D0B89.doc to Bug48075.doc 2011-07-11 18:45:46 +00:00
Bug49820.doc Apply patch from bug #49820 - Fix HWPF paragraph levels, so that outline levels can be properly fetched 2010-09-20 11:45:53 +00:00
Bug49908.doc removed svn:executable from project files 2010-10-14 10:34:59 +00:00
Bug49919.doc support for BorderCode in HWPF, see Bugzilla 49919 2010-10-07 13:55:46 +00:00
Bug49933.doc update status and .doc for issue 49933 2011-07-09 15:37:04 +00:00
Bug50075.doc avoid NPE in ListLevel.getNumberText() when numberText is null, see Bugzilla 50075 2010-10-14 10:30:29 +00:00
Bug50936_1.doc Fix 47958 - ArrayIndexOutOfBoundsException from PicturesTable.getAllPictures() during Escher tree walk 2011-10-30 00:33:44 +00:00
Bug50936_2.doc Fix 47958 - ArrayIndexOutOfBoundsException from PicturesTable.getAllPictures() during Escher tree walk 2011-10-30 00:33:44 +00:00
Bug50936_3.doc Fix 47958 - ArrayIndexOutOfBoundsException from PicturesTable.getAllPictures() during Escher tree walk 2011-10-30 00:33:44 +00:00
Bug50955.doc resolved old bugzilla issues, added unit tests 2011-06-24 08:46:37 +00:00
Bug51170.docx avoid exceptions when using POI in Tika, see Bugs 51771 and 51770 2011-09-12 10:19:50 +00:00
Bug51604.doc 51604 is fixed 2011-08-09 05:18:37 +00:00
Bug51686.doc [[51686]] rename testWORD.doc to Bug51686.doc 2011-08-22 08:38:20 +00:00
Bug51834.doc always pad properties to 4 bytes 2011-10-02 01:06:22 +00:00
Bug51890.doc picture loading completely rewritten, bugs 51902 and 51890 fixed 2011-09-30 15:49:45 +00:00
Bug51944.doc fix 51944 - PAPFormattedDiskPage.getPAPX - IndexOutOfBounds 2011-10-30 00:04:38 +00:00
Bug52032_1.doc fix additional issue found in bug 52032, add test files 2011-10-29 23:57:48 +00:00
Bug52032_2.doc fix additional issue found in bug 52032, add test files 2011-10-29 23:57:48 +00:00
Bug52032_3.doc fix additional issue found in bug 52032, add test files 2011-10-29 23:57:48 +00:00
Bug52311.doc 52311 - Conversion to html : Problem in titles number 2012-11-05 14:38:12 +00:00
Bug52583.doc Bug 52583 - Conversion to html : Problem with combobox 2012-11-06 16:26:43 +00:00
Bug53182.doc fix bug 53182 - Reading combined character styling and direct formatting of a character run 2012-11-05 15:51:41 +00:00
Bug53380_1.doc Fixed bug 53380 -- ArrayIndexOutOfBounds Exception parsing word 97 document 2012-09-11 19:49:44 +00:00
Bug53380_2.doc Fixed bug 53380 -- ArrayIndexOutOfBounds Exception parsing word 97 document 2012-09-11 19:49:44 +00:00
Bug53380_3.doc +one more test file for Bug 53380 2012-09-21 07:16:03 +00:00
Bug53380_4.doc add 4th example file from Bug 53380 2012-09-25 21:42:09 +00:00
Bug53453Section.doc Bug 53453: Apply patch to add methods to set margins in sections of HWPF documents 2015-01-03 09:34:07 +00:00
Bug54771a.docx BUG 54771 extract text from SDTs at the cell level within a table row 2014-06-16 18:46:00 +00:00
Bug54771b.docx BUG 54771 extract text from SDTs at the cell level within a table row 2014-06-16 18:46:00 +00:00
Bug54849.docx Fix the javadoc, correct the indenting, and add the new test file from bug #54849 2013-06-13 18:18:19 +00:00
Bug55142.docx Patch from Tim Allison from bug #55142 - Not all XWPF SDT block 2013-06-25 13:09:08 +00:00
Bug60337.docx 60337: XWPFTableRow.isRepeatHeader throws NullPointerException, setRepeatHeader does not overwrite old value 2016-11-05 06:12:24 +00:00
Bug60341.docx POI-60341, add test document (ugh, mea culpa), turn on test. 2016-11-07 12:10:46 +00:00
cap.stanford.edu_profiles_viewbiosketch_facultyid=4009&name=m_maciver.doc Bug 59739: For now fix the regression in FileInformationBlock which was introduced after 3.15-beta1 so that the documents can be loaded again pending a full fix as discussed in the bug. 2016-06-30 21:06:04 +00:00
checkboxes.docx github-7 - Form check box extraction with XWPFWordExtractor 2014-11-05 22:26:00 +00:00
ComplexNumberedLists.docx More unit testing for XWPF list numbering complex cases, and some TODOs on improving it, inspired by users@ discussions 2016-11-04 10:55:31 +00:00
delins.docx
DiffFirstPageHeadFoot.doc
DiffFirstPageHeadFoot.docx
documentProperties.doc output document properties to html and pdf 2011-07-06 09:38:18 +00:00
documentProperties.docx
documentProtection_comments_no_password.docx
documentProtection_forms_no_password.docx
documentProtection_no_protection_tag_existing.docx
documentProtection_no_protection.docx
documentProtection_readonly_no_password.docx
documentProtection_trackedChanges_no_password.docx
drawing.docx Test for parsing document with drawings to prevent NoClassDefFoundError for CTAnchor in XWPFRun 2011-06-22 12:46:42 +00:00
EmbeddedDocument.docx Bug 50474 - Example demonstrating how to update workbook embedded in a WordprocessingML document 2011-06-25 13:46:00 +00:00
empty.doc
EmptyDocumentWithHeaderFooter.docx
endingnote.doc initial support for endnotes and footnotes in HWPF 2011-07-20 22:31:59 +00:00
endnotes.docx
EnforcedWith.docx GitHub PR 27: Add method to check for any protection in XWPFDocument, closes #27 2016-02-15 09:26:51 +00:00
equation.doc bug 51351: more progress with WordToFoExtractor: support for hyperlinks, common fields and code cleanup 2011-06-20 15:56:28 +00:00
ExternalEntityInText.docx Add some extra safety test to check that external entities are not loaded by xmlbeans 2014-08-12 11:33:02 +00:00
FancyFoot.doc
FancyFoot.docx
FieldCodes.docx
FldSimple.docx Remove svn:executable property from a series of files that didn't need it set 2010-08-09 13:12:52 +00:00
FloatingPictures.doc Test that shows we handle word floating and fixed pictures properly 2010-09-28 11:46:22 +00:00
footnote.doc
footnotes.docx
form_footnotes.docx
header_image.doc
headerFooter.docx
HeaderFooterProblematic.doc Fix bug #49936 - Handle HWPF documents with problematic HeaderStories better 2010-09-17 14:14:19 +00:00
HeaderFooterUnicode.doc
HeaderFooterUnicode.docx
headerPic.docx XWPF: support for pictures in headers 2011-03-21 12:43:58 +00:00
Headers.docx
HeaderWithMacros.doc
heading123.docx
hyperlink.doc bug 51351: more progress with WordToFoExtractor: support for hyperlinks, common fields and code cleanup 2011-06-20 15:56:28 +00:00
IllustrativeCases.docx
innertable.doc Test correct processing of "sprmPItap" (0x6649) and "sprmPFInTable" (0x2416) 2011-07-05 01:19:31 +00:00
issue_51265_1.docx removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
issue_51265_2.docx removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
issue_51265_3.docx removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
ListEntryNoListTable.doc
lists-margins.doc handle lists margins 2011-09-20 14:14:17 +00:00
Lists.doc Work inspired by bug #48018 - get HWPF lists more consistent in read vs write, and preserve order as apparently that matters. Includes a fair number of list related unit tests, but not for everything 2010-09-20 14:26:49 +00:00
MarkAuthorsTable.doc
nature1.gif removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
nature1.jpg removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
nature1.png removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
nature2.jpg removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
nature3.jpg removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
nature4.jpg removed svn:executable bit from project files 2011-12-10 08:02:08 +00:00
NoHeadFoot.doc
NoHeadFoot.docx
Numbering.docx POI-57889 prevent NPE with on some documents with XWPFParagraph's getNumFmt() and add some other classes to enable calculation of paragraph numbers 2015-05-05 01:39:16 +00:00
NumberingWOverrides.docx POI-57889 -- actually trigger inclusion of CTNumLvl with document contributed by Moritz Dorka on TIKA-1315 2015-05-28 19:08:24 +00:00
o_kurs.doc
ob_is.doc
page-break-before.doc add test case for "page break before" property of paragraph 2011-08-22 13:27:15 +00:00
page-break.doc add page-break test case 2011-08-22 15:03:51 +00:00
pageref.doc bug 51351: more progress with WordToFoExtractor: support for hyperlinks, common fields and code cleanup 2011-06-20 15:56:28 +00:00
PageSpecificHeadFoot.doc
PageSpecificHeadFoot.docx
parentinvguid.doc hwpf: ignore null-reference to parent stylesheet (bug#50688) 2011-01-31 09:27:44 +00:00
PasswordProtected.doc
Picture_Alternative_Text.doc Patch from Josh Holthaus from bug #53165 - HWPF support for fetching the description (alt text) of a picture 2012-05-01 09:46:15 +00:00
picture.doc introduce picture descriptor structure (internal), now Picture class extends it; 2011-07-16 16:19:49 +00:00
pictures_escher.doc add simplest "escher" pictures support in Word-to-HTML and Word-to-FO converters 2011-07-28 15:08:06 +00:00
PngPicture.doc Restore HWPF support for inline Escher images (stored in Escher rather than direct in a PICFAndOfficeArtData in the main stream), plus add test for this kind of file 2011-11-28 18:42:30 +00:00
ProblemExtracting.doc
protected_sample.docx
rasp.doc
sample.docx
SampleDoc.doc
SampleDoc.docx
SampleDoc.txt
saved-by-table.doc
simple_image.jpg
simple_image.png
simple-list.doc
simple-table2.doc
simple-table.doc
simple.doc
SimpleHeadThreeColFoot.doc
SimpleHeadThreeColFoot.docx
SimpleMacro.doc bug 52949: add Word, Powerpoint, and Visio (HDGF) files with macros to test macro extraction 2016-04-11 03:02:18 +00:00
SimpleMacro.docm bug 52949: add Word, Powerpoint, and Visio (HDGF) files with macros to test macro extraction 2016-04-11 03:02:18 +00:00
SimpleMacro.vba bug 52949: add Word, Powerpoint, and Visio (HDGF) files with macros to test macro extraction 2016-04-11 03:02:18 +00:00
smarttag-snippet.docx Patch from Fabian from bug #52285 - support Smart Tags in XWPF paragraphs, with test (and fixing indents) 2011-12-06 04:31:04 +00:00
Styles.docx
table_footnotes.docx
table-merges.doc better processing of word tables in cases different rows have different cell widths 2011-07-22 09:42:32 +00:00
test2.doc
test-fields.doc support for getting HWPFDocument fields, see Bugzilla 50313 2011-03-14 09:10:12 +00:00
test.doc
test.dotx
testCroppedPictures.doc Bugzilla 51335: Parse picture goal and crop sizes 2011-06-11 14:58:50 +00:00
TestDocument.docx Correct XWPFRun detection of bold/italic in a paragraph with multiple runs of different styles 2010-09-14 16:32:02 +00:00
testPictures.doc
TestPoiXMLDocumentCorePropertiesGetKeywords.docx
testRangeDelete.doc
testRangeInsertion.doc
testRangeReplacement.doc
TestTableCellAlign.docx Avoid NPE in XWPFTableCell, taken from https://github.com/prasad-babu/poi/tree/WORKING_BRANCH 2016-06-02 20:09:44 +00:00
ThreeColFoot.doc
ThreeColFoot.docx
ThreeColHead.doc
ThreeColHead.docx
ThreeColHeadFoot.doc
ThreeColHeadFoot.docx
Tika-792.docx POI 55361 trigger to load CTMoveBookmark in TestXWPFParagraph 2013-08-08 13:42:01 +00:00
two_images.doc
VariousPictures.docx
vector_image.doc
vector_image.emf
watermark.doc add watermark test case (as example) 2011-08-23 15:35:07 +00:00
WithArtShapes.doc
WithGIF.docx Add unit test from Stefan for bug #51172 - DOCX gif support 2011-05-27 13:36:00 +00:00
WithTabs.docx
word95err.doc hwpf: add 2 more bytes to OldSectionTable to solve ArrayIndexOutOfBoundsException 2010-09-27 12:50:36 +00:00
Word6_sections2.doc More fixes for bug #49933, workaround the fact that some word6/word95 SEPX entries are compressed differently, and we don't have the specs for how they're stored 2010-09-19 09:59:10 +00:00
Word6_sections.doc Fix support for sections in old word 6 / word 95 files 2010-09-17 13:46:11 +00:00
Word6.doc
Word95.doc
word_with_embeded_ooxml.doc Inside ExtractorFactory, support finding embedded OOXML documents and providing extractors for them 2010-12-16 07:39:21 +00:00
word_with_embeded.doc
WordWithAttachments.docx
zero-length.docx Patch from rojotek from github-18 - Handle documents with a picture-only header 2015-02-24 12:09:30 +00:00