Commit Graph

69 Commits

Author SHA1 Message Date
Dominik Stadler
ca9f22c289 Integration tests: Expect exception for old word documents and still run the text extraction for them. Also add executing HPSFPropertiesExtractor where possible
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1668483 13f79535-47bb-0310-9956-ffa450edef68
2015-03-22 21:47:44 +00:00
Dominik Stadler
e6d6cbcbd1 Add test-document to verify that bug 51921 is fixed already
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1668482 13f79535-47bb-0310-9956-ffa450edef68
2015-03-22 21:47:19 +00:00
Dominik Stadler
dad8d1547c Add more test-files from bug 44501, all seem to work fine now
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1668369 13f79535-47bb-0310-9956-ffa450edef68
2015-03-22 13:42:06 +00:00
Dominik Stadler
f9961331ff Bug 47304: use fixed encoding when extracting text in WordDocument
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1668367 13f79535-47bb-0310-9956-ffa450edef68
2015-03-22 13:33:43 +00:00
Andreas Beeker
652211b8d2 Snapshot commit - still lots of errors, xslf rendering works, started work on hslf
git-svn-id: https://svn.apache.org/repos/asf/poi/branches/common_sl@1667902 13f79535-47bb-0310-9956-ffa450edef68
2015-03-19 23:44:23 +00:00
Nick Burch
a181f03b9a Exclude the OOXML strict files for now, not yet supported, see bug #57699
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1666433 13f79535-47bb-0310-9956-ffa450edef68
2015-03-13 13:03:53 +00:00
Dominik Stadler
091f06330e Add helper test to verify that vsdx cannot even be loaded by POIXMLDocument curently.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1665984 13f79535-47bb-0310-9956-ffa450edef68
2015-03-11 19:53:26 +00:00
Nick Burch
2879bc1b16 No support as yet for the OOXML visio files, so have these ignored
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1665971 13f79535-47bb-0310-9956-ffa450edef68
2015-03-11 19:08:59 +00:00
Nick Burch
d78a14b232 Fix inconsistent indents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1665970 13f79535-47bb-0310-9956-ffa450edef68
2015-03-11 19:07:13 +00:00
Dominik Stadler
54bbd99ce4 Add missing close and handle theme-pptx in ExtractorFactory. Add creating slide-bitmaps to PPTX integration test.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1663137 13f79535-47bb-0310-9956-ffa450edef68
2015-03-01 17:50:16 +00:00
Dominik Stadler
f043c44017 * Verify some more Text-Extraction features as part of integration tests, fix some NullPointerExceptions that showed up now because the event-based extraction does not have a Document available
* Also handle a XLSX which does not have row-numbers in the sheet-xml. Excel can read it so it makes sense to also allow to read it in the XSSFSheetXMLHandler 
* Remove some Eclipse warnings in test-code

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1662691 13f79535-47bb-0310-9956-ffa450edef68
2015-02-27 14:58:41 +00:00
Dominik Stadler
80d0d3b5bd Add missing javadoc-header
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1662661 13f79535-47bb-0310-9956-ffa450edef68
2015-02-27 11:33:36 +00:00
Dominik Stadler
e86ed72c2b * Add text-extraction verification to integration-tests via a new abstract base FileHandler
* Fix NullPointerException found in some documents when running against the test-data
* Add support for extracting text from Dir-Entries WORKBOOK and BOOK to support some old/strangely formatted XLS files.

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1662652 13f79535-47bb-0310-9956-ffa450edef68
2015-02-27 09:59:14 +00:00
Nick Burch
ce3cc5b519 Another exclude
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1651512 13f79535-47bb-0310-9956-ffa450edef68
2015-01-13 21:48:43 +00:00
Dominik Stadler
e8c5bd48ad Extract ROOT_DIR into constant, ignore .csv files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1649124 13f79535-47bb-0310-9956-ffa450edef68
2015-01-02 21:07:14 +00:00
Andreas Beeker
4bed984c0f Charset.forName() for known encodings makes catching UnknownEncodingException obsolete
Unify UTF-16LE conversion to StringUtil
BugFix for RecordInputStream.readFully in combination with continuing records
BugFix for integration tests - fix pathname for handler/exclude lookup on windows

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1648032 13f79535-47bb-0310-9956-ffa450edef68
2014-12-27 01:33:28 +00:00
Dominik Stadler
ae8391b926 One more change in file-pattern to actually exclude .svn files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1647941 13f79535-47bb-0310-9956-ffa450edef68
2014-12-26 07:51:46 +00:00
Dominik Stadler
65ca8293d5 Exclude .svn sub-dirs as it seems Apache Jenkins still runs with Subversion <=1.6
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1647939 13f79535-47bb-0310-9956-ffa450edef68
2014-12-26 07:04:34 +00:00
Dominik Stadler
0c580c8f45 Add a test-suite which performs integration/stress tests which load and handle all stored test files in various ways.
It works by using handlers for each type of file which perform various operations on the files, e.g. loading,
iterating content, modify, ... This will trigger changes which break working with the available test-files and
thus provides another layer of regression testing which hopefully prevents some failures from making it into
releases.

It is runnable via a new ant-target 'test-integration' and also added to the jenkins-target.

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1647885 13f79535-47bb-0310-9956-ffa450edef68
2014-12-25 09:16:44 +00:00