Commit Graph

17 Commits

Author SHA1 Message Date
Tim Allison 9412d467f3 bug 50955 - incorporate info from the DocumentSummaryInformation for
guessing the encoding.  Back off to the old method if DocSummInfo is
not available. Thanks to Andreas Beeker for recommending this direction.

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1791002 13f79535-47bb-0310-9956-ffa450edef68
2017-04-11 17:07:04 +00:00
Tim Allison 44d536e4ce bug 50955 - try originally guessed codepoint, backoff to 1252 if that fails
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790904 13f79535-47bb-0310-9956-ffa450edef68
2017-04-11 01:30:02 +00:00
Nick Burch eb427080e8 Javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790239 13f79535-47bb-0310-9956-ffa450edef68
2017-04-05 13:17:17 +00:00
Tim Allison 1bfd5f6585 bug 50955 and bug 60953 improve Big5 reader; ensure one character
per byte pair

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790172 13f79535-47bb-0310-9956-ffa450edef68
2017-04-05 01:45:55 +00:00
Tim Allison af51ea4c10 bug 50955 -- word 6.0 charset fix
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790061 13f79535-47bb-0310-9956-ffa450edef68
2017-04-04 02:06:46 +00:00
Nick Burch 3fa0449747 #57919 Add in-place and new-File write methods to POIDocument
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1753619 13f79535-47bb-0310-9956-ffa450edef68
2016-07-20 22:35:51 +00:00
Javen O'Neal ae12cfcd57 bug 59170: remove deprecated HSLF and HWPF methods
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1748783 13f79535-47bb-0310-9956-ffa450edef68
2016-06-17 00:55:22 +00:00
Sergey Vladimirov 5a930ae36f different workarounds for old Word format
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1195133 13f79535-47bb-0310-9956-ffa450edef68
2011-10-30 08:59:16 +00:00
Sergey Vladimirov a4ec172b7a extract FibBase from FIB
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1178019 13f79535-47bb-0310-9956-ffa450edef68
2011-10-01 15:32:32 +00:00
Sergey Vladimirov 1d9900c184 Add Word-to-Text converter and use it as replacement for WordExtractor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1155336 13f79535-47bb-0310-9956-ffa450edef68
2011-08-09 12:38:52 +00:00
Sergey Vladimirov 7a333045e5 replace StringBuffer with StringBuilder in TextPiece
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1155208 13f79535-47bb-0310-9956-ffa450edef68
2011-08-09 05:00:09 +00:00
Sergey Vladimirov c10da4c31a replace ComplexFileTable with single-element-one right after load; replace text piece table as well
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1150675 13f79535-47bb-0310-9956-ffa450edef68
2011-07-25 12:58:09 +00:00
Sergey Vladimirov 0de832ba9d add getOverallRange() method definition to HWPFDocumentCore
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143709 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 08:35:51 +00:00
Nick Burch f2b541e026 Since a DirectoryNode has a reference to the underlying POIFSFileSystem, tidy up the POIDocument constructor to not need both passing in
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1053521 13f79535-47bb-0310-9956-ffa450edef68
2010-12-29 03:19:46 +00:00
Maxim Valyanskiy 78b0c18ade HWPF: Improve reading of auto-saved ("complex") document
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960587 13f79535-47bb-0310-9956-ffa450edef68
2010-07-05 12:56:02 +00:00
Nick Burch 256e73d16d More Word 6 / Word 95 Support
HWPFOldDocument now processes a few more table sections, and so we can fake up some
 basic Ranges. This allows us to do paragraph level text extraction


git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960102 13f79535-47bb-0310-9956-ffa450edef68
2010-07-02 20:59:30 +00:00
Nick Burch 30848a80aa Basic text extraction support for old Word 6 and Word 95 documents via some HWPF extensions
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@959346 13f79535-47bb-0310-9956-ffa450edef68
2010-06-30 15:13:10 +00:00