Tim Allison
9412d467f3
bug 50955 - incorporate info from the DocumentSummaryInformation for
...
guessing the encoding. Back off to the old method if DocSummInfo is
not available. Thanks to Andreas Beeker for recommending this direction.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1791002 13f79535-47bb-0310-9956-ffa450edef68
2017-04-11 17:07:04 +00:00
Tim Allison
44d536e4ce
bug 50955 - try originally guessed codepoint, backoff to 1252 if that fails
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790904 13f79535-47bb-0310-9956-ffa450edef68
2017-04-11 01:30:02 +00:00
Nick Burch
eb427080e8
Javadoc fixes
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790239 13f79535-47bb-0310-9956-ffa450edef68
2017-04-05 13:17:17 +00:00
Tim Allison
1bfd5f6585
bug 50955 and bug 60953 improve Big5 reader; ensure one character
...
per byte pair
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790172 13f79535-47bb-0310-9956-ffa450edef68
2017-04-05 01:45:55 +00:00
Tim Allison
af51ea4c10
bug 50955 -- word 6.0 charset fix
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790061 13f79535-47bb-0310-9956-ffa450edef68
2017-04-04 02:06:46 +00:00
Nick Burch
3fa0449747
#57919 Add in-place and new-File write methods to POIDocument
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1753619 13f79535-47bb-0310-9956-ffa450edef68
2016-07-20 22:35:51 +00:00
Javen O'Neal
ae12cfcd57
bug 59170: remove deprecated HSLF and HWPF methods
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1748783 13f79535-47bb-0310-9956-ffa450edef68
2016-06-17 00:55:22 +00:00
Sergey Vladimirov
5a930ae36f
different workarounds for old Word format
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1195133 13f79535-47bb-0310-9956-ffa450edef68
2011-10-30 08:59:16 +00:00
Sergey Vladimirov
a4ec172b7a
extract FibBase from FIB
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1178019 13f79535-47bb-0310-9956-ffa450edef68
2011-10-01 15:32:32 +00:00
Sergey Vladimirov
1d9900c184
Add Word-to-Text converter and use it as replacement for WordExtractor
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1155336 13f79535-47bb-0310-9956-ffa450edef68
2011-08-09 12:38:52 +00:00
Sergey Vladimirov
7a333045e5
replace StringBuffer with StringBuilder in TextPiece
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1155208 13f79535-47bb-0310-9956-ffa450edef68
2011-08-09 05:00:09 +00:00
Sergey Vladimirov
c10da4c31a
replace ComplexFileTable with single-element-one right after load; replace text piece table as well
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1150675 13f79535-47bb-0310-9956-ffa450edef68
2011-07-25 12:58:09 +00:00
Sergey Vladimirov
0de832ba9d
add getOverallRange() method definition to HWPFDocumentCore
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143709 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 08:35:51 +00:00
Nick Burch
f2b541e026
Since a DirectoryNode has a reference to the underlying POIFSFileSystem, tidy up the POIDocument constructor to not need both passing in
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1053521 13f79535-47bb-0310-9956-ffa450edef68
2010-12-29 03:19:46 +00:00
Maxim Valyanskiy
78b0c18ade
HWPF: Improve reading of auto-saved ("complex") document
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960587 13f79535-47bb-0310-9956-ffa450edef68
2010-07-05 12:56:02 +00:00
Nick Burch
256e73d16d
More Word 6 / Word 95 Support
...
HWPFOldDocument now processes a few more table sections, and so we can fake up some
basic Ranges. This allows us to do paragraph level text extraction
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960102 13f79535-47bb-0310-9956-ffa450edef68
2010-07-02 20:59:30 +00:00
Nick Burch
30848a80aa
Basic text extraction support for old Word 6 and Word 95 documents via some HWPF extensions
...
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@959346 13f79535-47bb-0310-9956-ffa450edef68
2010-06-30 15:13:10 +00:00