diff --git a/src/documentation/content/xdocs/text-extraction.xml b/src/documentation/content/xdocs/text-extraction.xml
index 397aa1b39..d71a0bf10 100644
--- a/src/documentation/content/xdocs/text-extraction.xml
+++ b/src/documentation/content/xdocs/text-extraction.xml
@@ -71,6 +71,13 @@
 Those using POI 3.5 can also use org.apache.poi.xssf.extractor.XSSFExcelExtractor, to perform a similar task for .xlsx files.

+ In addition, there is a second text extractor for .xls files,
+ org.apache.poi.hssf.extractor.EventBasedExcelExtractor. This
+ is based on the streaming EventUserModel code, and will generally
+ deliver a lower memory footprint for extraction. However, it will
+ have problems correctly outputting more complex formulas, as it
+ works with records as they pass, and so doesn't have access to all
+ parts of complex and shared formulas.

Word