...
Name | Java Class | Function | Enabled by Default? |
---|---|---|---|
HTML Text Extractor | | extracts the full text of HTML documents for full text indexing | true |
JPEG Thumbnail | | creates thumbnail images of GIF, JPEG and PNG files | true |
Branded Preview JPEG | | creates a branded preview image for GIF, JPEG and PNG files | false |
PDF Text Extractor | | extracts the full text of Adobe PDF documents (only if text-based or OCRed) for full text indexing | true |
Word Text Extractor | | extracts the full text of Microsoft Word or plain text documents for full text indexing | true |
PowerPoint Text Extractor | | extracts the full text of slides and notes in Microsoft PowerPoint and PowerPoint XML documents for full text indexing | true |
...