Page History
...
Name | Java Class | Function | Enabled by Default? |
---|---|---|---|
HTML Text Extractor |
| extracts the full text of HTML documents for full text indexing. (Uses Swing's HTML Parser) | true |
JPEG Thumbnail |
| creates thumbnail images of GIF, JPEG and PNG files | true |
Branded Preview JPEG |
| creates a branded preview image for GIF, JPEG and PNG files | false |
PDF Text Extractor |
| extracts the full text of Adobe PDF documents (only if text-based or OCRed) for full text indexing. (Uses the Apache PDFBox tool) | true |
XPDF Text Extractor |
| extracts the full text of Adobe PDF documents (only if text-based or OCRed) for full text indexing (Uses the XPDF command line tools available for Unix.) See XPDF Filter Configuration for details on installing/enabling. | false |
Word Text Extractor |
| extracts the full text of Microsoft Word or Plain Text documents for full text indexing. (Uses the "Microsoft Word Text Mining" tools.) | true |
PowerPoint Text Extractor |
| extracts the full text of slides and notes in Microsoft PowerPoint and PowerPoint XML documents for full text indexing (Uses the Apache POI tools.) | true |
ImageMagick Image Thumbnail Generator | org.dspace.app.mediafilter.ImageMagickImageThumbnailFilter | uses ImageMagick to generate thumbnails for image bitstreams. Requires installation of ImageMagick on your server. See ImageMagick Media Filters. | false |
ImageMagick PDF Thumbnail Generator | org.dspace.app.mediafilter.ImageMagickPdfThumbnailFilter | uses ImageMagick and Ghostscript to generate thumbnails for PDF bitstreams. Requires installation of ImageMagick and Ghostscript on your server. See ImageMagick Media Filters. | false |
Please note that the filter-media
script will automatically update the DSpace search index by default (see Legacy methods for re-indexing content) This is the recommended way to run these scripts. But, should you wish to disable it, you can pass the -n flag to either script to do so (see Executing (via Command Line) below).
...
Code Block |
---|
#Get "outputFormat" configuration from dspace.cfg String outputFormat = ConfigurationManager.getProperty(MediaFilterManager.FILTER_PREFIX + "." + MyComplexMediaFilter.class.getName() + "." + this.getPluginInstanceName() + ".outputFormat"); |
Configuration parameters
Property | filter.org.dspace.app.mediafilter.publicPermission |
---|---|
Example Value | filter.org.dspace.app.mediafilter.publicPermission = JPEGFilter, XPDF2Thumbnail |
Informational Note | By default mediafilter derivatives / thumbnails inherit the same permissions of the parent bitstream, but you can override this, in case you want to make publicly accessible derivative / thumbnail content, typically the thumbnails of objects for the browse list. List the MediaFilter name's that would get public accessible permissions. Any media filters not listed will instead inherit the permissions of the parent bitstream. |