Page History
...
Available Command-Line Options:
*Help* : {{\Wiki Markup [dspace
\]/bin/dspace
filter-media
\-h
}}- Display help message describing all command-line options.
- Force mode* : {{\
[dspace
\]/bin/dspace
filter-media
\-f
}}- Apply filters to ALL bitstreams, even if they've already been filtered. If they've already been filtered, the previously filtered content is overwritten.
*Wiki Markup - Identifier mode* : {{\
[dspace
\]/bin/dspace
filter-media
\-i
123456789/2
}}- Restrict processing to the community, collection, or item named by the identifier - by default, all bitstreams of all items in the repository are processed. The identifier must be a Handle, not a DB key. This option may be combined with any other option.
*Wiki Markup - Maximum mode* : {{\
[dspace
\]/bin/dspace
filter-media
\-m
1000
}}- Suspend operation after the specified maximum number of items have been processed - by default, no limit exists. This option may be combined with any other option.
*Wiki Markup - No-Index mode* : {{\
[dspace
\]/bin/dspace
filter-media
\-n
}}- Suppress index creation - by default, a new search index is created for full-text searching. This option suppresses index creation if you intend to run
index-update
elsewhere.
*Wiki Markup - Suppress index creation - by default, a new search index is created for full-text searching. This option suppresses index creation if you intend to run
- Plugin mode* : {{\
[dspace
\]/bin/dspace
filter-media
\-p
"PDF
Text
Extractor","Word
Text
Extractor"
}}- Apply ONLY the filter plugin(s) listed (separated by commas). By default all named filters listed in the filter.plugins field of dspace.cfg are applied. This option may be combined with any other option. WARNING: multiple plugin names must be separated by a comma (i.e. ',') and NOT a comma followed by a space (i.e. ', ').
*Wiki Markup - Skip mode* : {{\
[dspace
\]/bin/dspace
filter-media
\-s
123456789/9,123456789/100
}}- SKIP the listed identifiers (separated by commas) during processing. The identifiers must be Handles (not DB Keys). They may refer to items, collections or communities which should be skipped. This option may be combined with any other option. WARNING: multiple identifiers must be separated by a comma (i.e. ',') and NOT a comma followed by a space (i.e. ', ').
- NOTE: If you have a large number of identifiers to skip, you may maintain this comma-separated list within a separate file (e.g. filter-skiplist.txt). Use the following format to call the program. Please note the use of the "grave" or "tick" (`) symbol and do not use the single quotation.
{{\Wiki Markup [dspace
\]/bin/dspace
filter-media
\-s
`less
filter-skiplist.txt`
}}
- Verbose mode* : {{\
[dspace
\]/bin/dspace
filter-media
\-v
}}- Verbose mode - print all extracted text and other filter details to STDOUT.
Adding your own filters is done by creating a class which implements theorg.dspace.app.mediafilter.FormatFilter
interface. See the Creating a new Media/Format Filter topic and comments in the source fileFormatFilter.java
for more information. In theory filters could be implemented in any programming language (C, Perl, etc.) However, they need to be invoked by the Java code in the Media Filter class that you create.
- Verbose mode - print all extracted text and other filter details to STDOUT.
...
Overview
Content Tools