

...

Below is a listing of all currently available Media Filters, and what they actually do:

| Name | Java Class | Function | Default input formats | Enabled by Default? |
|---|---|---|---|---|
| PDF Text Extractor | org.dspace.app.mediafilter.PDFFilter | Extracts the full text of Adobe PDF documents (only if text-based or OCRed) for full text indexing. (Uses the Apache PDFBox tool.) | Adobe PDF | yes |
| HTML Text Extractor | org.dspace.app.mediafilter.HTMLFilter | Extracts the full text of HTML documents for full text indexing. (Uses Swing's HTML Parser.) | HTML, Text | yes |
| Word Text Extractor | org.dspace.app.mediafilter.WordFilter | Extracts the full text of Microsoft Word or Plain Text documents for full text indexing. (Uses the "Microsoft Word Text Mining" tools.) See also PoiWordFilter, below. | Microsoft Word | yes |
| Word Text Extractor | org.dspace.app.mediafilter.PoiWordFilter | Extracts the full text of Microsoft Word and Microsoft Word XML documents for full text indexing. (Uses the "Apache POI" tools.) Disabled by default. Uncomment PoiWordFilter and comment WordFilter in dspace.cfg if you wish to use this one. | Microsoft Word, Microsoft Word XML | no |
| Excel Text Extractor | org.dspace.app.mediafilter.ExcelFilter | Extracts the full text of Microsoft Excel documents for full text indexing. (Uses the "Apache POI" tools.) | Microsoft Excel, Microsoft Excel XML | yes |
| PowerPoint Text Extractor | org.dspace.app.mediafilter.PowerPointFilter | Extracts the full text of slides and notes in Microsoft PowerPoint and PowerPoint XML documents for full text indexing. (Uses the Apache POI tools.) | Microsoft Powerpoint, Microsoft Powerpoint XML | yes |
| PDFBox JPEG Thumbnail | org.dspace.app.mediafilter.PDFBoxThumbnail | Creates thumbnail images of the first page of PDF files. | Adobe PDF | yes |
| JPEG Thumbnail | org.dspace.app.mediafilter.JPEGFilter | Creates thumbnail images of GIF, JPEG and PNG files. | BMP, GIF, JPEG, image/png | yes |
| Branded Preview JPEG | org.dspace.app.mediafilter.BrandedPreviewJPEGFilter | Creates a branded preview image for GIF, JPEG and PNG files. | BMP, GIF, JPEG, image/png | no |
| ImageMagick Image Thumbnail Generator | org.dspace.app.mediafilter.ImageMagickImageThumbnailFilter | Uses ImageMagick to generate thumbnails for image bitstreams. Requires installation of ImageMagick on your server. See ImageMagick Media Filters. | BMP, GIF, image/png, JPG, TIFF, JPEG, JPEG 2000 | no |
| ImageMagick PDF Thumbnail Generator | org.dspace.app.mediafilter.ImageMagickPdfThumbnailFilter | Uses ImageMagick and Ghostscript to generate thumbnails for PDF bitstreams. Requires installation of ImageMagick and Ghostscript on your server. See ImageMagick Media Filters. | Adobe PDF | no |
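
As a reading aid, the listing above can be modeled as a small lookup that answers "which filters would process this format out of the box?". This is only an illustrative sketch: the `FILTERS` list and the `default_filters_for` helper are hypothetical names, the data is copied from the listing, and DSpace itself resolves filters through its configuration rather than through anything like this Python structure.

```python
# Hypothetical model of the listing above: (Java class, input formats, enabled by default).
# Class names are shown without the org.dspace.app.mediafilter. prefix.
FILTERS = [
    ("PDFFilter", ["Adobe PDF"], True),
    ("HTMLFilter", ["HTML", "Text"], True),
    ("WordFilter", ["Microsoft Word"], True),
    ("PoiWordFilter", ["Microsoft Word", "Microsoft Word XML"], False),
    ("ExcelFilter", ["Microsoft Excel", "Microsoft Excel XML"], True),
    ("PowerPointFilter", ["Microsoft Powerpoint", "Microsoft Powerpoint XML"], True),
    ("PDFBoxThumbnail", ["Adobe PDF"], True),
    ("JPEGFilter", ["BMP", "GIF", "JPEG", "image/png"], True),
    ("BrandedPreviewJPEGFilter", ["BMP", "GIF", "JPEG", "image/png"], False),
    ("ImageMagickImageThumbnailFilter",
     ["BMP", "GIF", "image/png", "JPG", "TIFF", "JPEG", "JPEG 2000"], False),
    ("ImageMagickPdfThumbnailFilter", ["Adobe PDF"], False),
]


def default_filters_for(input_format):
    """Return the filters that are enabled by default for the given input format."""
    return [
        java_class
        for java_class, formats, enabled in FILTERS
        if enabled and input_format in formats
    ]


if __name__ == "__main__":
    for fmt in ("Adobe PDF", "Microsoft Word", "TIFF"):
        print(fmt, "->", default_filters_for(fmt))
```

Under this model, a format such as TIFF has no default filter at all, because the only filter that lists it (the ImageMagick image thumbnail generator) is disabled by default.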

Please note that the filter-media script will automatically update the DSpace search index by default.

...

Property: filter.org.dspace.app.mediafilter.publicPermission

Example Value: filter.org.dspace.app.mediafilter.publicPermission = JPEGFilter, XPDF2Thumbnail

Informational Note: By default, media filter derivatives and thumbnails inherit the permissions of the parent bitstream. You can override this if you want to make derivative or thumbnail content publicly accessible, typically the thumbnails of objects for the browse list. List the names of the MediaFilters whose output should receive publicly accessible permissions. Any media filters not listed will instead inherit the permissions of the parent bitstream.
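
The inheritance rule described for publicPermission can be sketched in a few lines. This is a minimal model of the rule as stated here, not DSpace's actual implementation: the function names are hypothetical, and the "Anonymous" label is an assumed stand-in for "publicly accessible".

```python
def parse_public_filters(value):
    """Split the comma-separated property value into a set of filter names."""
    return {name.strip() for name in value.split(",") if name.strip()}


def derivative_permissions(filter_name, parent_permissions, public_filters):
    """Return the read permissions a derivative would receive under the rule above.

    Filters named in publicPermission produce publicly accessible output;
    every other filter's output inherits the parent bitstream's permissions.
    """
    if filter_name in public_filters:
        return {"Anonymous"}  # assumed label for public access
    return set(parent_permissions)


if __name__ == "__main__":
    public = parse_public_filters("JPEGFilter, XPDF2Thumbnail")
    print(derivative_permissions("JPEGFilter", {"Staff"}, public))
    print(derivative_permissions("PDFFilter", {"Staff"}, public))
```

With the example value shown, a thumbnail produced by JPEGFilter for a restricted bitstream would be public, while text extracted by PDFFilter would stay restricted to the same groups as the parent bitstream.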

...