org.alfresco.repo.content.transform
Class TikaAutoContentTransformer
java.lang.Object
org.alfresco.repo.content.transform.ContentTransformerHelper
org.alfresco.repo.content.transform.AbstractContentTransformer2
org.alfresco.repo.content.transform.TikaPoweredContentTransformer
org.alfresco.repo.content.transform.TikaAutoContentTransformer
- All Implemented Interfaces:
- ContentWorker, ContentTransformer
public class TikaAutoContentTransformer
- extends TikaPoweredContentTransformer
A content extractor for XML, HTML and text, which uses Apache Tika auto-detection to select the best parser for each document.
This is used for all files that Tika can handle, but for which no more explicit extractor is defined.
Field Summary
static java.util.ArrayList
SUPPORTED_MIMETYPES
All the mimetypes that the Tika auto-detect parser can handle are supported, except for image, audio and video types, for which this kind of extraction makes little sense.
Method Summary
protected org.apache.tika.parser.Parser
getParser()
Returns the Tika auto-detection parser, which will try to process all documents that Tika knows about.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
SUPPORTED_MIMETYPES
public static java.util.ArrayList SUPPORTED_MIMETYPES
- All the mimetypes that the Tika auto-detect parser can handle are supported, except for image, audio and video types, for which this kind of extraction makes little sense.
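The exclusion rule described above can be illustrated with a minimal sketch. This is not the Alfresco implementation: the class name SupportedMimetypesSketch, the method filterSupported and the sample input list are all hypothetical, and the sketch assumes the exclusion is decided purely by the top-level media type (image/, audio/, video/).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not taken from the Alfresco source.
class SupportedMimetypesSketch {

    // Copies every mimetype the parser reports into a new list,
    // skipping any whose top-level type is image, audio or video.
    static ArrayList<String> filterSupported(List<String> parserTypes) {
        ArrayList<String> supported = new ArrayList<String>();
        for (String type : parserTypes) {
            boolean excluded = type.startsWith("image/")
                    || type.startsWith("audio/")
                    || type.startsWith("video/");
            if (!excluded) {
                supported.add(type);
            }
        }
        return supported;
    }

    public static void main(String[] args) {
        // Sample input, assumed for the example; a real list would come from Tika.
        List<String> reported = Arrays.asList(
                "text/plain", "image/png", "application/pdf",
                "audio/mpeg", "video/mp4", "text/html");
        System.out.println(filterSupported(reported));
    }
}
```

In the real class the candidate types would presumably be taken from the Tika auto-detect parser rather than from a hard-coded list.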
TikaAutoContentTransformer
public TikaAutoContentTransformer(org.apache.tika.config.TikaConfig tikaConfig)
getParser
protected org.apache.tika.parser.Parser getParser()
- Returns the Tika auto-detection parser, which will try to process all documents that Tika knows about.
- Specified by:
- getParser in class TikaPoweredContentTransformer
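The relationship between getParser() here and in TikaPoweredContentTransformer follows the template-method pattern: the parent class declares the method and each subclass chooses the parser. The sketch below shows only that shape, using stand-in types so that it runs without Tika or Alfresco on the classpath. ParserSketch, PoweredTransformerSketch, AutoTransformerSketch and transformDescription are hypothetical names. In Tika itself the auto-detection parser is org.apache.tika.parser.AutoDetectParser, which is presumably what the real method returns.

```java
// Stand-in for org.apache.tika.parser.Parser; hypothetical, for illustration only.
interface ParserSketch {
    String describe();
}

// Stand-in for TikaPoweredContentTransformer: the base class owns the
// transformation logic and asks the subclass which parser to use.
abstract class PoweredTransformerSketch {

    // Each concrete transformer specifies its parser here.
    protected abstract ParserSketch getParser();

    // Placeholder for the real transformation; it only reports the chosen parser.
    String transformDescription() {
        return "parsing with " + getParser().describe();
    }
}

// Stand-in for TikaAutoContentTransformer: it supplies the auto-detection parser.
class AutoTransformerSketch extends PoweredTransformerSketch {

    @Override
    protected ParserSketch getParser() {
        // The real class would return a Tika parser object; this returns a label.
        return () -> "auto-detect parser";
    }

    public static void main(String[] args) {
        System.out.println(new AutoTransformerSketch().transformDescription());
    }
}
```

With this shape, a transformer for a different format only needs to override getParser(); the shared transformation logic in the parent stays untouched.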
Copyright © 2005 - 2010 Alfresco Software, Inc. All Rights Reserved.