org.alfresco.repo.content.transform
Class TikaAutoContentTransformer
java.lang.Object
org.alfresco.repo.content.transform.ContentTransformerHelper
org.alfresco.repo.content.transform.AbstractContentTransformer2
org.alfresco.repo.content.transform.TikaPoweredContentTransformer
org.alfresco.repo.content.transform.TikaAutoContentTransformer
- All Implemented Interfaces:
- ContentWorker, ContentTransformer
public class TikaAutoContentTransformer
- extends TikaPoweredContentTransformer
A Content Extractor for XML, HTML and Text,
which makes use of the Apache Tika
auto-detection to select the best parser
to process your document.
This will be used for all files which Tika can
handle, but where no other more explicit
extractor is defined.
|
Field Summary |
static java.util.ArrayList |
SUPPORTED_MIMETYPES
We support all the mimetypes that the Tika
auto-detect parser can handle, except for
Image, Audio and Video ones which don't
make much sense |
|
Method Summary |
protected org.apache.tika.parser.Parser |
getParser()
Returns the Tika Auto-Detection
parser, which will try to
process all documents that Tika
knows about |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
SUPPORTED_MIMETYPES
public static java.util.ArrayList SUPPORTED_MIMETYPES
- We support all the mimetypes that the Tika
auto-detect parser can handle, except for
Image, Audio and Video ones which don't
make much sense
TikaAutoContentTransformer
public TikaAutoContentTransformer()
getParser
protected org.apache.tika.parser.Parser getParser()
- Returns the Tika Auto-Detection
parser, which will try to
process all documents that Tika
knows about
- Specified by:
getParser in class TikaPoweredContentTransformer
Copyright © 2005 - 2010 Alfresco Software, Inc. All Rights Reserved.