org.alfresco.repo.content.transform
Class TikaAutoContentTransformer

java.lang.Object
  extended by org.alfresco.repo.content.transform.ContentTransformerHelper
      extended by org.alfresco.repo.content.transform.AbstractContentTransformer2
          extended by org.alfresco.repo.content.transform.TikaPoweredContentTransformer
              extended by org.alfresco.repo.content.transform.TikaAutoContentTransformer
All Implemented Interfaces:
ContentWorker, ContentTransformer

public class TikaAutoContentTransformer
extends TikaPoweredContentTransformer

A Content Extractor for XML, HTML and Text, which makes use of the Apache Tika auto-detection to select the best parser to process your document. This will be used for all files which Tika can handle, but where no other more explicit extractor is defined.


Field Summary
static java.util.ArrayList SUPPORTED_MIMETYPES
          We support all the mimetypes that the Tika auto-detect parser can handle, except for Image, Audio and Video ones which don't make much sense
 
Fields inherited from class org.alfresco.repo.content.transform.TikaPoweredContentTransformer
LINE_BREAK, sourceMimeTypes, WRONG_FORMAT_MESSAGE_ID
 
Constructor Summary
TikaAutoContentTransformer(org.apache.tika.config.TikaConfig tikaConfig)
           
 
Method Summary
protected  org.apache.tika.parser.Parser getParser()
          Returns the Tika Auto-Detection parser, which will try to process all documents that Tika knows about
 
Methods inherited from class org.alfresco.repo.content.transform.TikaPoweredContentTransformer
buildParseContext, getContentHandler, isTransformable, transformInternal
 
Methods inherited from class org.alfresco.repo.content.transform.AbstractContentTransformer2
checkTransformable, getTransformationTime, recordTime, register, setRegistry, toString, transform, transform, transform
 
Methods inherited from class org.alfresco.repo.content.transform.ContentTransformerHelper
getMimetype, getMimetypeService, isExplicitTransformation, setExplicitTransformations, setMimetypeService
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 
Methods inherited from interface org.alfresco.repo.content.transform.ContentTransformer
isExplicitTransformation
 

Field Detail

SUPPORTED_MIMETYPES

public static java.util.ArrayList SUPPORTED_MIMETYPES
We support all the mimetypes that the Tika auto-detect parser can handle, except for Image, Audio and Video ones which don't make much sense

Constructor Detail

TikaAutoContentTransformer

public TikaAutoContentTransformer(org.apache.tika.config.TikaConfig tikaConfig)
Method Detail

getParser

protected org.apache.tika.parser.Parser getParser()
Returns the Tika Auto-Detection parser, which will try to process all documents that Tika knows about

Specified by:
getParser in class TikaPoweredContentTransformer


Copyright © 2005 - 2010 Alfresco Software, Inc. All Rights Reserved.