org.alfresco.repo.content.transform
Class TextMiningContentTransformer

java.lang.Object
  extended by org.alfresco.repo.content.transform.ContentTransformerHelper
      extended by org.alfresco.repo.content.transform.AbstractContentTransformer2
          extended by org.alfresco.repo.content.transform.TikaPoweredContentTransformer
              extended by org.alfresco.repo.content.transform.TextMiningContentTransformer
All Implemented Interfaces:
ContentWorker, ContentTransformer

public class TextMiningContentTransformer
extends TikaPoweredContentTransformer

This badly named transformer turns Microsoft Word documents (Word 6, 95, 97, 2000, 2003) into plain text, using Apache Tika.


Field Summary
 
Fields inherited from class org.alfresco.repo.content.transform.TikaPoweredContentTransformer
LINE_BREAK, sourceMimeTypes, WRONG_FORMAT_MESSAGE_ID
 
Constructor Summary
TextMiningContentTransformer()
           
 
Method Summary
protected  org.apache.tika.parser.Parser getParser()
          Returns the correct Tika Parser to process the document.
 
Methods inherited from class org.alfresco.repo.content.transform.TikaPoweredContentTransformer
buildParseContext, getContentHandler, isTransformable, transformInternal
 
Methods inherited from class org.alfresco.repo.content.transform.AbstractContentTransformer2
checkTransformable, getTransformationTime, recordTime, register, setRegistry, toString, transform, transform, transform
 
Methods inherited from class org.alfresco.repo.content.transform.ContentTransformerHelper
getMimetype, getMimetypeService, isExplicitTransformation, setExplicitTransformations, setMimetypeService
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 
Methods inherited from interface org.alfresco.repo.content.transform.ContentTransformer
isExplicitTransformation
 

Constructor Detail

TextMiningContentTransformer

public TextMiningContentTransformer()
Method Detail

getParser

protected org.apache.tika.parser.Parser getParser()
Description copied from class: TikaPoweredContentTransformer
Returns the correct Tika Parser to process the document. If you don't know which you want, use TikaAutoContentTransformer which makes use of the Tika auto-detection.

Specified by:
getParser in class TikaPoweredContentTransformer


Copyright © 2005 - 2010 Alfresco Software, Inc. All Rights Reserved.