org.alfresco.repo.content.metadata
Class TikaAutoMetadataExtracter

java.lang.Object
  extended by org.alfresco.repo.content.metadata.AbstractMappingMetadataExtracter
      extended by org.alfresco.repo.content.metadata.TikaPoweredMetadataExtracter
          extended by org.alfresco.repo.content.metadata.TikaAutoMetadataExtracter
All Implemented Interfaces:
ContentWorker, MetadataExtracter

public class TikaAutoMetadataExtracter
extends TikaPoweredMetadataExtracter

A Metadata Extractor which makes use of the Apache Tika auto-detection to select the best parser to extract the metadata from your document. This will be used for all files which Tika can handle, but where no other more explicit extractor is defined.

   author:                 --      cm:author
   title:                  --      cm:title
   subject:                --      cm:description
   created:                --      cm:created
   comments:
   

geo:lat: -- cm:latitude

geo:long: -- cm:longitude


Nested Class Summary
 
Nested classes/interfaces inherited from class org.alfresco.repo.content.metadata.TikaPoweredMetadataExtracter
TikaPoweredMetadataExtracter.HeadContentHandler, TikaPoweredMetadataExtracter.MapCaptureContentHandler, TikaPoweredMetadataExtracter.NullContentHandler
 
Nested classes/interfaces inherited from interface org.alfresco.repo.content.metadata.MetadataExtracter
MetadataExtracter.OverwritePolicy
 
Field Summary
protected static org.apache.commons.logging.Log logger
           
static java.util.ArrayList SUPPORTED_MIMETYPES
           
 
Fields inherited from class org.alfresco.repo.content.metadata.TikaPoweredMetadataExtracter
KEY_AUTHOR, KEY_COMMENTS, KEY_CREATED, KEY_DESCRIPTION, KEY_SUBJECT, KEY_TITLE
 
Fields inherited from class org.alfresco.repo.content.metadata.AbstractMappingMetadataExtracter
NAMESPACE_PROPERTY_PREFIX, supportedDateFormats
 
Constructor Summary
TikaAutoMetadataExtracter(org.apache.tika.config.TikaConfig tikaConfig)
           
 
Method Summary
protected  org.apache.tika.parser.Parser getParser()
          Does auto-detection to select the best Tika Parser.
 
Methods inherited from class org.alfresco.repo.content.metadata.TikaPoweredMetadataExtracter
buildSupportedMimetypes, extractRaw, extractSpecific, makeDate, needHeaderContents
 
Methods inherited from class org.alfresco.repo.content.metadata.AbstractMappingMetadataExtracter
checkIsSupported, extract, extract, extract, filterSystemProperties, getDefaultMapping, getExtractionTime, getMapping, getMimetypeService, getReliability, init, isSupported, newRawMap, putRawValue, readMappingProperties, readMappingProperties, register, setDictionaryService, setFailOnTypeConversion, setInheritDefaultMapping, setMapping, setMappingProperties, setMimetypeService, setOverwritePolicy, setOverwritePolicy, setRegistry, setSupportedDateFormats, setSupportedMimetypes
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

logger

protected static org.apache.commons.logging.Log logger

SUPPORTED_MIMETYPES

public static java.util.ArrayList SUPPORTED_MIMETYPES
Constructor Detail

TikaAutoMetadataExtracter

public TikaAutoMetadataExtracter(org.apache.tika.config.TikaConfig tikaConfig)
Method Detail

getParser

protected org.apache.tika.parser.Parser getParser()
Does auto-detection to select the best Tika Parser.

Specified by:
getParser in class TikaPoweredMetadataExtracter


Copyright © 2005 - 2010 Alfresco Software, Inc. All Rights Reserved.