org.alfresco.repo.content.metadata
Class PoiMetadataExtracter

java.lang.Object
  extended by org.alfresco.repo.content.metadata.AbstractMappingMetadataExtracter
      extended by org.alfresco.repo.content.metadata.TikaPoweredMetadataExtracter
          extended by org.alfresco.repo.content.metadata.PoiMetadataExtracter
All Implemented Interfaces:
ContentWorker, MetadataExtracter

public class PoiMetadataExtracter
extends TikaPoweredMetadataExtracter

POI-based metadata extractor for Office 07 documents. See http://poi.apache.org/ for information on POI.

   author:                 --      cm:author
   title:                  --      cm:title
   subject:                --      cm:description
   created:                --      cm:created
   Any custom property:    --      [not mapped]
 
Uses Apache Tika


Nested Class Summary
 
Nested classes/interfaces inherited from class org.alfresco.repo.content.metadata.TikaPoweredMetadataExtracter
TikaPoweredMetadataExtracter.HeadContentHandler, TikaPoweredMetadataExtracter.MapCaptureContentHandler, TikaPoweredMetadataExtracter.NullContentHandler
 
Nested classes/interfaces inherited from interface org.alfresco.repo.content.metadata.MetadataExtracter
MetadataExtracter.OverwritePolicy
 
Field Summary
protected static org.apache.commons.logging.Log logger
           
static java.util.ArrayList SUPPORTED_MIMETYPES
           
 
Fields inherited from class org.alfresco.repo.content.metadata.TikaPoweredMetadataExtracter
KEY_AUTHOR, KEY_COMMENTS, KEY_CREATED, KEY_DESCRIPTION, KEY_SUBJECT, KEY_TITLE
 
Fields inherited from class org.alfresco.repo.content.metadata.AbstractMappingMetadataExtracter
NAMESPACE_PROPERTY_PREFIX, supportedDateFormats
 
Constructor Summary
PoiMetadataExtracter()
           
 
Method Summary
protected  org.apache.tika.parser.Parser getParser()
          Returns the correct Tika Parser to process the document.
 
Methods inherited from class org.alfresco.repo.content.metadata.TikaPoweredMetadataExtracter
buildSupportedMimetypes, extractRaw, extractSpecific, makeDate, needHeaderContents
 
Methods inherited from class org.alfresco.repo.content.metadata.AbstractMappingMetadataExtracter
checkIsSupported, extract, extract, extract, filterSystemProperties, getDefaultMapping, getExtractionTime, getMapping, getMimetypeService, getReliability, init, isSupported, newRawMap, putRawValue, readMappingProperties, readMappingProperties, register, setDictionaryService, setFailOnTypeConversion, setInheritDefaultMapping, setMapping, setMappingProperties, setMimetypeService, setOverwritePolicy, setOverwritePolicy, setRegistry, setSupportedDateFormats, setSupportedMimetypes
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

logger

protected static org.apache.commons.logging.Log logger

SUPPORTED_MIMETYPES

public static java.util.ArrayList SUPPORTED_MIMETYPES
Constructor Detail

PoiMetadataExtracter

public PoiMetadataExtracter()
Method Detail

getParser

protected org.apache.tika.parser.Parser getParser()
Description copied from class: TikaPoweredMetadataExtracter
Returns the correct Tika Parser to process the document. If you don't know which you want, use TikaAutoMetadataExtracter which makes use of the Tika auto-detection.

Specified by:
getParser in class TikaPoweredMetadataExtracter


Copyright © 2005 - 2010 Alfresco Software, Inc. All Rights Reserved.