Package org.alfresco.repo.content.metadata

Interface Summary
MetadataExtracter Interface for document property extracters.
OpenOfficeMetadataWorker An interface that allows separation between the metadata extractor registry and the third party subsystem owning the open office connection.
 

Class Summary
AbstractMappingMetadataExtracter Support class for metadata extracters that support dynamic and config-driven mapping between extracted values and model properties.
AbstractMetadataExtracter Deprecated. Use the AbstractMappingMetadataExtracter
AbstractMetadataExtracterTest  
DefaultOpenOfficeMetadataWorker The class doing the actual work of the OpenOfficeMetadataExtracter, based around an OpenOffice connection.
DWGMetadataExtracter Metadata extractor for the MIMETYPE_DWG mimetype.
DWGMetadataExtracterTest  
HtmlMetadataExtracter Extracts the following values from HTML documents:
HtmlMetadataExtracterTest  
MailMetadataExtracter Outlook MAPI format email meta-data extractor extracting the following values:
MailMetadataExtracterTest  
MappingMetadataExtracterTest  
MappingMetadataExtracterTest.DummyMappingMetadataExtracter A spoofed-up extracter that extracts the following:
MetadataExtracter.OverwritePolicy A enumeration of functional property overwrite policies.
MetadataExtracterRegistry Holds and provides the most appropriate metadate extracter for a particular mimetype.
MP3MetadataExtracter Extracts the following values from MP3 files:
MP3MetadataExtracterTest Test for the MP3 metadata extraction from id3 tags.
OfficeMetadataExtracter Office file format Metadata Extracter.
OfficeMetadataExtracterTest  
OpenDocumentMetadataExtracter Metadata extractor for the MIMETYPE_OPENDOCUMENT_XXX mimetypes.
OpenDocumentMetadataExtracterTest  
OpenOfficeMetadataExtracter Extracts values from Star Office documents into the following:
OpenOfficeMetadataExtracterTest Note - this test can sometimes fail if run on its own, as there can be a race condition with the OO process.
PdfBoxMetadataExtracter Metadata extractor for the PDF documents.
PdfBoxMetadataExtracterTest  
PoiMetadataExtracter POI-based metadata extractor for Office 07 documents.
PoiMetadataExtracterTest  
RFC822MetadataExtracter Metadata extractor for RFC822 mime emails.
RFC822MetadataExtracterTest Test for the RFC822 (imap/mbox) extractor
TikaAutoMetadataExtracter A Metadata Extractor which makes use of the Apache Tika auto-detection to select the best parser to extract the metadata from your document.
TikaAutoMetadataExtracterTest  
TikaPoweredMetadataExtracter The parent of all Metadata Extractors which use Apache Tika under the hood.
TikaPoweredMetadataExtracter.HeadContentHandler This content handler will capture entries from within the header of the Tika content XHTML, but ignore the rest.
TikaPoweredMetadataExtracter.MapCaptureContentHandler This content handler will grab all tags and attributes, and record the textual content of the last seen one of them.
TikaPoweredMetadataExtracter.NullContentHandler A content handler that ignores all the content it finds.
TikaSpringConfiguredMetadataExtracter A Metadata Extractor which makes use of Apache Tika, and allows the selection of the Tika parser to be sprung-in to extract the metadata from your document.
 



Copyright © 2005 - 2010 Alfresco Software, Inc. All Rights Reserved.