|
|||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |
Interface Summary | |
---|---|
MetadataExtracter | Interface for document property extracters. |
OpenOfficeMetadataWorker | An interface that allows separation between the metadata extractor registry and the third party subsystem owning the open office connection. |
Class Summary | |
---|---|
AbstractMappingMetadataExtracter | Support class for metadata extracters that support dynamic and config-driven mapping between extracted values and model properties. |
AbstractMetadataExtracter | Deprecated. Use the AbstractMappingMetadataExtracter |
AbstractMetadataExtracterTest | |
DefaultOpenOfficeMetadataWorker | The class doing the actual work of the OpenOfficeMetadataExtracter, based around an OpenOffice connection. |
DWGMetadataExtracter | Metadata extractor for the
MIMETYPE_DWG
mimetype. |
DWGMetadataExtracterTest | |
HtmlMetadataExtracter | Extracts the following values from HTML documents: |
HtmlMetadataExtracterTest | |
MailMetadataExtracter | Outlook MAPI format email meta-data extractor extracting the following values: |
MailMetadataExtracterTest | |
MappingMetadataExtracterTest | |
MappingMetadataExtracterTest.DummyMappingMetadataExtracter | A spoofed-up extracter that extracts the following: |
MetadataExtracter.OverwritePolicy | A enumeration of functional property overwrite policies. |
MetadataExtracterRegistry | Holds and provides the most appropriate metadate extracter for a particular mimetype. |
MP3MetadataExtracter | Extracts the following values from MP3 files: |
MP3MetadataExtracterTest | Test for the MP3 metadata extraction from id3 tags. |
OfficeMetadataExtracter | Office file format Metadata Extracter. |
OfficeMetadataExtracterTest | |
OpenDocumentMetadataExtracter | Metadata extractor for the
MIMETYPE_OPENDOCUMENT_XXX
mimetypes. |
OpenDocumentMetadataExtracterTest | |
OpenOfficeMetadataExtracter | Extracts values from Star Office documents into the following: |
OpenOfficeMetadataExtracterTest | Note - this test can sometimes fail if run on its own, as there can be a race condition with the OO process. |
PdfBoxMetadataExtracter | Metadata extractor for the PDF documents. |
PdfBoxMetadataExtracterTest | |
PoiMetadataExtracter | POI-based metadata extractor for Office 07 documents. |
PoiMetadataExtracterTest | |
RFC822MetadataExtracter | Metadata extractor for RFC822 mime emails. |
RFC822MetadataExtracterTest | Test for the RFC822 (imap/mbox) extractor |
TikaAutoMetadataExtracter | A Metadata Extractor which makes use of the Apache Tika auto-detection to select the best parser to extract the metadata from your document. |
TikaAutoMetadataExtracterTest | |
TikaPoweredMetadataExtracter | The parent of all Metadata Extractors which use Apache Tika under the hood. |
TikaPoweredMetadataExtracter.HeadContentHandler | This content handler will capture entries from within the header of the Tika content XHTML, but ignore the rest. |
TikaPoweredMetadataExtracter.MapCaptureContentHandler | This content handler will grab all tags and attributes, and record the textual content of the last seen one of them. |
TikaPoweredMetadataExtracter.NullContentHandler | A content handler that ignores all the content it finds. |
TikaSpringConfiguredMetadataExtracter | A Metadata Extractor which makes use of Apache Tika, and allows the selection of the Tika parser to be sprung-in to extract the metadata from your document. |
|
|||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |