|
|||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | ||||||||
| Interface Summary | |
|---|---|
| MetadataExtracter | Interface for document property extracters. |
| OpenOfficeMetadataWorker | An interface that allows separation between the metadata extractor registry and the third party subsystem owning the open office connection. |
| Class Summary | |
|---|---|
| AbstractMappingMetadataExtracter | Support class for metadata extracters that support dynamic and config-driven mapping between extracted values and model properties. |
| AbstractMetadataExtracter | Deprecated. Use the AbstractMappingMetadataExtracter |
| AbstractMetadataExtracterTest | |
| DefaultOpenOfficeMetadataWorker | The class doing the actual work of the OpenOfficeMetadataExtracter, based around an OpenOffice connection. |
| DWGMetadataExtracter | Metadata extractor for the
MIMETYPE_DWG
mimetype. |
| DWGMetadataExtracterTest | |
| HtmlMetadataExtracter | Extracts the following values from HTML documents: |
| HtmlMetadataExtracterTest | |
| MailMetadataExtracter | Outlook MAPI format email meta-data extractor extracting the following values: |
| MailMetadataExtracterTest | |
| MappingMetadataExtracterTest | |
| MappingMetadataExtracterTest.DummyMappingMetadataExtracter | A spoofed-up extracter that extracts the following: |
| MetadataExtracter.OverwritePolicy | A enumeration of functional property overwrite policies. |
| MetadataExtracterRegistry | Holds and provides the most appropriate metadate extracter for a particular mimetype. |
| MP3MetadataExtracter | Extracts the following values from MP3 files: |
| MP3MetadataExtracterTest | Test for the MP3 metadata extraction from id3 tags. |
| OfficeMetadataExtracter | Office file format Metadata Extracter. |
| OfficeMetadataExtracterTest | |
| OpenDocumentMetadataExtracter | Metadata extractor for the
MIMETYPE_OPENDOCUMENT_XXX
mimetypes. |
| OpenDocumentMetadataExtracterTest | |
| OpenOfficeMetadataExtracter | Extracts values from Star Office documents into the following: |
| OpenOfficeMetadataExtracterTest | Note - this test can sometimes fail if run on its own, as there can be a race condition with the OO process. |
| PdfBoxMetadataExtracter | Metadata extractor for the PDF documents. |
| PdfBoxMetadataExtracterTest | |
| PoiMetadataExtracter | POI-based metadata extractor for Office 07 documents. |
| PoiMetadataExtracterTest | |
| RFC822MetadataExtracter | Metadata extractor for RFC822 mime emails. |
| RFC822MetadataExtracterTest | Test for the RFC822 (imap/mbox) extractor |
| TikaAutoMetadataExtracter | A Metadata Extractor which makes use of the Apache Tika auto-detection to select the best parser to extract the metadata from your document. |
| TikaAutoMetadataExtracterTest | |
| TikaPoweredMetadataExtracter | The parent of all Metadata Extractors which use Apache Tika under the hood. |
| TikaPoweredMetadataExtracter.HeadContentHandler | This content handler will capture entries from within the header of the Tika content XHTML, but ignore the rest. |
| TikaPoweredMetadataExtracter.MapCaptureContentHandler | This content handler will grab all tags and attributes, and record the textual content of the last seen one of them. |
| TikaPoweredMetadataExtracter.NullContentHandler | A content handler that ignores all the content it finds. |
|
|||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | ||||||||