File Metadata

File Metadata Processor #

Extracts metadata from supported file types and stores the results in the document metadata.

File typeExtracted metadata
Imagecolors (top-3 dominant color names), width (px), height (px)

Configuration #

ParameterTypeRequiredDefaultDescription
message_fieldstringNomessagesPipeline context key for the input messages
output_queueobjectNonullQueue to push processed documents to

Example #

- file_metadata:
    output_queue:
      name: "documents_with_metadata"
Edit Edit this page