`DocumentToImageContent` component #9263

anakin87 · 2025-04-17T09:53:54Z

We should add a companion to FileToImageContent, called DocumentToImageContent, that accepts Document objects directly and lets the user specify which meta field holds the image path.

It should also support a meta field called page_number. For PDFs, we rarely want to send the whole file to the vision LLM, only a single page (or perhaps a few pages).
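To make the request concrete, here is a minimal sketch of the lookup step only: reading the image path and the optional page_number from each Document's meta. It is not an existing implementation. The `Document` class is a stand-in dataclass, and `ImageSource`, `resolve_image_sources`, and the `file_path_meta_field` parameter with its `"file_path"` default are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass, field
from pathlib import Path
from typing import Any, Dict, List, Optional


@dataclass
class Document:
    """Stand-in for a Document object; only the `meta` dict matters here."""
    content: Optional[str] = None
    meta: Dict[str, Any] = field(default_factory=dict)


@dataclass
class ImageSource:
    """Hypothetical record: which file to convert and, optionally, which page."""
    path: Path
    page_number: Optional[int]


def resolve_image_sources(
    documents: List[Document],
    file_path_meta_field: str = "file_path",  # assumed default, not confirmed
) -> List[ImageSource]:
    """Read the image path and optional page_number from each document's meta."""
    sources = []
    for doc in documents:
        raw_path = doc.meta.get(file_path_meta_field)
        if raw_path is None:
            raise ValueError(
                f"Document meta has no '{file_path_meta_field}' field."
            )
        # page_number is optional; None means no page was specified.
        page_number = doc.meta.get("page_number")
        sources.append(ImageSource(path=Path(raw_path), page_number=page_number))
    return sources
```

For example, a Document with `meta={"file_path": "report.pdf", "page_number": 3}` would resolve to the path `report.pdf` with page 3, while an image Document without page_number would resolve with `page_number=None`. How a PDF without page_number should be handled is left open here.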

Depends on #9258.

@sjrl has background on this.

sjrl · 2025-04-17T11:31:54Z

Here are the analogous components that we use at deepset:

DeepsetPDFDocumentToBase64Image for PDF-based Documents
DeepsetFiletoBase64Image for PNG-, JPG-, JPEG-, and GIF-based Documents

These could probably be combined so we don't need separate components for the different file types.
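One way a combined component could decide how to treat each file is to branch on the file extension. The sketch below is only an illustration of that idea, using the file types listed above; `classify_source` and the two suffix sets are hypothetical names, not part of any existing component.

```python
from pathlib import Path

# File types taken from the two components listed above.
PDF_SUFFIXES = {".pdf"}
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".gif"}


def classify_source(file_path: str) -> str:
    """Return "pdf" or "image" based on the file extension (case-insensitive)."""
    suffix = Path(file_path).suffix.lower()
    if suffix in PDF_SUFFIXES:
        return "pdf"
    if suffix in IMAGE_SUFFIXES:
        return "image"
    raise ValueError(f"Unsupported file type: '{suffix or file_path}'")
```

A single component could then call the PDF path (rendering the requested page) or the plain image path depending on the result, instead of requiring the user to pick a component per file type.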

sjrl mentioned this issue Apr 17, 2025

ImageFileToImageContent and PDFToImageContent conversion components #9262

Open

julian-risch added the "P2: Medium priority, add to the next sprint if no P1 available" and "P1: High priority, add to the next sprint" labels and removed the "P2: Medium priority, add to the next sprint if no P1 available" label Apr 17, 2025

julian-risch assigned sjrl Apr 22, 2025
