Integrate Document AI and Gemini to automate workflows with scalable backend

Connect Document AI and Gemini nodes to in your workflow. Integrate with any tool or database and ship powerful backend logic and APIs instantly - No code required!

getting started

How To Connect Document AI and Gemini

Supported Triggers & Actions

Document AI NODES

Document AI Form Parser

Form Parser allows you to automatically extract fields, values, and generic entities like names, addresses, and prices from standard forms, structuring data in tables. It's ready to use without the need for training or customization, suitable for various document types. **To use this node you must first enable the [Document AI API](https://console.cloud.google.com/apis/library/documentai.googleapis.com?project=_)**

Document AI Invoice Parser

Extract text and values from invoices such as invoice number, supplier name, invoice amount, tax amount, invoice date, due date. **To use this node you must first enable the [Document AI API](https://console.cloud.google.com/apis/library/documentai.googleapis.com?project=_)**

Document AI OCR

Identify and extract text in different types of documents. This processor allows you to identify and extract text, including handwritten text, from documents in over 200 language. **To use this node you must first enable the [Document AI API](https://console.cloud.google.com/apis/library/documentai.googleapis.com?project=_)**

Document AI US License Parser

Extract fields from US Driver License, including names, dates, etc. **To use this node you must first enable the [Document AI API](https://console.cloud.google.com/apis/library/documentai.googleapis.com?project=_)**

Gemini NODES

Count Tokens in Prompt

When using long prompts, it might be useful to count tokens before sending any content to the model.

script

Gemini Text Generator

Make an API call to the Generative Language Model endpoint

script

Generate Embedding

Generate Embeddings from text input and represent text (words, sentences, and blocks of text) in a vectorized formusing Gemini AI

script

Multimodal

Use Google's Gemini AI to generate text from text-only or text-and-image input. [Full documentation](https://cloud.google.com/vertex-ai/docs/generative-ai/start/quickstarts/quickstart-multimodal).

script

Stream Response

Generates a stream of response text using Google's Generative AI with a given prompt

script

blog posts & tutorials

Recommended Reads

Below are recommneded blogs that will help in your journey