/

Document AI

&

Gemini

Integrate Document AI and Gemini

Connect Document AI and Gemini nodes in your workflow. Integrate with any tool or database and ship powerful backend logic and APIs instantly - No code required!

Node stack

Supported Document AI and Gemini Nodes

Add any other tools or your preferred database nodes. If an integration is not available generate your own using AI

Document AI Form Parser

Form Parser allows you to automatically extract fields, values, and generic entities like names, addresses, and prices from standard forms, structuring data in tables. It's ready to use without the need for training or customization, suitable for various document types. **To use this node you must first enable the [Document AI API](https://console.cloud.google.com/apis/library/documentai.googleapis.com?project=_)**

Document AI Invoice Parser

Extract text and values from invoices such as invoice number, supplier name, invoice amount, tax amount, invoice date, due date. **To use this node you must first enable the [Document AI API](https://console.cloud.google.com/apis/library/documentai.googleapis.com?project=_)**

Document AI OCR

Identify and extract text in different types of documents. This processor allows you to identify and extract text, including handwritten text, from documents in over 200 language. **To use this node you must first enable the [Document AI API](https://console.cloud.google.com/apis/library/documentai.googleapis.com?project=_)**

Document AI US License Parser

Extract fields from US Driver License, including names, dates, etc. **To use this node you must first enable the [Document AI API](https://console.cloud.google.com/apis/library/documentai.googleapis.com?project=_)**

Count Tokens in Prompt

When using long prompts, it might be useful to count tokens before sending any content to the model.

Gemini Text Generator

Make an API call to the Generative Language Model endpoint

Generate Embedding

Generate Embeddings from text input and represent text (words, sentences, and blocks of text) in a vectorized formusing Gemini AI

Multimodal

Use Google's Gemini AI to generate text from text-only or text-and-image input. [Full documentation](https://cloud.google.com/vertex-ai/docs/generative-ai/start/quickstarts/quickstart-multimodal).

Stream Response

Generates a stream of response text using Google's Generative AI with a given prompt

Quick start

How to integrate Document AI and Gemini

Step 1 — Add the nodes to your workflow

Create a new workflow in BuildShip, click “Add node”, and select the Document AI and Gemini actions you want to use.

Step 2 — Configure each node

Go to each node to authenticate (or add your API key) and fill in the required parameters.

Step 3 — Connect the nodes

Each node in BuildShip can connect to others by using their output variables. When you reference a variable from one node in another, BuildShip automatically links them in the workflow.

Step 4 — Test your workflow

Define your starting data in the Inputs node and choose what to do with the result in the Flow Output node. Finally, run a test to see your workflow in action.

blog posts & tutorials

Recommended Reads

Below are recommneded blogs that will help in your journey