Automate Image Analysis with Local Ollama Vision Models in n8n

Struggling to extract detailed insights from images for business or research? This n8n workflow automates exhaustive image analysis using multiple local Ollama Vision Models, delivering structured, markdown-formatted results saved directly to Google Docs for collaboration and ease of use.
manualTrigger
httpRequest
googleDocs
+6
Workflow Identifier: 2210
NODES in Use: manualTrigger, set, googleDrive, extractFromFile, splitOut, splitInBatches, httpRequest, googleDocs, stickyNote

Press CTRL+F5 if the workflow didn't load.

Learn how to Build this Workflow with AI:

Visit through Desktop for Best experience

What this workflow does

This workflow takes an image ID from Google Drive and pulls that image down.

Then, it turns the image into a base64 string, so it can be sent easily in a request.

It picks local Ollama vision models one by one and sends the image with a clear prompt to each model.

These models give a detailed report about things in the image, the setting, text seen, and spatial details.

Finally, it collects all reports and saves them into a Google Docs file as markdown text.

This saves time and errors from doing image analysis by hand, giving clear, formatted notes automatically.


Who should use this workflow

If you work a lot with images and need accurate, full descriptions fast, this is for you.

People who want to analyze images to report objects, settings, text inside images in one easy step will find this helpful.

You must have access to Google Drive images and want to save results in Google Docs for teams.


Tools and services used

  • n8n: Automates the workflow processing.
  • Local Ollama API: Runs vision language models on your computer.
  • Google Drive API: Downloads target images by file ID.
  • Google Docs API: Updates documents with model analysis.
  • OAuth2 Credentials: For secure Google services access.

Inputs, processing steps, and outputs explained

Inputs

  • Google Drive file ID of the desired image.
  • Predefined list of local Ollama vision model names.

Processing Steps

  • Download image binary from Google Drive using the file ID.
  • Convert image binary data to base64 string format.
  • Loop over each model name from the list.
  • Build a JSON request with the base64 image and detailed prompt asking the model to analyze objects, text, context, spatial relations, and write markdown.
  • Post request to local Ollama API to get the model’s markdown analysis.
  • Gather all model responses and prepare them for the next step.
  • Send the combined markdown reports to a specified Google Docs file.

Outputs

  • A Google Docs document containing rich, detailed markdown descriptions from each Ollama vision model.

Beginner step-by-step: How to run this workflow in n8n

1. Import the workflow

  1. Download the workflow file using the Download button on this page.
  2. In the n8n editor, click the menu and select “Import from File.” Choose the downloaded workflow file.

2. Configure credentials and inputs

  1. Add your Google Drive OAuth2 credentials to the Google Drive node.
  2. Add your Google Docs OAuth2 credentials to the Google Docs node.
  3. Update the “Google Doc Image Id” Set node parameter “id” with the file ID of the image to analyze.
  4. Update the Google Docs URL in the Google Docs node for saving analysis.

3. Run and test

  1. Click the manual trigger Manual Trigger node to start the workflow.
  2. Watch the execution to ensure no errors happen and outputs look correct.

4. Activate workflow for production use

  1. After successful test runs, activate the workflow by toggling it to “Active.”
  2. Set up triggers or manual runs as desired to automate image analyses.
  3. Consider running n8n in a reliable self-host n8n environment for better uptime.

Customizations and improvements

  • Replace Google Drive node with other file storage nodes if images are elsewhere.
  • Edit the “General Image Prompt” node to change or focus the analysis details asked from the models.
  • Add Slack, email, or messaging nodes after analysis to notify teams instantly.
  • Change the model list in “List of Vision Models” to add or remove Ollama models as needed.
  • Adjust batch sizes and timing in batching nodes to manage high volume processing smoothly.

Handling common issues

500 error from Ollama API

Check if local Ollama is running and accessible at the set URL.

Restart Ollama or fix the URL if needed.

Google Drive image download fails

Verify Google Drive file ID is correct and OAuth2 credentials allow file access.

Empty responses or no output from models

Inspect JSON request body formatting and confirm image converts to base64 properly.


Final notes

This workflow automates deep image analysis without manual work.

You will get markdown reports saved automatically in Google Docs for easy sharing.

Testing the workflow and managing credentials carefully will ensure smooth runs.

Expand or adjust model prompts and list based on analysis needs.


Summary

✓ Automates detailed image analysis with local Ollama models.

✓ Converts Google Drive images to base64 for multimodel processing.

✓ Generates markdown reports saved into Google Docs for collaboration.

→ Saves time and reduces errors compared to manual photo review.

→ Fits well for teams needing consistent, thorough image descriptions.


Frequently Asked Questions

Add Google Drive and Google Docs OAuth2 credentials, update the image file ID and Google Docs URL, then test and activate the workflow.
It downloads the image binary, converts it to a base64 string, then embeds this string in a JSON request sent via HTTP POST to the local Ollama API.
That means the local Ollama service might not be running or reachable; the user should verify the service status and URL settings.
Yes, by replacing the Google Drive node with nodes for other storage services and updating credentials accordingly.

Promoted by BULDRR AI

Related Workflows

Automate Viral UGC Video Creation Using n8n + Degaus (Beginner-Friendly Guide)

Learn how to automate viral UGC video creation using n8n, AI prompts, and Degaus. This beginner-friendly guide shows how to import, configure, and run the workflow without technical complexity.
Form Trigger
Google Sheets
Gmail
+37
Free

AI SEO Blog Writer Automation in n8n

A complete beginner guide to building an AI-powered SEO blog writer automation using n8n.
AI Agent
Google Sheets
httpRequest
+5
Free

Automate CrowdStrike Alerts with VirusTotal, Jira & Slack

This workflow automates processing of CrowdStrike detections by enriching threat data via VirusTotal, creating Jira tickets for incident tracking, and notifying teams on Slack for quick response. Save hours daily by transforming complex threat data into actionable alerts effortlessly.
scheduleTrigger
httpRequest
jira
+5
Free

Automate Telegram Invoices to Notion with AI Summaries & Reports

Save hours on financial tracking by automating invoice extraction from Telegram photos to Notion using Google Gemini AI. This workflow extracts data, records transactions, and generates detailed spending reports with charts sent on schedule via Telegram.
lmChatGoogleGemini
telegramTrigger
notion
+9
Free

Automate Email Replies with n8n and AI-Powered Summarization

Save hours managing your inbox with this n8n workflow that uses IMAP email triggers, AI summarization, and vector search to draft concise replies requiring minimal review. Automate business email processing efficiently with AI guidance and Gmail integration.
emailReadImap
vectorStoreQdrant
emailSend
+12
Free

Automate Email Campaigns Using n8n with Gmail & Google Sheets

This n8n workflow automates personalized email outreach campaigns by integrating Gmail and Google Sheets, saving hours of manual follow-up work and reducing errors in email sequences. It ensures timely follow-ups based on previous email interactions, optimizing communication efficiency.
googleSheets
gmail
code
+5
Free