Build a $10,000 RAG system using Gemini + Claude Code

RAG

STEP 1. Understand what you’re building

You are creating a system where:

→ Text, images, videos, documents live in one database

→ All data gets embedded into the same vector space

→ AI retrieves the most relevant pieces before answering

This is RAG

Retrieval Augmented Generation

✦ Key shift

Old systems handled text only

New systems handle meaning across formats


STEP 2. Set up your tools

You need 3 things:

  1. Gemini API → Used for embeddings → Get from Google AI Studio
  2. Pinecone → Your vector database → Stores embeddings
  3. OpenRouter or model provider → Used for chat responses
  4. Visual Studio Code → Your working environment
  5. Claude Code → Builds everything for you

STEP 3. Create your project

→ Open VS Code

Install Claude Code extension

→ Open a new folder

Now open Claude Code panel

Switch to plan mode

Paste documentation link for Gemini embeddings

Then prompt:

“Build a multimodal RAG system using Gemini Embedding 2 and Pinecone.

Create env file placeholders for API keys.

Support text, images, and videos.”

Claude Code will generate:

→ Project structure

→ Dependencies

→ Step-by-step plan

Accept it


STEP 4. Add API keys

In your env file, add:

→ Gemini API key

→ Pinecone API key

→ OpenRouter or model key

Save the file

That’s it for setup


STEP 5. Add your data

Create a “data” folder

Drop in anything:

→ PDFs

→ Images

→ Videos

→ Text files

No need to organize perfectly

The system handles classification


STEP 6. Run ingestion

Prompt Claude Code:

“Process all files and store embeddings in Pinecone.

Then build a simple chat app.”

What happens behind the scenes:

→ Files get chunked

→ Gemini creates embeddings

→ Data stored in Pinecone

→ Metadata added

✦ This is where older tools like n8n get messy

Manual chunking

Separate pipelines

Frequent failures

Here, it runs in one flow


STEP 7. Test your system

Claude Code builds a local app

You open localhost

Now test queries:

→ “How do I clean the filter?”

↳ Returns steps + images from PDF

→ “What are the parts?”

↳ Pulls multiple sections + diagrams

→ Upload an image

↳ Finds similar entries in database


STEP 8. Improve retrieval quality

By default:

→ Images and videos are stored as descriptions

To improve:

Ask Claude Code:

“Add better metadata descriptions for images and videos

Update app to display media inline”

Now your system:

→ Shows images

→ Plays videos

→ Gives richer results


STEP 9. Understand limitations

Current constraints:

→ Video length limit around 120 seconds

→ Image batch limits

→ Quality depends on metadata

✦ Important

Better descriptions = better retrieval


STEP 10. Real use cases

  1. Instruction manuals → Chat with complex PDFs → Get visual answers
  2. Service businesses → Upload project images → Retrieve similar jobs with pricing
  3. Internal knowledge bases → Mix documents, videos, images → One unified search

STEP 11. What changed

Before:

→ Complex n8n pipelines

→ Manual configuration

→ Fragile systems

Now:

→ Describe system in plain language

→ AI builds it

→ You refine outputs

Mini insight

This build took under 30 minutes

Earlier versions took hours or days

STEP 1. Understand what you’re building

You are creating a system where:

→ Text, images, videos, documents live in one database

→ All data gets embedded into the same vector space

→ AI retrieves the most relevant pieces before answering

This is RAG

Retrieval Augmented Generation

✦ Key shift

Old systems handled text only

New systems handle meaning across formats


STEP 2. Set up your tools

You need 3 things:

  1. Gemini API → Used for embeddings → Get from Google AI Studio
  2. Pinecone → Your vector database → Stores embeddings
  3. OpenRouter or model provider → Used for chat responses
  4. Visual Studio Code → Your working environment
  5. Claude Code → Builds everything for you

STEP 3. Create your project

→ Open VS Code

Install Claude Code extension

→ Open a new folder

Now open Claude Code panel

Switch to plan mode

Paste documentation link for Gemini embeddings

Then prompt:

“Build a multimodal RAG system using Gemini Embedding 2 and Pinecone.

Create env file placeholders for API keys.

Support text, images, and videos.”

Claude Code will generate:

→ Project structure

→ Dependencies

→ Step-by-step plan

Accept it


STEP 4. Add API keys

In your env file, add:

→ Gemini API key

→ Pinecone API key

→ OpenRouter or model key

Save the file

That’s it for setup


STEP 5. Add your data

Create a “data” folder

Drop in anything:

→ PDFs

→ Images

→ Videos

→ Text files

No need to organize perfectly

The system handles classification


STEP 6. Run ingestion

Prompt Claude Code:

“Process all files and store embeddings in Pinecone.

Then build a simple chat app.”

What happens behind the scenes:

→ Files get chunked

→ Gemini creates embeddings

→ Data stored in Pinecone

→ Metadata added

✦ This is where older tools like n8n get messy

Manual chunking

Separate pipelines

Frequent failures

Here, it runs in one flow


STEP 7. Test your system

Claude Code builds a local app

You open localhost

Now test queries:

→ “How do I clean the filter?”

↳ Returns steps + images from PDF

→ “What are the parts?”

↳ Pulls multiple sections + diagrams

→ Upload an image

↳ Finds similar entries in database


STEP 8. Improve retrieval quality

By default:

→ Images and videos are stored as descriptions

To improve:

Ask Claude Code:

“Add better metadata descriptions for images and videos

Update app to display media inline”

Now your system:

→ Shows images

→ Plays videos

→ Gives richer results


STEP 9. Understand limitations

Current constraints:

→ Video length limit around 120 seconds

→ Image batch limits

→ Quality depends on metadata

✦ Important

Better descriptions = better retrieval


STEP 10. Real use cases

  1. Instruction manuals → Chat with complex PDFs → Get visual answers
  2. Service businesses → Upload project images → Retrieve similar jobs with pricing
  3. Internal knowledge bases → Mix documents, videos, images → One unified search

STEP 11. What changed

Before:

→ Complex n8n pipelines

→ Manual configuration

→ Fragile systems

Now:

→ Describe system in plain language

→ AI builds it

→ You refine outputs

Mini insight

This build took under 30 minutes

Earlier versions took hours or days

Author
Written By
Vikash Kumar
Building AI agents, n8n workflows and end-to-end automation for 30+ Brands across India, the US, Europe, Dubai & Australia. 7+ years of Experience saving founders real hours every week - no code required.
Author
Written By
Vikash Kumar
Building AI agents, n8n workflows and end-to-end automation for 30+ Brands across India, the US, Europe, Dubai & Australia. 7+ years of Experience saving founders real hours every week - no code required.
Ask more Questions about this Blog with AI:

Our AI Articles

Learn from our AI Articles to excel in your profession ;)

n8n AI Agent Node: Build Your First AI Agent in 15 Minutes

Learn how the n8n AI Agent node works and build your first AI agent in 15 minutes. Step-by-step beginner guide...

Best Free OpenRouter Models in 2026: Which One Should You Use?

OpenRouter has 29 free AI models as of June 2026. Picking the wrong one for your task wastes your daily...

ClawdBot Tutorial 2026: Complete Beginner Guide to Install, Configure & Run Your First AI Agent

The complete beginner guide to Clawdbot in 2026 — from installation to your first running AI agent, with config templates,...

What Are Claude Skills? A Beginner’s Guide to AI Skills in 2026

Every time you start a new conversation with Claude, it forgets everything from the last one. Your preferences, your writing...

Claude vs ChatGPT in 2026: I Tested Both on the Same 10 Real Tasks

You’re probably paying $20 a month for one of these tools and quietly wondering if the other one is better....

HOW TO GET CLAUDE TO TEACH YOU STEP BY STEP

This framework turns any “I want to do X with Claude but have no idea where to start” into a...

Claude Code Without a Subscription: 3 Free Ways to Run It in 2026

Claude Code is Anthropic’s terminal-based AI coding assistant. It edits files, runs commands, plans projects, and debugs errors — entirely...

Best Claude Prompts 2026: 75 Templates That Actually Work

If your Claude outputs feel generic, the fix isn’t switching models — it’s fixing the prompt. Most people send a...

Build n8n Workflows Without Coding Using Claude Code: Step-by-Step (2026)

For years, building an n8n automation meant dragging nodes around a canvas, guessing field names, and bouncing between docs and...

7 Claude Code prompts

7 simple Claude Code prompts to get you started...
1:1 Free Strategy Session
Your competitors are already automating. Are you still paying for it manually?

Do you want to adopt AI Automation?

Every hour your team does repetitive work, you're burning real money.
While you wait, faster businesses are cutting costs and moving quicker.
AI and automations aren't the future anymore — they're the present.

Book a live 1-on-1 session where we show you exactly which of your daily tasks can be automated — and what it’s costing you not to.