Automate LinkedIn Data Scraping with Bright Data & Google Gemini

This workflow automates scraping detailed LinkedIn person and company profiles using Bright Data’s MCP Server combined with AI processing via Google Gemini. It solves the manual and error-prone process of LinkedIn data extraction by efficiently gathering, structuring, and saving profile data.
Workflow Identifier: 1719
NODES in Use: manualTrigger, stickyNote, set, mcpClient, httpRequest, code, merge, aggregate, function, readWriteFile, informationExtractor, lmChatGoogleGemini

What this workflow does

This workflow automates data collection from LinkedIn person and company profiles using Bright Data MCP Client and Google Gemini AI.

It solves the problem of spending many hours manually copying profile info and creating readable stories.

The result is structured profile data, free of copy-and-paste mistakes, saved as JSON files and sent to webhooks for further use.


Who should use this workflow

This is for people who need LinkedIn profile data often, like market researchers, sales teams, or recruiters.

No special technical skills are needed, but basic familiarity with n8n helps.

It works well for people wanting to save time and get clean, clear profile data automatically.


Tools and services used

  • n8n Automation Platform: Runs and connects all workflow parts.
  • Bright Data MCP Client API: Provides reliable LinkedIn scraping via proxies.
  • Google Gemini: AI model to convert data into readable stories.
  • Webhook endpoints: Receive the data externally for use in other apps.
  • Local file system: Saves scraped JSON data as files.

Inputs, process, and output explained

Inputs

  • LinkedIn person profile URLs (set by user).
  • LinkedIn company profile URLs (set by user).
  • Valid Bright Data MCP API credentials.
  • Google Gemini API key.
  • Webhook URLs for sending data.

Processing steps

  • The MCP Client nodes call the Bright Data API to fetch person and company profiles.
  • Scraped data comes back as JSON in Markdown format.
  • A Code node parses company data from the Markdown to JSON.
  • A LangChain Information Extractor node uses Google Gemini to turn the company JSON into a story.
  • Person and company data streams merge and aggregate into one combined object.
  • The data is posted to the webhook URLs, then encoded and saved locally as JSON files.
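
The Markdown-to-JSON parsing step can be sketched as follows. This is a minimal illustration, not the workflow's actual Code node: the helper name extractJsonFromMarkdown is hypothetical, and it assumes the JSON arrives either as the whole response text or inside a fenced block within the Markdown. Check the real MCP Client output before relying on either assumption.

```javascript
// Hypothetical helper: pull a JSON payload out of a Markdown string.
// The response shape (JSON inside a fenced block) is an assumption.
const FENCE = '`'.repeat(3); // three backticks, built at runtime

function extractJsonFromMarkdown(markdown) {
  // Prefer the body of the first fenced block when one is present.
  const pattern = new RegExp(FENCE + '(?:json)?\\s*([\\s\\S]*?)' + FENCE);
  const fenced = markdown.match(pattern);
  // Otherwise fall back to treating the whole string as JSON.
  const candidate = (fenced ? fenced[1] : markdown).trim();
  return JSON.parse(candidate);
}

// Made-up sample input for illustration only.
const sample =
  'Company profile\n\n' +
  FENCE + 'json\n{"name": "Example Co", "followers": 1200}\n' + FENCE;

const company = extractJsonFromMarkdown(sample);
console.log(company.name);
```

If the response contains neither valid JSON nor a fenced block, JSON.parse throws, which is the error described under Troubleshooting.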

Outputs

  • Scraped LinkedIn person profile JSON saved as “d:\LinkedIn-Person.json” on disk.
  • Scraped LinkedIn company profile JSON saved as “d:\LinkedIn-Company.json”.
  • Both outputs are also sent as HTTP POST requests to the configured webhooks.
  • Company stories enhanced by AI for easier reading.
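
The merge, aggregate, and encode steps described under Processing steps could look roughly like this in plain JavaScript. The sample records, the function names, and the choice of base64 as the encoding are all assumptions made for illustration; the workflow's own nodes may structure and encode the data differently.

```javascript
// Hypothetical sample records; real field names depend on the scraper output.
const personProfile = { name: 'Jane Doe', headline: 'Engineer' };
const companyProfile = { name: 'Example Co', story: 'A short AI-written summary.' };

// Merge + aggregate: collect both data streams into one combined object.
function combineProfiles(person, company) {
  return { person, company };
}

// Encode: serialize to JSON text, then to base64 (assumed encoding).
function encodeAsBase64Json(data) {
  return Buffer.from(JSON.stringify(data, null, 2), 'utf8').toString('base64');
}

// Decode helper, used here only to confirm the round trip is lossless.
function decodeBase64Json(encoded) {
  return JSON.parse(Buffer.from(encoded, 'base64').toString('utf8'));
}

const combined = combineProfiles(personProfile, companyProfile);
const encoded = encodeAsBase64Json(combined);
console.log(decodeBase64Json(encoded).company.name);
```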

Beginner step-by-step: How to use this workflow inside n8n for production

Step 1: Get the workflow file

  1. Download the workflow file using the Download button on this page.
  2. Open the n8n editor where you want to run the automation.
  3. Use “Import from File” and select the downloaded workflow.

Step 2: Configure credentials and URLs

  1. Add your Bright Data MCP Client API Key under Credentials in n8n.
  2. Enter your Google Gemini API Key in the AI node credentials.
  3. Update the Set nodes labeled “Set the URLs” and “Set the LinkedIn Company URL” by replacing sample LinkedIn profile URLs with ones you need.
  4. Replace webhook URLs in the same Set nodes with your actual webhook endpoints.
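
Before pasting URLs into the Set nodes, it can help to confirm that each one is the kind of profile you expect. The sketch below is an optional check run outside the workflow. The function name classifyLinkedInUrl is hypothetical, and it relies only on the usual LinkedIn path prefixes: /in/ for people and /company/ for companies.

```javascript
// Hypothetical pre-flight check for URLs entered in the Set nodes.
function classifyLinkedInUrl(value) {
  let url;
  try {
    url = new URL(value);
  } catch {
    return 'invalid'; // not a parseable absolute URL
  }
  const host = url.hostname.toLowerCase();
  if (host !== 'linkedin.com' && !host.endsWith('.linkedin.com')) {
    return 'invalid'; // not a LinkedIn host
  }
  if (url.pathname.startsWith('/in/')) return 'person';
  if (url.pathname.startsWith('/company/')) return 'company';
  return 'unknown'; // a LinkedIn page, but not a person or company profile
}

console.log(classifyLinkedInUrl('https://www.linkedin.com/in/example/'));
console.log(classifyLinkedInUrl('https://www.linkedin.com/company/example/'));
```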

Step 3: Test the workflow

  1. Run the workflow manually by clicking the Manual Trigger node and hitting “Execute Workflow”.
  2. Monitor each node’s output to verify there are no errors and data looks correct.

Step 4: Activate for production use

  1. Turn the workflow ON from the editor’s top right corner.
  2. Use n8n’s scheduling or external triggers to run this workflow automatically as needed.
  3. Make sure n8n has permission to write files to the configured disk path, or update the path in the Read & Write File nodes.
  4. If you self-host n8n, consult a guide to self-hosting n8n for proper server setup.

Customizations ✏️

  • Change LinkedIn URLs: Update url values in the Set the URLs nodes to target any LinkedIn profiles.
  • Modify output file paths: Change file names and paths in the Read & Write File nodes to save JSON files wherever preferred.
  • Switch AI model: Replace Google Gemini with another AI language node if desired, updating prompts accordingly.
  • Add data fields: Adjust MCP Client tool parameters for more LinkedIn profile details if supported.
  • Webhook integration: Point webhook URLs to your APIs or services for further automation.

Troubleshooting 🔧

Problem: “MCP Client API authentication failed”

This happens if the Bright Data MCP API Key is invalid or expired.

Fix by updating the MCP Client API credentials in n8n with a valid key from Bright Data.

Problem: “JSON.parse error in Code node”

This means the input JSON string is malformed or the API response format has changed.

Check the MCP Client node output to confirm the JSON format, and adjust the parsing code in the Code node if needed.
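
One way to make this failure easier to diagnose is to wrap the parse call so that it reports the error along with a preview of the text it received. This is a minimal sketch rather than the workflow's own code; safeParse is a hypothetical helper name.

```javascript
// Hypothetical wrapper: never throws, returns a result object instead.
function safeParse(raw) {
  try {
    return { ok: true, value: JSON.parse(raw) };
  } catch (err) {
    return {
      ok: false,
      error: err.message,
      // The first characters of the input usually reveal a format change.
      preview: String(raw).slice(0, 80),
    };
  }
}

const good = safeParse('{"name": "Example Co"}');
const bad = safeParse('# Company profile');
console.log(good.ok, bad.ok, bad.preview);
```

A preview that starts with a Markdown heading or other non-JSON text suggests the response needs unwrapping before it is parsed.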

Problem: “File write permission denied”

This occurs if the user running n8n lacks permission to write the JSON files to the specified folders.

Change output folder to one with write access or fix OS file permissions for the n8n process.
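
To check a folder before pointing the workflow at it, a small standalone Node.js script such as the one below can help. It is a sketch meant to be run outside n8n under the same OS user as the n8n process; canWriteTo is a hypothetical name. The check is approximate, because on Windows fs.accessSync does not evaluate access control lists.

```javascript
const fs = require('fs');
const os = require('os');

// Returns true when the current OS user appears able to write to dir.
function canWriteTo(dir) {
  try {
    fs.accessSync(dir, fs.constants.W_OK);
    return true;
  } catch {
    return false; // missing folder or insufficient permissions
  }
}

// The temp folder is used here only as a stand-in for your output folder.
console.log(canWriteTo(os.tmpdir()));
```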


Pre-production checklist ✅

  • Verify that the Bright Data MCP Client API Key is correct and active.
  • Confirm that the Google Gemini API Key works and has usage quota.
  • Test that the webhook endpoints accept HTTP POST requests.
  • Run the workflow manually and check every node for success or errors.
  • Ensure the file write paths are accessible to n8n.

Summary of benefits and results

✓ Saves many hours of manual LinkedIn research and data entry.

✓ Gets clean, structured LinkedIn profiles fast.

✓ Removes errors caused by copying and pasting.

✓ Provides AI-enhanced readable stories from raw data.

→ Enables smart, quick decisions based on fresh LinkedIn insights.

→ Outputs organized JSON files plus webhook delivery for extra automation.


Frequently Asked Questions

LinkedIn blocks direct scraping attempts. The Bright Data MCP Client uses proxies and handles the compliance needed to scrape LinkedIn ethically.
If the API keys are invalid or expired, the workflow’s MCP Client nodes fail to authenticate and cannot fetch data. Update the API keys to fix this.
Scraped LinkedIn person and company profile data are saved as JSON files on disk. The default paths are d:\LinkedIn-Person.json and d:\LinkedIn-Company.json.
To get started, import the workflow file inside the n8n editor, enter the required API keys and webhook URLs, then run the Manual Trigger node to test. Activate the workflow to run it automatically.
