The Problem
Meet Anna, a marketing analyst who spends countless hours each week manually visiting company websites to extract their social media profiles. She combs through webpages, clicks links, and copies URLs, a tedious process prone to mistakes and delays. With hundreds of companies to analyze, Anna often finds herself overwhelmed, losing 10+ hours weekly that could be better spent on campaign strategy than on data gathering.
This is exactly the challenge this n8n workflow solves: it combines AI-powered crawling with smart URL extraction to retrieve social media links autonomously, quickly, and accurately.
What This Automation Does
When you run this workflow, here’s what happens step-by-step:
- Fetches company names and websites from a Supabase database table as input.
- Crawls each company website using an OpenAI-powered AI crawler agent configured with tools to retrieve all page text and URLs.
- Extracts social media profile links from the website content and links, normalizing and filtering the URLs.
- Formats the extracted data into a unified JSON structure listing social media platforms and their URLs (see the sketch after this list).
- Saves the collected social media information back into a Supabase database table for further use or analytics.
- Handles URL and HTML content processing with built-in nodes like HTTP Request, HTML extraction, and Markdown conversion to ensure reliable data parsing and cleanup.
Overall, this workflow saves many hours of manual data gathering, improves accuracy, and can scale effortlessly to process hundreds or thousands of company profiles.
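For reference, here is a minimal sketch of the unified structure this guide assumes the workflow produces for each company. The field names are illustrative, not fixed by n8n; match them to whatever your prompt and tables actually use:

```typescript
// Illustrative shape of the unified JSON assembled per company (assumed field names).
interface SocialProfile {
  platform: string; // e.g. "linkedin", "instagram", "x"
  url: string;      // normalized absolute profile URL
}

// Example of what one company's extracted profiles might look like:
const example: SocialProfile[] = [
  { platform: "linkedin", url: "https://www.linkedin.com/company/acme" },
  { platform: "instagram", url: "https://www.instagram.com/acme" },
];
```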
Prerequisites ⚙️
- n8n account (cloud or self-hosted) 🔌
- Supabase account with two tables: companies_input and companies_output 🔐
- OpenAI API key configured in n8n for the GPT-4o model 🔑
- Basic knowledge of n8n workflow creation and credentials setup
- Optional: Proxy service for better web crawling performance
Step-by-Step Guide
1. Set up Supabase tables for input and output
Create two tables – companies_input with fields for company name and website URL, and companies_output to store extracted social media profiles.
Ensure your API credentials for Supabase are ready to connect in n8n.
Common mistake: if the field names for company name and website in your tables don't exactly match the mappings used in later nodes, data won't flow correctly.
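As a reference, here is a minimal sketch of the row shapes this guide assumes for the two tables. The column names are illustrative; what matters is that the names you choose match the field mappings in the Set and Insert nodes later on:

```typescript
// Assumed row shapes for the two Supabase tables (names are illustrative).

// companies_input: the list of companies to crawl.
interface CompanyInputRow {
  id: number;
  name: string;    // company name
  website: string; // company website URL
}

// companies_output: where extracted profiles are written back.
interface CompanyOutputRow {
  id?: number;     // let Supabase generate the primary key
  name: string;
  website: string;
  socialmedias: { platform: string; url: string }[]; // stored as a JSON column
}
```

If you create the tables in the Supabase SQL editor, a JSON/JSONB column works well for the social media field, since Supabase runs on PostgreSQL.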
2. Add a Manual Trigger Node
In the n8n editor, click Add Node → Core Nodes → Manual Trigger. This lets you start the workflow manually.
You should see a manual trigger node on the canvas ready to connect.
3. Configure Supabase “Get All” Node to fetch companies
Click Add Node → Supabase → Get All. Set table to companies_input.
Connect Manual Trigger output to this node.
This fetches all companies from your database to process.
4. Use Set Node to select only company name and website
Add a Set node and configure it to include only name and website fields. Connect output of Supabase “Get All” node here.
This helps focus the workflow on necessary inputs only.
5. Add the LangChain AI agent node named “Crawl website”
This is the core node where OpenAI GPT-4o is configured to act as an AI crawling agent.
Set the prompt to instruct the agent to crawl each company's website and extract its social media profile URLs.
The agent uses two helper tool workflows inside: “Text” to get all page text and “URLs” to get all links on the site.
Ensure “retryOnFail” is enabled to handle transient errors.
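The exact prompt wording is up to you; the sketch below shows the kind of instruction the agent could be given. The wording is only a starting point, and the `{{ $json.website }}` / `{{ $json.name }}` placeholders assume the standard n8n expression syntax resolved per item:

```typescript
// A possible prompt for the "Crawl website" agent (illustrative wording only).
const crawlPrompt = `
You are a web crawler. Visit the website {{ $json.website }} of the company "{{ $json.name }}".
Use the "Text" tool to read page content and the "URLs" tool to list links on the site.
Return ONLY the social media profile URLs you find (LinkedIn, X/Twitter, Facebook,
Instagram, YouTube, TikTok) as a JSON array of { "platform": ..., "url": ... } objects.
If no profiles are found, return an empty array.
`;
```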
6. Implement the Text tool workflow
This workflow fetches the entire HTML content from a URL and converts it to markdown text to be analyzed.
The steps: set the domain, add the protocol if it is missing, fetch the HTML with an HTTP Request node, then convert it to clean text with a Markdown node.
The resulting markdown is returned to the AI agent as the tool's output.
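In code terms, the Text tool does roughly the following. This is only a sketch of the same logic; the helper name `htmlToMarkdown` stands in for the Markdown node and is not an actual n8n function:

```typescript
// Rough sketch of the Text tool's logic (illustrative, not the real node chain).
async function getPageText(domain: string): Promise<string> {
  // 1. Add a protocol if the stored website lacks one.
  const url = /^https?:\/\//i.test(domain) ? domain : `https://${domain}`;

  // 2. Fetch the raw HTML (done by the HTTP Request node in the real workflow).
  const response = await fetch(url, {
    headers: { "User-Agent": "Mozilla/5.0 (compatible; n8n-crawler)" },
  });
  const html = await response.text();

  // 3. Convert HTML to markdown so the agent receives clean, readable text
  //    (handled by the Markdown node inside n8n).
  return htmlToMarkdown(html);
}

// Placeholder standing in for the Markdown node's conversion.
declare function htmlToMarkdown(html: string): string;
```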
7. Implement the URLs tool workflow
This workflow scrapes the webpage for anchor tag URLs using an HTML extraction node, then splits out the URLs to individual items.
It removes duplicates, empty hrefs, invalid URLs, and aggregates cleaned URL data.
This data is then fed back to the AI agent’s crawling logic for deeper exploration.
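In code terms, the cleanup the URLs tool performs looks roughly like this. It is a sketch of the filtering logic only, not the actual node configuration:

```typescript
// Sketch of the URL cleanup performed by the URLs tool (illustrative).
function cleanUrls(hrefs: string[], baseUrl: string): string[] {
  const seen = new Set<string>();
  const cleaned: string[] = [];

  for (const href of hrefs) {
    // Drop empty hrefs and non-page links such as anchors and mailto:.
    if (!href || href.startsWith("#") || href.startsWith("mailto:")) continue;

    let absolute: string;
    try {
      // Resolve relative links against the page URL; invalid URLs throw and are skipped.
      absolute = new URL(href, baseUrl).toString();
    } catch {
      continue;
    }

    // Keep each URL only once.
    if (!seen.has(absolute)) {
      seen.add(absolute);
      cleaned.push(absolute);
    }
  }
  return cleaned;
}
```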
8. Parse the AI agent’s JSON output
Use the LangChain JSON Parser node with a defined JSON schema matching expected social media platform and URL arrays.
The parsed result is assigned to an array field for further downstream processing.
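A minimal example of a schema that would match the output shape described earlier is shown below. The property names are assumptions; align them with whatever your prompt asks the model to return:

```typescript
// Example JSON Schema for the parser node (illustrative property names).
const socialMediaSchema = {
  type: "object",
  properties: {
    socialmedias: {
      type: "array",
      items: {
        type: "object",
        properties: {
          platform: { type: "string" },
          url: { type: "string" },
        },
        required: ["platform", "url"],
      },
    },
  },
  required: ["socialmedias"],
};
```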
9. Use Set node to assign social media array and merge data
Map the parsed social media data into a new field, then merge it with the input company data using a Merge node in combine mode, matching items by position.
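Combining by position simply pairs the first parsed result with the first input company, the second with the second, and so on. Conceptually (illustrative field names again):

```typescript
// Conceptual view of "combine by position": item i of each branch is merged into one item.
const companies = [{ name: "Acme", website: "acme.example.com" }];
const parsed = [
  { socialmedias: [{ platform: "linkedin", url: "https://www.linkedin.com/company/acme" }] },
];

const merged = companies.map((company, i) => ({ ...company, ...parsed[i] }));
// merged[0] now holds name, website, and socialmedias together, ready for the insert step.
```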
10. Save data back to Supabase
Add a Supabase Insert node configured to insert or update rows in the companies_output table.
This commits the extracted social media data for later review or automation tasks.
11. Activate and test workflow
Once all nodes are configured with credentials, run a manual test by triggering the workflow.
You should see social media links extracted and saved in your Supabase output table.
Common mistakes: Incorrect API keys, missing URL protocols, or malformed JSON parser schema can cause failures.
Customizations ✏️
- Change extracted data type: Modify the AI prompt in the “Crawl website” node to extract contact emails or phone numbers instead of social profiles.
- Use different databases: Replace Supabase nodes with Airtable, Google Sheets, or MySQL nodes as preferred database sources.
- Add proxy support: In the HTTP Request nodes inside the Text and URLs tool workflows, configure proxy settings for scraping hard-to-access sites.
- Expand to multi-page crawls: Extend the URLs tool workflow to recursively visit linked pages within a domain for richer data extraction (a rough sketch follows this list).
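For the multi-page idea, a bounded same-domain crawl could look roughly like this. It is a sketch under assumed constraints (a small page limit, naive regex link extraction); in practice you would build it from additional HTTP Request and HTML nodes or an n8n Code node:

```typescript
// Rough sketch of a bounded same-domain crawl (illustrative only).
async function crawlSameDomain(startUrl: string, maxPages = 5): Promise<string[]> {
  const visited = new Set<string>();
  const queue = [startUrl];
  const pages: string[] = [];

  while (queue.length > 0 && visited.size < maxPages) {
    const url = queue.shift()!;
    if (visited.has(url)) continue;
    visited.add(url);

    // Fetch the page and keep its HTML for later extraction.
    const html = await (await fetch(url)).text();
    pages.push(html);

    // Naively extract href values and enqueue only same-domain links.
    const links = Array.from(html.matchAll(/href="([^"]+)"/g), (m) => m[1]);
    for (const link of links) {
      try {
        const next = new URL(link, url);
        if (next.hostname === new URL(startUrl).hostname) queue.push(next.toString());
      } catch {
        /* ignore invalid URLs */
      }
    }
  }
  return pages;
}
```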
Troubleshooting 🔧
Problem: “HTTP request fails with 403 or timeouts”
Cause: The target website blocks automated scraping, or the User-Agent header is missing.
Solution: Add a browser-like User-Agent header or proxy settings under the HTTP Request node's options, then retest.
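For example, a browser-like header set you could add under the HTTP Request node's header options might look like this. The values are illustrative; any mainstream browser User-Agent string works:

```typescript
// Illustrative browser-like headers to reduce 403s on scraping-averse sites.
const browserHeaders = {
  "User-Agent":
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/124.0 Safari/537.36",
  "Accept": "text/html,application/xhtml+xml",
  "Accept-Language": "en-US,en;q=0.9",
};
```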
Problem: “AI agent returns malformed JSON”
Cause: Prompt or schema mismatch causing parsing errors.
Solution: Verify JSON schema in the JSON Parser node matches the AI output format exactly. Simplify prompts as needed.
Problem: “Supabase insert node throws error on missing fields”
Cause: Input data does not match target table schema.
Solution: Double-check field mappings in the Set and Insert nodes to ensure all required fields exist.
Pre-Production Checklist ✅
- Validate API credentials for OpenAI and Supabase are set up correctly.
- Test HTTP requests inside tool workflows manually with sample URLs.
- Run the AI agent node in isolation using a single input to verify expected output JSON format.
- Review and confirm JSON Parser schema matches anticipated social media JSON structure.
- Create backups of Supabase tables before bulk inserts.
Deployment Guide
After final testing, toggle the workflow to Active. If you want scheduled runs instead of manual ones, swap the Manual Trigger for a Schedule Trigger node.
Use n8n’s built-in execution logs and error handling features to monitor success and exceptions.
Schedule periodic runs, or trigger the workflow whenever a company record is updated, depending on your use case.
FAQs
Can I modify this workflow to extract other data than social media links?
Yes, you can change the AI agent’s prompt and the JSON output schema in the LangChain JSON Parser node to extract different structured information like emails, phone numbers, or company summaries.
Does this workflow consume OpenAI API credits rapidly?
The AI crawling process involves multiple API calls per website, so plan accordingly. The GPT-4o model gives high-quality responses but also consumes more credits than smaller models.
Is my data secure during this automation?
Data moves directly between your n8n instance, Supabase, and the OpenAI API over HTTPS. Keep your API keys private and make sure your n8n instance is hosted securely.
Can this workflow scale to hundreds of companies?
Yes, given proper API limits and database handling, this workflow can be expanded to batch process large volumes of websites with minimal manual intervention.
Conclusion
By building this n8n workflow, you have created an autonomous AI-powered crawler that smartly navigates company websites to extract social media profile links. This automation replaces tedious manual scraping, saving you hours weekly and improving data accuracy for marketing or research purposes.
Next steps to enhance this workflow include adding multi-page recursive crawling, integrating contact info extraction, or connecting results to analytics dashboards. Dive in and make web data extraction effortless with n8n and OpenAI!