Scrape & Structure Hacker News ‘Who is Hiring’ Posts with n8n

Struggling to efficiently extract job listings from Hacker News ‘Who is Hiring’ posts? This n8n workflow automates scraping, cleans, and structures hiring data using OpenAI and stores it in Airtable, saving hours of manual work.
httpRequest
lmChatOpenAi
code
+8
Workflow Identifier: 2180
NODES in Use: Manual Trigger, Sticky Note, HTTP Request, Split Out, Set, Filter, Code, AI Chat Model, Structured Output Parser, Chain LLM, Airtable
Scrape Hacker News with n8n and OpenAI

Press CTRL+F5 if the workflow didn't load.

Learn how to Build this Workflow with AI:

What This Automation Does ⚙️

This workflow finds new job posts from Hacker News asking “Who is hiring?” every month.
It grabs posts from last 30 days, cleans the text, uses AI to pull out job details, and saves all in Airtable.
This saves hours of manual copying and keeps job data tidy and searchable.

It works by querying Hacker News search API for hiring posts, then fetching each full post and all job replies.
Cleaned text goes to OpenAI GPT-4o-mini to get structured data like company, role, location, and application links.
After that, the structured jobs are saved to Airtable for easy tracking.


Inputs → Processing → Outputs

Inputs

  • API keys for Algolia to query Hacker News posts
  • OpenAI API key for GPT-4o-mini model access
  • Airtable base and table where jobs are stored

Processing

  • Searches Hacker News posts titled exactly “Ask HN: Who is hiring?” within last 30 days
  • Splits search results into individual posts
  • Fetches full post data and all job-reply comments using official Hacker News API
  • Cleans text with JavaScript code removing HTML tags and fixing characters
  • Calls OpenAI GPT-4o-mini to parse cleaned text into structured job info JSON
  • Filters posts by date and applies optional testing limits

Output

  • New rows in Airtable with fields like company, job role, location, salary, job type, application URL, and description
  • A reliable, searchable job listings database updated monthly

Who Should Use This Workflow

This helps recruiters, job boards, or anyone wanting fresh tech job listings.
Users who want to stop spending hours copying job posts by hand will find this handy.
It is also good for people who want neat, structured data from raw forum posts for analysis or outreach.

If you use self-host n8n or a cloud account, you can run this regularly.
See self-host n8n for running on your own server.


Tools and Services Used

  • Hacker News Algolia API: To query relevant “Ask HN: Who is hiring?” posts.
  • Hacker News Official API: Fetch full posts and comments by ID.
  • n8n Workflow Automation: Orchestrates API calls, processing, and data flow.
  • OpenAI GPT-4o-mini: Parses unstructured job text into a clean JSON format.
  • Airtable API: Stores cleaned and structured job listings.

Beginner Step-by-Step: How to Use This Workflow in n8n

Step 1: Import the Workflow

  1. Download the workflow file using the Download button on this page.
  2. Open your n8n editor.
  3. Click on “Import from File” and select the downloaded workflow JSON.

Step 2: Add Your Credentials and API Keys

  1. Go to Credential settings in n8n.
  2. Add your Algolia API Key and App ID for the Hacker News search node.
  3. Add your OpenAI API key for the GPT-4o-mini node.
  4. Add Airtable API key and connect to your base and table where job posts will be saved.

Step 3: Configure IDs and Table Columns (if needed)

  1. Check ‘Search for Who is hiring posts’ node has the right search query.
  2. Verify ‘Write results to airtable’ node matches your Airtable base and table columns.
  3. Check the cleaning JavaScript code inside the ‘Clean text’ node to customize if wanted.
  4. Review prompt text in ‘OpenAI Chat Model’ node if needed.

Step 4: Test the Workflow

  1. Trigger the workflow manually using the Manual Trigger node.
  2. Watch each node run and verify it fetches data and saves correctly.

Step 5: Activate for Production Usage

  1. Switch the Manual Trigger node to a scheduled trigger for automation (weekly/monthly).
  2. Monitor workflow runs occasionally for errors.

Edge Cases and Failure Points

  • If API keys are missing or wrong, HTTP requests will fail with 403 Forbidden.
  • If too many job posts are processed, OpenAI API limits can be hit causing timeouts.
  • Unclean or malformed text can produce incomplete JSON parsing from GPT.
  • Duplicates may occur if Airtable is not cleared or de-duplication logic added.

Customization Ideas

  • Change search filter in ‘Search for Who is hiring posts’ node to target other “Ask HN” posts.
  • Adjust days threshold in ‘Get latest post’ filter for different date ranges.
  • Edit JavaScript in ‘Clean text’ to tweak text cleaning rules.
  • Replace Airtable node with Google Sheets or SQL database node to change storage.
  • Switch OpenAI model in ‘OpenAI Chat Model’ node for cost or accuracy balance.

Summary of Benefits

✓ Saves over 5 hours monthly by automating job post collection.
✓ Turns messy forum posts into clean, structured job records.
✓ Keeps job listings fresh by filtering last 30 days automatically.
✓ Stores data in Airtable for easy management and search.
✓ Flexible to adjust filters, cleaning, and storage options.
✓ Works with self-host n8n or cloud setups for scheduled runs.


Scrape Hacker News with n8n and OpenAI

Visit through Desktop to Interact with the Workflow.

Frequently Asked Questions

Yes, by changing the search query string in the ‘Search for Who is hiring posts’ HTTP Request node, other Ask HN topics can be targeted.
Yes, every job text processed by the OpenAI node uses API quota. The workflow includes an optional limit node to control how many posts are sent.
No, Airtable can be replaced with other storage options like Google Sheets or SQL by swapping the final node.
It handles moderate loads well but may reach API rate limits. Consider splitting workflows or adding limits for very large data.

Promoted by BULDRR AI

Related Workflows

Automate Twist Channel Creation and Messaging with n8n

This workflow automates creating and updating a channel in Twist and sending a personalized message to specific users. It eliminates manual setup errors and saves time managing Twist communications.

Automate Ideogram Image Generation with Google Sheets & Gmail

This workflow automates graphic design image generation via Ideogram AI, storing image data in Google Sheets and Google Drive, with email alerts via Gmail. It saves designers hours by automating image creation, remixing, review, and record-keeping.

Automate IT Support with Slack and OpenAI in n8n

Streamline IT support by automating Slack message handling using n8n and OpenAI. This workflow handles Slack DMs, filters bots, queries a Confluence knowledge base, and delivers AI-generated responses, improving support efficiency and response time.

Automate Crypto Analysis with CoinMarketCap & n8n AI Agent

Discover how this unique n8n workflow leverages CoinMarketCap’s multi-agent AI to deliver precise, real-time cryptocurrency insights directly via Telegram. Manage crypto data analysis efficiently with automated multi-source API integration.

Automate Gumroad to Beehiiv Subscriber Sync with n8n

Learn how to automatically add new Gumroad sales customers as Beehiiv newsletter subscribers using n8n automation. This workflow saves time by syncing sales data to Google Sheets CRM and notifying your Telegram channel instantly.

Generate On-Brand Blog Articles Using n8n and OpenAI

This workflow automates the creation of on-brand blog articles by analyzing existing company content using n8n and OpenAI. It extracts article structures and brand voice to produce consistent draft articles, saving significant content creation time.
1:1 Free Strategy Session
Your competitors are already automating. Are you still paying for it manually?

Do you want to adopt AI Automation?

Every hour your team does repetitive work, you're burning real money.
While you wait, faster businesses are cutting costs and moving quicker.
AI and automations aren't the future anymore — they're the present.

Book a live 1-on-1 session where we show you exactly which of your daily tasks can be automated — and what it’s costing you not to.