Automate News Extraction with n8n and OpenAI for Weekly Summaries

This n8n workflow automates extracting, summarizing, and keywording the latest news posts from a website without RSS, saving hours of manual work each week. It pulls fresh news, summarizes content with OpenAI, and stores results in NocoDB for easy access.
Workflow Identifier: 1959
NODES in Use: html, openAi, set, merge, code, httpRequest, itemLists, scheduleTrigger, nocoDb, stickyNote

What this workflow does

This workflow gets news posts from a telecom company website without an RSS feed. It collects links and post dates, filters posts from the last 7 days, grabs full post content, and uses AI (OpenAI GPT-4) to make short summaries and extract keywords. All data is saved into a NocoDB database. This saves many hours each week and avoids missing important updates.

Who should use this workflow

This workflow is for people who need to track weekly news from websites that do not provide RSS feeds and who want fast summaries and key tags from long articles without reading everything. It fits market analysts, researchers, and technical teams who need clean, up-to-date news data in a database.

Tools and services used

  • n8n: The automation platform to run and schedule the workflow.
  • OpenAI GPT-4: Generates summaries and extracts keywords from news content.
  • NocoDB: A no-code database platform that sits on top of an SQL database and stores the news data for easy use.
  • HTTP Request node: Fetches web pages for data extraction.
  • HTML node: Extracts information from HTML using CSS selectors.
  • Code node: Filters posts based on date logic using JavaScript.
  • Merge and ItemLists nodes: Organize data for processing and combining.

Inputs, processing, and outputs in the workflow

Inputs

  • The URL of the news page to scrape.
  • OpenAI API Key for AI summarization and keyword extraction.
  • NocoDB API token and target database table information.

Processing Steps

  • Fetch the main news page HTML.
  • Extract all news post links and their publication dates using CSS selectors.
  • Split those arrays into individual items for easier handling.
  • Merge corresponding dates and links by their positions to keep data aligned.
  • Filter out posts older than 7 days using a JavaScript code node.
  • Fetch the full content of each recent news post by visiting each link.
  • Extract the title and actual news content for each post.
  • Combine content with respective date and link information.
  • Send the news content to OpenAI GPT-4 for short, 70-word summaries.
  • Send content again to OpenAI to get exactly three technical keywords per article.
  • Clean and rename the summary and keywords fields for easier merging.
  • Merge AI outputs with the news metadata for a complete record.
  • Save everything into a NocoDB SQL database for access and review.
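
The pairing and date-filter steps above can be sketched in plain JavaScript. This is a minimal illustration, not the code shipped in the workflow: the function names `mergeByPosition` and `filterRecent`, the sample links, and the ISO date strings are all assumptions, and inside an n8n Code node the input would come from the incoming items rather than from hard-coded arrays.

```javascript
const DAY_MS = 24 * 60 * 60 * 1000;

// Pair the i-th link with the i-th date, similar to merging by position.
// (Hypothetical helper; the workflow uses a Merge node for this step.)
function mergeByPosition(links, dates) {
  const count = Math.min(links.length, dates.length);
  const posts = [];
  for (let i = 0; i < count; i++) {
    posts.push({ link: links[i], date: dates[i] });
  }
  return posts;
}

// Keep only posts dated within the last `days` days before `now`.
// Posts whose date cannot be parsed are dropped.
function filterRecent(posts, now = new Date(), days = 7) {
  const cutoff = now.getTime() - days * DAY_MS;
  return posts.filter((post) => {
    const time = new Date(post.date).getTime();
    return !Number.isNaN(time) && time >= cutoff;
  });
}

// Sample data (made up for illustration).
const posts = mergeByPosition(
  ["/news/a", "/news/b", "/news/c"],
  ["2024-05-09", "2024-05-01", "2024-05-06"]
);
const recent = filterRecent(posts, new Date("2024-05-10T00:00:00Z"));
console.log(recent.map((post) => post.link));
```

Passing `now` as a parameter keeps the filter easy to test with a fixed date; in a scheduled run it can simply default to the current time.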

Outputs

  • Clean news records with title, date, link, AI-generated summary, and keywords stored in the database.
  • Reduced manual work and improved update accuracy.
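
As a rough illustration of what one stored record might look like, the sketch below cleans the two AI responses and combines them with the post metadata. The function name `buildRecord` and the field names `title`, `date`, `link`, `summary`, and `keywords` are assumptions chosen to mirror the output list above; the actual column names depend on your NocoDB table.

```javascript
// Hypothetical helper: `summaryText` and `keywordsText` stand for the raw
// strings returned by the summary and keyword AI steps.
function buildRecord(meta, summaryText, keywordsText) {
  const keywords = keywordsText
    .split(/[,\n]/)
    // Strip list markers such as "- " or "1. " without touching terms like "5G".
    .map((word) => word.replace(/^\s*(?:[-•*]|\d+[.)])\s+/, "").trim())
    .filter((word) => word.length > 0)
    .slice(0, 3); // keep at most three keywords

  return {
    title: meta.title,
    date: meta.date,
    link: meta.link,
    summary: summaryText.trim(),
    keywords: keywords.join(", "),
  };
}

// Sample values (made up for illustration).
const record = buildRecord(
  { title: "Network update", date: "2024-05-09", link: "/news/a" },
  "  The operator expanded coverage in three regions.  ",
  "1. 5G\n2. fiber\n3. VoLTE"
);
console.log(record);
```

Normalizing the keyword string here guards against the model returning a numbered or bulleted list instead of a comma-separated line.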

How to use this workflow in n8n

Import and set up workflow

  1. Download the workflow file using the Download button on this page.
  2. In the n8n editor, go to “Import from File” and upload the workflow file.
  3. After import, open the workflow to review it inside n8n.

Configure credentials and settings

  1. Add your OpenAI API Key in the OpenAI nodes under credentials.
  2. Enter your NocoDB API token and select the correct target database table in the NocoDB node.
  3. Check URLs in HTTP Request nodes to confirm they are correct and currently working.
  4. Review any IDs, emails, or fields to update them based on your data setup.

Test and activate

  1. Run the workflow manually once to confirm all steps complete without errors.
  2. Check your NocoDB to verify that news records are saved properly after the test run.
  3. If everything works well, activate the workflow to run automatically each week on schedule.
  4. Optionally, self-host n8n if you want more control by running it on your own server.

Customization ideas

  • Change the main news page URL in the first HTTP Request node to scrape other websites.
  • Adjust CSS selectors in HTML nodes to match new page structures.
  • Modify the date filter in the Code node to lengthen or shorten the 7-day window.
  • Try different OpenAI models or tweak prompts in the summary and keywords nodes for varied AI output.
  • Swap NocoDB with different database nodes if you want to store data elsewhere.
  • Add an email or messaging node after the save step to notify teams about new posts.
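
When experimenting with models or prompts, it can help to keep both in one place. The sketch below builds a chat-style request body with `model` and `messages` fields for each of the two AI tasks. The function name `buildChatRequest` and the prompt wording are assumptions for illustration, not the prompts used in the workflow, and no API call is made here.

```javascript
// Hypothetical prompts; adjust the wording to match your own needs.
const PROMPTS = {
  summary: "Summarize the following news article in at most 70 words.",
  keywords:
    "List exactly three technical keywords for the following news article, separated by commas.",
};

// Build the request body for one task ("summary" or "keywords").
function buildChatRequest(task, articleText, model = "gpt-4") {
  if (!Object.prototype.hasOwnProperty.call(PROMPTS, task)) {
    throw new Error(`Unknown task: ${task}`);
  }
  return {
    model,
    messages: [
      { role: "system", content: PROMPTS[task] },
      { role: "user", content: articleText },
    ],
  };
}

const request = buildChatRequest("keywords", "Article text goes here.");
console.log(JSON.stringify(request, null, 2));
```

Swapping the model then means changing a single argument, and prompt changes stay separate from the code that sends the request.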

Common issues and how to fix them

  • No data extracted: Check the CSS selectors with your browser's Inspect tool and update them in the HTML nodes if the page structure changed.
  • OpenAI API key errors: Make sure a valid, active API Key is entered in credentials; create a new key if the old one was revoked or has expired.
  • Merge node errors: Set the Merge nodes to “combine” mode with the “mergeByPosition” option to keep data aligned.
  • Date filtering not working: Ensure date format matches what JavaScript Code node expects.
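
One frequent cause of date-filter problems is a date string that JavaScript parses differently than expected, or not at all. The sketch below shows one way to parse dates explicitly and return `null` for anything unrecognized. The helper name `parsePostDate` and the two accepted formats (`YYYY-MM-DD` and `DD/MM/YYYY`) are assumptions; check the format the target site actually uses before adapting it.

```javascript
// Hypothetical helper: convert scraped date text to a Date (UTC midnight),
// or return null when the text is not in a recognized format.
function parsePostDate(text) {
  const value = String(text).trim();
  const iso = /^(\d{4})-(\d{2})-(\d{2})/.exec(value);
  const dmy = /^(\d{1,2})\/(\d{1,2})\/(\d{4})$/.exec(value);

  let year;
  let month;
  let day;
  if (iso) {
    [year, month, day] = iso.slice(1).map(Number);
  } else if (dmy) {
    [day, month, year] = dmy.slice(1).map(Number);
  } else {
    return null;
  }

  const date = new Date(Date.UTC(year, month - 1, day));
  // Reject impossible dates such as 31/02/2024, which would otherwise roll over.
  const valid =
    date.getUTCFullYear() === year &&
    date.getUTCMonth() === month - 1 &&
    date.getUTCDate() === day;
  return valid ? date : null;
}

console.log(parsePostDate("10/05/2024"));
console.log(parsePostDate("last Tuesday"));
```

Returning `null` instead of an invalid date makes it easy to log or flag posts with unexpected date text rather than silently dropping them.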

Summary of workflow results

✓ Saves more than four hours of manual news checking every week.
✓ Creates short, clear news summaries with AI automatically.
✓ Extracts key technical terms to tag articles for easy searching.
✓ Stores all data in an easy-to-use NocoDB database.
✓ Runs on schedule, no manual start needed.


Frequently Asked Questions

Users must add OpenAI API Key and NocoDB API token credentials before running the workflow.
A Code node filters posts by comparing each post date against the last 7 days using JavaScript date logic.
The OpenAI nodes generate a short summary under 70 words and extract three technical keywords for each news post.
Users can replace the NocoDB node with another SQL database node supported by n8n by adjusting the field mappings.
