Automate Academic Paper Summaries: Hugging Face + Notion + OpenAI

Discover how to automate fetching, analyzing, and storing Hugging Face research papers using n8n with Notion and OpenAI integration. This automation streamlines paper collection, deep abstract analysis, and organizes insights efficiently in Notion, saving hours of manual research work.
scheduleTrigger
notion
openAi
+5
Workflow Identifier: 1767
NODES in Use: Schedule Trigger, HTTP Request, HTML Extract, Split Out, Split In Batches, Notion, If, OpenAI LangChain
Automate paper summaries with n8n and OpenAI

Press CTRL+F5 if the workflow didn't load.

Learn how to Build this Workflow with AI:

What This Automation Does

This workflow fetches new papers from Hugging Face every weekday morning and checks if the papers already exist inside a Notion database.

For new papers, it extracts titles and abstracts, then uses OpenAI to analyze abstracts and pull out key ideas and details.

Finally, it saves all this data into Notion, so the user gets well-organized, detailed summaries automatically without doing any manual copy-paste work.


Who Should Use This Workflow

This automation fits researchers or knowledge workers who check new AI papers often and want to skip the boring, manual process of gathering information.

If spending hours copying abstracts, typing notes, or hunting papers wastes time and causes mistakes, this workflow helps by doing it all automatically.


Tools / Services Used

  • Hugging Face Papers Web Page: Source of new AI research papers.
  • Notion API: Database to store paper details and summaries.
  • OpenAI API (Via LangChain node): AI model to analyze abstracts and give structured insights.
  • n8n Automation Platform: Runs the entire process with scheduled triggers and various nodes.

Input → Processing → Output

Inputs

Processing Steps

  • Fetch paper list HTML for the specific date.
  • Extract paper URLs from HTML using CSS selectors.
  • Split paper URLs for easy item processing and batch handling.
  • Check each URL against Notion to find if paper is new.
  • Skip papers already in Notion, process only new ones.
  • Download the detailed paper page HTML for new papers.
  • Extract abstract and title using precise CSS selectors.
  • Send abstract to OpenAI model with a custom prompt extracting key info in JSON format.
  • Map AI output and paper metadata into Notion database entry.

Output

  • New Notion pages with enriched metadata, summaries, keywords, and classifications.
  • Elimination of manual copy-pasting and missed insights.

Beginner Step-by-Step: How to Use This Workflow in Production

Download and Import Workflow

  1. Download the workflow file using the Download button on this page.
  2. Open n8n editor where the workflow will run.
  3. Use the option “Import from File” to load the downloaded workflow into n8n.

Configure Credentials and IDs

  1. In n8n, add your OpenAI API key in the credential area for the LangChain node.
  2. Set up Notion credentials with integration token and select your Notion database ID correctly.
  3. If needed, update any hard-coded URLs, database fields, or parameters like date formatting to fit your setup.

Testing

  1. Run the workflow manually once to verify all nodes work correctly and data flows to Notion properly.
  2. Check your Notion database for new entries with correct metadata and AI summaries.

Activate for Production

  1. Toggle the workflow on from the n8n dashboard to make it run automatically on weekdays at 8 AM as scheduled.
  2. If you want to host this workflow yourself, consider self-host n8n for more control.
  3. Optionally monitor logs or add notifications for failure alerts.

Common Issues and Edge Cases

  • No papers extracted: The CSS selector used may be outdated because website layouts often change. Fix by inspecting Hugging Face page and updating selector.
  • Incorrect duplicate detection in Notion: Make sure URL strings match exactly and filter conditions in Notion nodes are set properly.
  • OpenAI API failures: Check API key validity and if rate limits are exceeded. Add retry logic or space out batches.

Customization Ideas

  • Change schedule timing in the Schedule Trigger node to better match paper release times.
  • Add more data fields to extract, such as authors or publication dates, by updating HTML selectors.
  • Improve the AI prompt in OpenAI LangChain node to gather more insights like potential paper uses or limits.
  • Adjust batch sizes for better API request control and faster processing.
  • Extend Notion database fields and map them in the Notion Create node to track citation counts or DOIs.

Conclusion

This workflow lets you get new AI research papers from Hugging Face every weekday, analyze abstracts smartly with OpenAI, and store neat summaries in Notion without manual work.

It saves you 5+ hours every week and reduces errors in note-taking.

Next steps can be adding citation fetch automation or connecting with other paper sources like arXiv.

Download and import the workflow, set it up, and let n8n run this helpful automation for you.


Sample OpenAI Prompt Used in LangChain Node

This prompt instructs the AI to extract main intro, keywords, results, technical details, and classification from each paper abstract, returning in JSON format.

{
  "role": "system",
  "content": "Extract the following key details from the paper abstract:\n\nCore Introduction: Summarize the main contributions and objectives of the paper, highlighting its innovations and significance.\nKeyword Extraction: List 2-5 keywords that best represent the research direction and techniques of the paper.\nKey Data and Results: Extract important performance metrics, comparison results, and the paper's advantages over other studies.\nTechnical Details: Provide a brief overview of the methods, optimization techniques, and datasets mentioned in the paper.\nClassification: Assign an appropriate academic classification based on the content of the paper.\n\nOutput as json:\n{...}"
}

Automate paper summaries with n8n and OpenAI

Visit through Desktop to Interact with the Workflow.

Frequently Asked Questions

Yes, users can change the model ID in the OpenAI LangChain node to any supported OpenAI model in n8n.
The number of API credits depends on how many paper abstracts are processed and their length. Batch size and frequency can be adjusted to manage costs.
Verify that the Notion query filter matches the full Hugging Face URL exactly and URLs stored use consistent formatting.
Yes, with proper batch sizes and API quota management, it can process large volumes efficiently.

Promoted by BULDRR AI

Related Workflows

Automate Twist Channel Creation and Messaging with n8n

This workflow automates creating and updating a channel in Twist and sending a personalized message to specific users. It eliminates manual setup errors and saves time managing Twist communications.

Automate Ideogram Image Generation with Google Sheets & Gmail

This workflow automates graphic design image generation via Ideogram AI, storing image data in Google Sheets and Google Drive, with email alerts via Gmail. It saves designers hours by automating image creation, remixing, review, and record-keeping.

Automate IT Support with Slack and OpenAI in n8n

Streamline IT support by automating Slack message handling using n8n and OpenAI. This workflow handles Slack DMs, filters bots, queries a Confluence knowledge base, and delivers AI-generated responses, improving support efficiency and response time.

Automate Crypto Analysis with CoinMarketCap & n8n AI Agent

Discover how this unique n8n workflow leverages CoinMarketCap’s multi-agent AI to deliver precise, real-time cryptocurrency insights directly via Telegram. Manage crypto data analysis efficiently with automated multi-source API integration.

Automate Gumroad to Beehiiv Subscriber Sync with n8n

Learn how to automatically add new Gumroad sales customers as Beehiiv newsletter subscribers using n8n automation. This workflow saves time by syncing sales data to Google Sheets CRM and notifying your Telegram channel instantly.

Generate On-Brand Blog Articles Using n8n and OpenAI

This workflow automates the creation of on-brand blog articles by analyzing existing company content using n8n and OpenAI. It extracts article structures and brand voice to produce consistent draft articles, saving significant content creation time.
1:1 Free Strategy Session
Your competitors are already automating. Are you still paying for it manually?

Do you want to adopt AI Automation?

Every hour your team does repetitive work, you're burning real money.
While you wait, faster businesses are cutting costs and moving quicker.
AI and automations aren't the future anymore — they're the present.

Book a live 1-on-1 session where we show you exactly which of your daily tasks can be automated — and what it’s costing you not to.