Hey there, Makers!
Want to take your blog to the next level with AI-powered content? In this guide, I’ll show you how I created an automated AI research paper TL;DR blog using LlamaParse and Make.
In our previous guide, we explored how to use Jina AI Reader to extract content from URLs, which works great for web pages and online articles. However, when dealing with AI research papers, the content is often in PDF or other document formats. That’s where LlamaParse comes in!
LlamaParse is a game-changer, acting as a super-smart research assistant that quickly extracts content from complex documents (.pdf, .pptx, .docx, .html, .xml, and more). By integrating it with Make, I’ve built a fully automated content creation and publication workflow.
All the prompts used in this workflow are included in the scenario blueprints, so you can simply download and import them directly into your Make account.
blueprint (5).json (157.1 KB)
The best part? LlamaParse offers a generous free tier, giving you 1,000 free pages per day! If you need even more, their paid plan includes 7,000 free pages per week, with additional pages costing just $0.003 each.
You can visit the LlamaParse website at LlamaCloud to obtain your API key.
If you want to make your life easier, don’t miss our custom LlamaParse app, made just for Make. It offers a simple, hassle-free way to integrate LlamaParse into your Make workflows, so you can focus on creating amazing content.
let’s dive in and see how you can use LlamaParse and Make to create your own AI-powered blog!
So, here’s the problem with LLMs: they’re not always updated with the latest information. One solution is to feed the AI with the most recent data, but extracting and feeding that information can be a real headache. That’s where LlamaParse comes into play!
We’re going to create a semi-RAG (Retrieval Augmented Generation) system that leverages LlamaParse to extract key insights from the latest AI research papers and feed them to our LLM.
This way, our AI-powered blog will always be up-to-date with the most recent breakthroughs in the field.
Let me explain how our scenario works:
This scenario automates the process of generating blog posts based on AI research papers from arXiv. It retrieves a research paper, extracts its content, generates a blog post outline and full article, creates an engaging featured image, and publishes the post on a WordPress blog.
Scenario Flow
- Retrieve the next research paper from a Google Sheet.
- Download the paper’s PDF file.
- Extract the paper’s content using LlamaParse.
- Update the Google Sheet to mark the paper as processed.
- Generate a comprehensive, SEO-optimized blog post outline using Anthropic Claude.
- Create an engaging, easy-to-understand blog post based on the outline and paper content using Anthropic Claude.
- Generate a concise prompt for an AI image that represents the blog post using OpenAI.
- Create a high-quality featured image using OpenAI DALL-E 3.
- Publish the blog post with the featured image on a WordPress blog.
The reason we’re using Claude here is because some research papers can have many pages, and Claude is better at retaining and summarizing the content of lengthy PDFs. making it more suitable for handling the large content of research papers efficiently
Adapting this scenario to your unique use case
The semi-RAG system we’ve described for creating an AI-powered blog can be easily adapted to any scenario where you need to feed your LLM with up-to-date information. Whether you’re a financial analyst, healthcare professional, or marketing expert, this setup can be tailored to your specific needs.
You might be wondering how we can get the arXiv PDF URLs and save them into the Google Sheet. In that case, we can use this scenario:
Automated arXiv AI Research Paper Retrieval
blueprint (6).json (27.9 KB)
This scenario allows you to subscribe to any new AI-related research papers from arXiv and automatically save their details, including the PDF URLs, into a Google Sheet.
Scenario Flow
- Retrieve the RSS feed items from the arXiv AI category using the RSS module.
- Aggregate the retrieved data using the Basic Aggregator module.
- Feed the aggregated data into the Iterator module.
- Add a new row to the specified Google Sheet for each research paper, including the title, description, and URL.
For reference, here’s the template of the Google Sheet used in this workflow: arxiv - Google Sheets
We have also used the formula =SUBSTITUTE(C2, “abs”, “pdf”) in the Google Sheet to quickly convert the arXiv URLs to their corresponding PDF URLs.
In this example, we are retrieving all the available AI research papers from the arXiv RSS feed. However, you can easily modify the scenario to subscribe to any new papers related to your specific interests by adjusting the RSS feed URL.
Conclusion
And there you have it, Makers! You now have a powerful, automated AI-powered blog that stays up-to-date with the latest research papers. By combining the strengths of LlamaParse and Make, you can create a content creation workflow that saves you time and keeps your readers engaged.
Remember, this setup is highly adaptable and can be tailored to fit your unique needs
If you have any questions or need further assistance, don’t hesitate to leave a comment below or reach out to me at bilalmansouri.com.
Happy Making!