Improve PDF OCR performance from ChatGPT

tb · February 24, 2025, 5:03am

Hi there,

I’m trying to setup an automation to acquire invoices from a mailbox, and summarize them using ChatGPT.

My issue is that ChatGPT struggles to extract accurate information from the invoices. The invoices are in clean one-page PDFs. Somehow, ChatGPT invents some information and fails to acquire most information.

Is there a known fix for this? I know that other specialized third-party invoice analysis tools exist, but if possible I’d prefer to stick to ChatGPT.

Automation overview below (two branches to deal with PDF and image files separately).

Thank you so much in advance!

Milan_Vasarhelyi · February 24, 2025, 10:52am

Which model do you use? Do you have the same issue with chatgpt.com or just the API?

Stoyan_Vatov · February 24, 2025, 11:07am

Hey there,

please have in mind that ChatGPT is NOT a PDF OCR platform. You can try creating an agent and training it with your own data to improve its performance or switch to a different tool actually designed to do PDF OCR.

samliew · February 25, 2025, 4:05pm

Welcome to the Make community!

Try using the Dumpling AI “Extract data from PDF with AI” module:

Extract structured data from one or more PDF files with multimodal AI (10+ credits)

For more information on how to set this up, see

Hope this helps! Let me know if there are any further questions or issues.

— @samliew

P.S.: Investing some effort into the Make Academy will save you lots of time and frustration using Make.

Andrei_Unterzeger · February 26, 2025, 5:31am

Can you tell me what role the router plays in your scenario?

Andrei_Unterzeger · February 26, 2025, 7:23am

I managed to complete this task in this way:

At the output, the data is loaded into a Google table.
Filter for PDF attachments.

There were problems with the Dumpling AI settings. I did some magic and watched the developer’s video tutorials. The screenshot shows the settings.

Topic		Replies	Views
Analyze pdf in open ai How To api , open-ai , pdf	1	42	May 5, 2025
Analize PDF with Open AI How To gmail	2	1023	December 7, 2023
Pass PDF to analyse images in OpenAI (or other formats and other LLMs) How To chatgpt , ai , pdf , gemini	3	126	April 25, 2025
OpenAI GPT4o - extracting info from PDF How To error	0	45	May 19, 2025
Create a JSON out of a pdf with ChatGPT How To error	3	86	February 4, 2025

Improve PDF OCR performance from ChatGPT

Related topics