Create a JSON out of a pdf or jpg with chat gpt

Natalia_Grech · December 20, 2023, 4:03pm

Hello
I’m trying to create a scenario in which everytime a save a ticket or invoice in pdf format or jpg, Chat GPT can read it, extract key information and then create a record in Airtable with the expense.
First of all, I have developed the prompt and is working perfectly in Chat GPT. This is the prompt: Read the file {{8.id}} which is an invoice in pdf or jpg format and generate a JSON with the following information:

Issue date
Issuer’s VAT number
Issuing company name
Ticket or invoice number
Concept
Amount excluding VA
VAT rate (%
VAT
Total amount
The date must be in DD.MM.YYYYY format.
The amounts only with the decimals you do not have to separate the thousands, except the VAT rate (%) that goes without decimals. Decimals are indicated with a comma.

My make scenario is as follow:

Watch files in Drive folder
Download the file
Upload the file with ChatGPT (Purpose: assistant)
Read the file and extract the information in JSON. Role: System.
The 4th step is failing. At the beginning it said it could not perform that kind of task, but now the output is zero.
The next step will be to create the record in Airtable, but I’m still struggling with the 4th step.

Anyone can give me a clue?
I can share my scenario if needed.
Thanks and best regards

Automate_with_Rezwan · December 20, 2023, 4:12pm

May peace be upon you @Natalia_Grech

Can you provide a screenshot of the module settings of the 4th step?

Natalia_Grech · December 20, 2023, 4:45pm

Sure!
Thanks for your help

Automate_with_Rezwan · December 20, 2023, 5:12pm

@Natalia_Grech I think you should send the message as an User not System.

Natalia_Grech · December 20, 2023, 5:36pm

I tried all of them (System, user, assistant) and nothing seemed to work.

Msquare_Automation · December 21, 2023, 9:57am

Hi @Natalia_Grech

Please try giving the instructions of output as “User”, pass the data as “System” and give the prompt as “Assistant” to present the output as “json”.

If you require additional assistance, please don’t hesitate to reach out to us.
MSquare Support
Visit us here
Youtube Channel

Natalia_Grech · December 21, 2023, 11:27am

So, if I understood correctly I need to lay out 3 chatgpt modules, one for each step?
Or in one module with 3 items?
THanks

Msquare_Automation · December 21, 2023, 11:35am

@Natalia_Grech

See the image for reference in one module.

If you require additional assistance, please don’t hesitate to reach out to us.
MSquare Support
Visit us here
Youtube Channel

Natalia_Grech · December 21, 2023, 1:36pm

Ok, so it seems to quite worked, but I received this message as output:
“Sorry for any misunderstanding, but as a language AI model, I do not have the ability to access external files or read PDF or JPG documents. However, I could show you an example that will represent what your JSON output will look like.”
So what I don’t understand that the prompt works when I use it in chatgpt but not in Make.
Thanks again

Rafael_Sanchez · December 26, 2023, 4:37pm

@Natalia_Grech intenta creando un asistente en el API de OpenAI y dale las instrucciones al asistente para que haga lo que le pides.
mira este ejemplo:

Emile · December 26, 2023, 5:25pm

Yep same here,

I don’t understand how to upload the file
The documentation says we need to use HTTP get a file
but it doesn’t help

https://www.make.com/en/help/app/openai-dall-e-chatgpt

All the files I try to upload seem to be corrupted and not usable by chatgpt

vendy · December 27, 2023, 8:27am

Hey @Rafael_Sanchez

I just wanted to let you know that I translated your question to English since that’s the official language of the community. By keeping the content in one language, we give everybody the chance to step in and help you.
Translation:

Try creating an assistant in the OpenAI API and give instructions to the assistant to do what you ask. Look at this example:

Note: Please read our Code of conduct and get familiar with the rules and guidelines that apply here in our community. This way we keep the community organized and neat for everybody.

Thank you very much

Natalia_Grech · December 27, 2023, 10:14am

Hi @Rafael_Sanchez
Where did you find the assistant? I don’t get any as options.
Do you have to create it first as a GPT?
Thanks for your help.
Best regards

Natalia_Grech · December 27, 2023, 10:16am

Hi @Emile
I’m having issues with the upload. I use the option UPLOAD FILE out of the ChatGPT module.
Have you tried it?
Best regards

Natalia_Grech · December 27, 2023, 10:40am

Hi @Rafael_Sanchez
I figured out how to create the assistant with the API, but I still get the same answer: It cannot process a pdf.
Thanks anyway.
Best regards

Emile · December 27, 2023, 10:53am

Yep the assistants are like GPTs but for the backend (as of my understanding). you need to create them here
https://platform.openai.com/assistants

As for the files, you can check the uploaded files here
https://platform.openai.com/files/
but the problem is that when we upload them with make it seem it’s not uploading properly the file

Someone here has done something with base64 for images, but it doesn’t seem to work on my end

@Rafael_Sanchez can you show us how you upload the file?

Natalia_Grech · December 27, 2023, 10:59am

Thanks @Emile
My issue is not with the upload, that seems to work fine. The problem is that ChatGPT says it cannot read pdf files.
BR

Emile · December 27, 2023, 12:54pm

If you upload the file by hand and not via Make, you will see that you can read the pdf file.
It’s just that when you upload it with make it gets corrupted. (at least on my side, but maybe I’m using the Upload File module incorrectly)

Can you show me a screenshot of your OpenAi Upload File module ? (in step 3. Upload the file with ChatGPT (Purpose: assistant))

Rafael_Sanchez · December 27, 2023, 3:50pm

@Natalia_Grech I saw the messages and you can do a couple of things.
if you want to read the PDF and are open to semi semi-automation scenario you could create a GPT that reads your PDF and then sends the information to a webhook and processes the information in integromat.

Another method is to create an assistant and you could use an “Upload file” module from ChatGPT, then use the file ID, and pass that ID to the assistant just like I show in my last picture.

please DM me for quick assistance on how to do it.

Rafael_Sanchez · December 27, 2023, 3:52pm

Hello @vendy ok, I saw that @Natalia_Grech is giving instructions in Spanish and i assumed she may be a Spanish speaker, but you’re correct, I’ll keep communication in English

Topic		Replies	Views
Create a JSON out of a pdf with ChatGPT How To error	3	61	February 4, 2025
Invoice automation with ChatGPT How To chatgpt	10	1681	August 15, 2024
How to map chatGPT output into my airtable fields (With Json) How To chatgpt , scenario , error	5	103	December 20, 2024
Extract the Chat GPT Text to create a report in PDF format How To google-sheets	2	346	May 13, 2024
Date format issues passing from ChatGPT to Airtable How To airtable , chatgpt	3	31	September 16, 2024

Create a JSON out of a pdf or jpg with chat gpt

Related topics