Extract data from pdf - pdf.co

Good morning everyone,
I’m trying to make a seemingly simple automation (although I don’t know much about it). I want to download data from a pdf file. I’ve gotten to this point

The data I want to access is contained in a file, e.g. here

Ultimately, I want to put some of this data in a Google sheet. My problem is that I can’t get to this data from another level. Every control I try to use only shows me one line pageindex 6.

Example of an output file

             {
                        "PageIndex": 0,
                        "Type": "EditBox",
                        "FieldName": "GR1.brutto.",
                        "AltFieldName": "GR1.brutto",
                        "Value": "277.39",
                        "Left": 241.01300048828125,
                        "Top": 283.80596923828125,
                        "Width": 282.47998046875,
                        "Height": 22
                    },
                    {
                        "PageIndex": 0,
                        "Type": "EditBox",
                        "FieldName": "GR1.godziny.",
                        "AltFieldName": "GR1.godziny",
                        "Value": "60 minut",
                        "Left": 241.3040008544922,
                        "Top": 310.27899169921875,
                        "Width": 282.47998046875,
                        "Height": 22
                    },

Please forgive me but I’m a total beginner

You need to use an iterator which will iterate over the field objects one by one. Then you can map the data from the output of the iterator.

2 Likes

I don’t know what I’m doing wrong, but the Iterator returns many rows, but it still doesn’t look right either?

Attempting to map to a sheet

Hi @Artur1

You are right approach. You just need to refresh the google sheets module.

If you require additional assistance, please don’t hesitate to reach out to us.
MSquare Support
Visit us here
Youtube Channel

2 Likes

Unfortunately, nothing changes. I still only have the variables from the Iterator and I can’t map the fields

Hi @Artur1

Please make sure that you have mapped the Feilds array with in the iterator. If Yes you can hard code the variable eg: {{53.PageIndex}}

If you require additional assistance, please don’t hesitate to reach out to us.
MSquare Support
Visit us here
Youtube Channel

2 Likes

Thanks for the advice, but I think I’ll give up. I can’t do this :frowning:

Hi @Artur1

Please try below steps before you quit this. :slightly_smiling_face:
Very simple once you understand Make

  1. Map field in the iterator

    2 Filter with page index (After running one time you can get this elements after iterator)

    3 Map the requirements in the google sheet

If you require additional assistance, please don’t hesitate to reach out to us.
MSquare Support
Visit us here
Youtube Channel

2 Likes

I try to do it like this all the time

Then I get a good array

I am doing a filter for PageIndex = 1 and I am not getting any record as a result of the filter

@Artur1 This is actually not very difficult once you’re familiar with the system. If you want, I would be happy to assist you over a zoom call. It shouldn’t take more than 5-10 minutes.

Hi @_Mark, your eagerness to assist is awesome!

If you do end up hopping on the Zoom call, could you swing back here and share a quick summary of the final solution? We like to discuss problem-solving openly in the community so that everyone can benefit from the shared knowledge.

Thanks :raised_hands:

2 Likes