Besides the solutions I have made here: How to PDF into openAI (Solution!)
I’ve found the pdf.co module a lot better to extract pdf data. As it’s setup specifically to extract pdf data.
Also makes openAI has a module to set up a structured JSON module, which is also worth noting too.