Multi-Pa
ge PDF Processing Issue (Vehicle Registration) & Outbound Bundle Duplication
Hello Make Community!
I am working on automating data extraction from complex official documents, specifically Vehicle Registration Certificates (VRCs) from EU countries (Baltics, Poland).
My current scenario flow:
-
Receive the source file (PDF/PNG/JPG).
-
Conversion (PDF.co).
-
OCR Recognition (Google Cloud Vision).
-
Field extraction (Text Parser) using RegEx.
-
Consolidation (Array Aggregator).
-
Email sending (Brevo/SendGrid).
The Core Problem:
-
Extraction Instability: The Google Cloud Vision + Text Parser / RegEx combination is proving very fragile. Due to OCR errors (non-linear field layout, rotated text), the RegEx often fails or misses fields.
-
Outbound Bundle Duplication (Primary Issue): Since the source documents are multi-page (2 pages in the PDF), the entire scenario iterates two or more times (one iteration per page). Despite using the Array Aggregator for consolidation, I still end up sending 2 or more emails for a single source PDF.
- Note: The flow works correctly (1 document → 1 email) for single-page JPG/PNG files.
Questions for the Community:
-
Specialized IDP Modules: What tools are you successfully using for extracting structured data from complex, non-linear documents (invoices, passports, VRCs)? Are there more reliable services in the Make.com ecosystem than the brittle Google Cloud Vision + RegEx setup?
- Has anyone successfully implemented ComIDP, Azure AI Document Intelligence (Form Recognizer), or Amazon Textract? Do these services provide a single, structured JSON output that inherently solves the multi-page document aggregation problem?
-
Consolidation/Aggregation: If you have encountered outbound duplication after processing a multi-page PDF, how did you guarantee that the complete data array was bundled into a single outgoing operation, ensuring the client only receives one email?
I would be grateful for any advice and practical solutions, especially regarding the use of specialized IDP (Intelligent Document Processing) modules!




