Help Needed: Filtering HTML Elements and Managing Text Overload in Google Docs

What are you trying to achieve?

hello guys, i hope some solution for my issues

I have set up a Make.com scenario to automate the retrieval and processing of newsletters in order to extract the most relevant financial news.
The scenario works as follows:

1Retrieves the 10 latest emails from Outlook (ID: 35).
2. Converts the HTML content into plain text using HTML to Text (ID: 33).
3. Aggregates the cleaned emails using Text Aggregator (ID: 37).
4. Processes the aggregated content with Gemini AI (ID: 14) to extract the 4 most relevant news items related to finance, wealth management, and investment.
5. Stores the final results in the same Google Docs file (ID: 34).

Problem Encountered

  1. HTML Elements Not Fully Removed
  • Despite using the HTML to Text module, some images and links remain in the text. This makes the final content cluttered and difficult to read.
  • Is there a better way to fully remove all HTML elements while keeping the text clean?
  1. Overlapping Text in Google Docs
  • All emails are stored in one Google Docs file, and once sent to Gemini, the AI-generated summary is also stored in the same document.
  • This results in a mix of raw emails and AI-generated content, making the document overloaded with text and links.
  • How can I structure the output to separate raw emails from the final Gemini summary more effectively?

What I need help with

  • A better way to filter out all unnecessary HTML elements before processing the text.
  • A method to organize the document structure so the email content and Gemini’s generated summaries do not mix in the same document.
  • Any alternative approaches that have worked for similar automation workflows.

Screenshots

scenario:


Html to text :

Text agragator :

Thanks in advance for your help! :folded_hands:

Welcome to the Make community!

You have not yet provided sufficient information to demonstrate the problem that would allow us to reproduce the scenario and any error(s).

To allow others to assist you with your scenario, please provide the following:

1. Relevant Screenshots

Could you please share screenshots of your full scenario? Also include screenshots of any error messages, module settings (fields), relevant filter settings (conditions), and module output bundles. We need to see what you’re working with to give you the best advice.

You can upload images here using the Upload icon in the text editor:

2. Scenario Blueprint

Please export the scenario blueprint. Providing your scenario blueprint file will allow others to quickly recreate and see how you have set up the mappings in each module, and also allows us take screenshots or provide module exports of any solutions we have for you in return - this would greatly benefit you in implementing our suggestions as you can simply paste module exports back into your scenario editor!

To export your scenario blueprint, click the three dots at the bottom of the editor then choose ‘Export Blueprint’.

You can upload the file here by clicking on this button:

3. Output Bundles of Modules

Please provide the output bundles of each of the relevant modules by running the scenario (you can also get this without re-running your scenario from the History tab).

Click on the white speech bubbles on the top-right of each module and select “Download input/output bundles”.

A. Upload as a Text File

Save each bundle contents in a plain text editor (without formatting) as a bundle.txt file.

You can upload the file here by clicking on this button:

B. Insert as Formatted Code Block

If you are unable to upload files on this forum, alternatively you can paste the formatted bundles.
These are the two ways to format text so that it won’t be modified by the forum:

  • Method 1: Type code block manually

    Add three backticks ``` before and after the content/bundle, like this:

    ```
    content goes here
    ```

  • Method 2. Highlight and click the format button in the editor

Providing the input/output bundles will allow others to replicate what is going on in the scenario, especially if there are complex data structures (nested arrays and collections) or if external services are involved.

Sharing these details will make it easier for others to assist you.