Reddit: Looking to extract new post body and post link in a structured manner

Hello, I’m trying to extract posts from a subreddit. I know make.com already has a reddit module: ‘Watch New Comments…’ but I do not like this module because of the duplicate posts that it returns back so I went down the route of using ‘make a oauth request’ http module to get it straight from source.

However, when I run this module, it returns a huge blub of html with other irrelevant information I do not want. Literally scrapes the whole page. How can I remove all these irrelevant info and just extract, the post URL link and post body in a simple json manner. I have even tried the text parser as seen below but still im getting ads posts, mod posts etc.

Would appreciate any help

@samliew could you help with the above please?

Welcome to the Make community!

You didn’t provide any blueprint or link to the content you want to scrape.

To allow others to assist you with your scenario, please provide the following:

1. Relevant Screenshots

Please share screenshots of your scenario, any error messages, relevant module fields, and filters in question? It would really help other community members to see what you’re looking at.

You can upload images here using the Upload icon in the text editor:

2. Scenario Blueprint

Please export the scenario blueprint file to allow others to view the mapped variables in the module fields. At the bottom of the scenario editor, you can click on the three dots to find the Export Blueprint menu item.

3. Output Bundles of Modules

Please provide the output bundles of the modules by running the scenario (or get from the scenario History tab), then click the white speech bubble on the top-right of each module and select “Download input/output bundles”.

A. Upload as Text File

Save each bundle contents in your text editor as a bundle.txt file, and upload it here into this discussion thread.

B. Insert as Formatted Code Block

If you are unable to upload files on this forum, alternatively you can paste the formatted bundles.
These are the two ways to format text so that it won’t be modified by the forum:

  • Method 1: Type code block manually

    Add three backticks ``` before and after the content/bundle, like this:

    ```
    content goes here
    ```

  • Method 2. Highlight and click the format button in the editor

Providing the input/output bundles will allow others to replicate what is going on in the scenario even if they do not use the external service.

Following these steps will allow others to assist you here. Thanks!

the url that you entered doesnt return a JSON string for me? So im thinking than the HTTP module cant parse the response?