I want to send the website headlines fot ChatGPT to evaluate.
Currently, when I use HTTP request modul, it scrapes the entire website.
Can I somehow define to scrape only H1, H2, H3 titles? If I send the entire HTML-code from a single website, the amount of data is too much for chatGTP and/or it costs too much.
Or what might be a good module to manage the scraped HTML code before sending it to ChatGPT? So I could somehow cut away unneeded HTML elements
What is the website and what is the output of the HTTP module?
When reaching out for assistance with your regex pattern for a Text Parser module, it would be super helpful if you could share the actual content you’re trying to match. Screenshots of text can be a bit tricky, so if you could copy and paste the text directly here, that would be awesome! It ensures we can run it against test patterns effectively. If there’s any sensitive info, feel free to change it to something fictional yet still valid by keeping the format intact.
Providing clear text examples saves time on both ends and helps us give you the best possible solution. Without proper examples, we might end up playing a guessing game, and nobody wants that as it is a waste of time! You are more likely to get a correct answer faster. So, help us help you by sharing those text snippets. Thanks a bunch!
Please provide the input and output bundles of the modules by running the scenario (or get from the scenario History tab), then click the white speech bubble on the top-right of each module and select “Download input/output bundles”.
A.
Save each bundle contents in your text editor as a bundle.txt file, and upload it here into this discussion thread.