I am having trouble getting ChatGPT to read a webpage

samliew · May 1, 2024, 1:26am

If you’re fetching the article’s URL, you’ll probably need to do some kind of HTML sanitisation, or convert from HTML to text, or both, before you can pass it to OpenAI.
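To make the "convert from HTML to text" step concrete, here is a minimal sketch outside of Make, assuming Python and only its standard library. The names `TextExtractor` and `html_to_text` are made up for illustration and are not part of Make, OpenAI, or any scraping service. It drops `<script>` and `<style>` content and keeps the visible text, which is usually what you want to send to the model.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and skips script/style-like elements."""

    # Tags whose contents should never reach the model (illustrative choice).
    SKIP_TAGS = {"script", "style", "noscript", "template"}

    def __init__(self):
        super().__init__(convert_charrefs=True)  # decode &amp; and friends
        self._skip_depth = 0
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            # Collapse runs of whitespace inside each text node.
            text = " ".join(data.split())
            if text:
                self._chunks.append(text)

    def get_text(self):
        return " ".join(self._chunks)


def html_to_text(html: str) -> str:
    """Hypothetical helper: return the visible text of an HTML string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.get_text()


page = (
    "<html><head><style>p{color:red}</style>"
    "<script>var x = 1;</script></head>"
    "<body><h1>Title</h1><p>Hello &amp; <b>world</b></p></body></html>"
)
print(html_to_text(page))
```

This is only a rough cleanup: it does not pick out the article body from menus and footers, so a real page will usually still need a selector or a scraping service on top.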

There is no single right way to do it. You can even use a web scraping module to simplify the job.

For web scraping, apps such as ScrapingBee and ScrapeNinja can get the content from the page.

I’ve used ScrapeNinja, and you can use jQuery-like selectors there in the extractor function.

ScrapeNinja can also run the page in a real web browser, so the result closely matches what users see, as opposed to just the raw page HTML fetched by the HTTP module.

If you want an example, take a look at Grab data from page and url - #5 by samliew

