Scraping a news story

Safety_Tom · July 31, 2025, 11:26am

Figured this out using Scraping Bee (ScrapeNinja - and its documentation - was not working for me). The essential piece of knowledge I uncovered is that many article authors use the HTML tag or to contain the “body text” of the article. My JSON extraction was simple {“title”:“title”,“body”:“body”,“article”,“article”}. After I get this object I then investigated bot and to select the larger item…and then cut this item down to 40,000 characters (because I wanted to store it in a single cell in a google sheet). Hope this helps someone someday!

Topic		Replies	Views
Make with GPT-4o not browse url Questions api , chatgpt	3	211	November 23, 2024
Multiple Blog scrap and resume with chatgpt Questions airtable , api , chatgpt , web-scraping	2	399	April 5, 2024
News Automation (RSS -> Scraptio -> OpenAI --> Google Sheet): almost there, please help! Questions error	6	531	September 11, 2024
Using ChatGPT to tell me if news is worth reading and summarising Questions chatgpt	2	585	August 21, 2024
How to access and get data of content of a specific news from a website and add it to a google sheet Questions google-sheets , http	2	99	July 23, 2025

Scraping a news story

Related topics