Readability / article extraction?

For web scraping, some apps you can use are ScrapingBee and ScrapeNinja to get content from the page.

I’ve used ScrapeNinja, and you can use jQuery-like selectors there in the extractor function.

ScrapeNinja also can run the page in a web-browser so it closely emulates what users see, as opposed to just the page HTML fetched from the HTTP module.

If you want an example, take a look at Grab data from page and url - #5 by samliew

2 Likes