Identifying specific content within a website

Chris_Bain · July 9, 2025, 3:36pm

I’m trying to create a scenario that will identify written case study content within a given website. Using HTTP Make a Request to fetch the site’s sitemap.xml doesn’t provide enough detail, because the case studies typically sit several levels below the main URL and there isn’t a uniform structure (some are stored as HTML, some as PDF, etc.). I could use a scraper, but that would be costly, and it would still be difficult to find the right content.

Any suggestions?

Rishabh_Dugar · July 9, 2025, 4:08pm

Could you share an example URL? Will the relevant content definitely be present in the sitemap?

Chris_Bain · July 9, 2025, 7:26pm

Hi - here are a few examples:

Cisco case studies (in English) are here (unlike most others, Cisco has a single page listing all assets, although they are also linked from many other pages) - Case Studies and Customer Success Stories - Full Listing - Cisco
Salesforce case studies (called Customer Stories) are here (spread across multiple pages linked from this one) - https://www.salesforce.com/uk/customer-stories/
Zoom case studies (again called Customer Stories) are here (only a few are displayed at first; more are displayed if a button is clicked) - https://www.zoom.com/en/customer-stories/all/

Ideally I’d like to be able to point at a top-level URL (e.g. www.cisco.com) and identify all case studies automatically.
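
One possible starting point, sketched in Python rather than as a Make scenario: read robots.txt to find the declared sitemaps, follow any sitemap index files down to the page-level sitemaps, then keep only URLs whose path looks like a case study. Everything named here is an assumption for illustration: the keyword pattern `CASE_STUDY_PATTERN`, the helper names, and the `max_sitemaps` limit are hypothetical, and the approach only finds pages that a site actually lists in its sitemaps under a recognisable path. Gzipped sitemaps and pages loaded by a button click are not handled.

```python
import re
import xml.etree.ElementTree as ET
from urllib.parse import urljoin
from urllib.request import urlopen

# Hypothetical keyword pattern; extend it for other naming conventions.
CASE_STUDY_PATTERN = re.compile(
    r"case[-_]?stud(?:y|ies)|customer[-_]?stor(?:y|ies)|success[-_]?stor(?:y|ies)",
    re.IGNORECASE,
)


def sitemaps_from_robots(robots_txt, base_url):
    """Return sitemap URLs declared in robots.txt, or guess /sitemap.xml."""
    found = [
        line.split(":", 1)[1].strip()
        for line in (raw.strip() for raw in robots_txt.splitlines())
        if line.lower().startswith("sitemap:")
    ]
    return found or [urljoin(base_url, "/sitemap.xml")]


def _local(tag):
    """Strip the XML namespace from a tag name."""
    return tag.rsplit("}", 1)[-1]


def parse_sitemap(xml_data):
    """Split one sitemap document into (child sitemap URLs, page URLs)."""
    root = ET.fromstring(xml_data)
    locs = [
        child.text.strip()
        for entry in root
        for child in entry
        if _local(child.tag) == "loc" and child.text
    ]
    if _local(root.tag) == "sitemapindex":
        return locs, []
    return [], locs


def find_case_studies(urls):
    """Keep URLs matching the keyword pattern and label them as HTML or PDF."""
    results = []
    for url in urls:
        if CASE_STUDY_PATTERN.search(url):
            is_pdf = url.lower().split("?", 1)[0].endswith(".pdf")
            results.append({"url": url, "type": "pdf" if is_pdf else "html"})
    return results


def collect_case_studies(base_url, max_sitemaps=50):
    """Walk the sitemaps of base_url (scheme included) and filter the page URLs."""
    with urlopen(urljoin(base_url, "/robots.txt"), timeout=30) as resp:
        robots = resp.read().decode("utf-8", errors="replace")
    queue = sitemaps_from_robots(robots, base_url)
    seen, pages = set(), []
    while queue and len(seen) < max_sitemaps:
        sitemap_url = queue.pop()
        if sitemap_url in seen:
            continue
        seen.add(sitemap_url)
        with urlopen(sitemap_url, timeout=30) as resp:
            children, urls = parse_sitemap(resp.read())
        queue.extend(children)
        pages.extend(urls)
    return find_case_studies(pages)
```

Calling `collect_case_studies("https://www.example.com")` would return a list of dictionaries with `url` and `type` keys; the base URL needs its scheme (`https://`). Filtering on the URL path is a cheap first pass, so a page whose address gives no hint of its content would still need its title or body checked separately.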
