Grab data from a page and URL

Hello,
I need to extract information from an email that contains a URL. When opened, the page at that URL shows status information and the package delivery address (the format is fixed). The goal is to connect all of this with WooCommerce. It is the long way around, because the service I use does not offer an API.
I know how to extract information from the email and how to connect with WooCommerce, but I’m stuck on how to extract the delivery information from that URL.
Is there a way to extract information from that link or web page? Is there an AI that can do it?
Thank you very much,

Hello,

Could you tell us more about the URL, provide an example of it, and say which keys/values you want to extract from the URL or page?

If you are looking to get data from a webpage, you can use scraping integrations like ScrapingBee and ScrapeNinja to get content from the page.


Hi @ipoblete

If the URL points to a publicly accessible file, please use the HTTP “GET request” module to retrieve the data. If it isn’t a file, we should verify the URL before having an HTTP module make a request.

If you require additional assistance, please don’t hesitate to reach out to us.
MSquare Support

Hi @samliew
Sure, it is a link like Live Tracking, where you get info like this:

I want to grab the “Dirección” (address) string and check that “Entrega Exitosa” (successful delivery) is present, then match both against a WooCommerce order’s information.
Best regards,

I would use ScrapeNinja’s “Scrape (Real Browser)” module for this, as client-side JavaScript needs to run to retrieve the data.

You CANNOT use an HTTP “Make a request” module to fetch this data.

Fields:

  1. CSS Selector field:   .visit-detail

  2. Extractor function field:

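    function (input, cheerio) {
      // Parse the HTML passed in by the module
      const $ = cheerio.load(input);
      return {
        // Order number from the heading, without the "Order #" prefix
        orderNum: $('h1').text().replace('Order #', '').trim(),
        // Address: the element right after the first <h3>
        address: $('h3').first().next().text().trim(),
        // Status: text of the last child of the first <ul> item
        status: $('ul').children().first().children().last().text().trim()
      };
    }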
    

Output:
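
The extractor returns an object with the keys `orderNum`, `address`, and `status`. As a rough sketch of the next step from the original question (checking for “Entrega Exitosa” and comparing the address with the order), something like the following could work. The helper names `normalize` and `checkDelivery`, the `shippingAddress` field, and the sample values are all made up for illustration; they are not part of ScrapeNinja, Make, or WooCommerce.

```javascript
// Hypothetical helpers: only the { orderNum, address, status } shape
// comes from the extractor function above.

// Normalize text so comparisons ignore case, accents, and extra whitespace.
function normalize(text) {
  return String(text ?? '')
    .normalize('NFD')                  // split accented letters into base + mark
    .replace(/[\u0300-\u036f]/g, '')   // drop the combining marks
    .replace(/\s+/g, ' ')              // collapse runs of whitespace
    .trim()
    .toLowerCase();
}

// Compare one scraped tracking record against one order.
// `order.shippingAddress` is an assumed field name, not a WooCommerce API field.
function checkDelivery(scraped, order) {
  return {
    orderNum: String(scraped.orderNum ?? '').trim(),
    delivered: normalize(scraped.status).includes(normalize('Entrega Exitosa')),
    addressMatches:
      normalize(scraped.address) === normalize(order.shippingAddress),
  };
}

// Sample values invented for this example.
const result = checkDelivery(
  { orderNum: ' 1234 ', address: 'Av. Providencia 123,  Santiago', status: 'Entrega Exitosa' },
  { shippingAddress: 'av. providencia 123, santiago' }
);
console.log(result);
```

In Make itself the same check can probably be done with a filter on the `status` value, so the code is only one way to express the logic. An exact address comparison is strict; real addresses may need looser matching.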


Cool, @samliew. Thank you.
