You already know the URL of the domain you were scraping (from the first module), so use that and prepend it.
samliew – request private consultation
Join the Make Fans Discord server to chat with other makers!
You already know the URL of the domain you were scraping (from the first module), so use that and prepend it.
samliew – request private consultation
Join the Make Fans Discord server to chat with other makers!
Yeah, I suppose, was just worried in case it’s something like “/docs/about-us” - if I just added the original link onto the back of the parsed text it may not get the right url?
What do you mean?
If you added https://www.eurogamer.net
to /docs/about-us
you get
https://www.eurogamer.net/docs/about-us
If that link is wrong then it was wrong on the original website in the first place.
samliew – request private consultation
Join the Make Fans Discord server to chat with other makers!
Edit: It seems to now work.
Weirdly though, sometimes it puts the FULL url into the sheet - and other times it just puts the text! (I’m always after the full URL)
This is the pattern I’ve put in the text parser: <a[^<“>]?href=“(?[^<”>]+?)"[^<>]>(?[^<”>]?privacy[^<">]?)</a>
Here’s what the parser outputs look like:
Yes, it all depends on what was in the source code of the link tag.
Reckon there’s much I can do to just get the full URL in all cases? I’m wanting to then pass the links into another automation. If I add the ‘original’ URL to the google sheet appended with the parser output, sometimes I’ll get a URL with double the https:// stuff on. If that makes sense.
I’ve tried it here:
… and here’s the output:
You can see in some cases its doubling the URL within the same row…
Add only when it isn’t present.
Something like this
samliew – request private consultation
Join the Make Fans Discord server to chat with other makers!
Oooo, where do I add that string into?
Your URL (A) field.
samliew – request private consultation
Join the Make Fans Discord server to chat with other makers!
Oh you gotta select the variables from the variables panel.
See how some of the special variables have a background?
Or you can manually type it out like that
{{if(indexOf(2.url; "http") = 0; 2.url; 5.value + 2.url)}}
Here are some useful links and guides you can use to learn more on how to use the Make platform, apps, and app modules. I found these useful when I was learning Make, and hope they might benefit you too —
parseDate
| Tokens for formatDate
samliew – request private consultation
Join the unofficial Make Discord server to chat with other makers!
Hi Sam, you’ve been a great help - I really appreciate your time and patience. I’m new to this, and finding the whole filtering/regex stuff all a bit over my head. You’ve helped me move things forward so thanks for that - you’ve been great. Cheers