For the life of me, I cannot connect google sheets (Get links for the website) to my apify actor. I’ve spent at least 10h on this…
I’ve tried adding variables etc… it’s impossible.
Please, if anyone has 10mins to help me out via chat, I’d be extremely grateful.
I want to scrape websites from construction companies (not only homepages, but subdomains) to get info on them, and then via gpt, use that information to personalize opening lines in e-mails.
I’ve watched dozens of hours of videos, worker with chat gpt, canvas, tried everything, can’t make it work.
Here’s my JSON for apify website content crawler:
{
“aggressivePrune”: false,
“clickElementsCssSelector”: “[aria-expanded="false"]”,
“clientSideMinChangePercentage”: 15,
“crawlerType”: “playwright:adaptive”,
“debugLog”: false,
“debugMode”: false,
“expandIframes”: true,
“ignoreCanonicalUrl”: false,
“keepUrlFragments”: false,
“maxPagesPerCrawl”: 12,
“proxyConfiguration”: {
“useApifyProxy”: true
},
“pseudoUrls”: [
{
“purl”: “[.(projekte|referenzen|leistungen|portfolio|unternehmen|ueber-uns|über-uns|about|team|services|karriere|profil|philosophie|geschichte).]”
}
],
“readableTextCharThreshold”: 100,
“removeCookieWarnings”: true,
“removeElementsCssSelector”: “nav, footer, script, style, noscript, svg, img[src^=‘data:’],[role="alert"],[role="banner"],[role="dialog"],[role="alertdialog"],[role="region"][aria-label*="skip" i],[aria-modal="true"]”,
“renderingTypeDetectionPercentage”: 10,
“respectRobotsTxtFile”: true,
“saveFiles”: false,
“saveHtml”: false,
“saveHtmlAsFile”: false,
“saveMarkdown”: true,
“saveScreenshots”: false,
“startUrls”: [
{
“url”: “Apify Console”,
“method”: “GET”
}
],
“useSitemaps”: false
}
I use the same one in input JSON box in call an actor module, and instead of the link, I place a website (from sheets) or from variables, tried everything doesn’t work.