How to scrape page requiring login?

AlbertoCabasVidani · August 6, 2024, 10:10am

First of all, I need to scrape my own page. I’m not trying to steal anything here.

In particular, I want to scrape my Substack Notes feed to extract the number of likes and comments of each post.
The URL is Home | Substack. But I need to be logged in to see it.

I found a year-old Python tutorial that logged in with an HTTP POST request. The author inspected the browser's login request to find the payload to send.

I did the same, but couldn’t find the information.

How should I proceed?

samliew · August 6, 2024, 10:51am

If there is no CAPTCHA involved, it's relatively simple.

You just have to reproduce the headers and payload that the browser sends during login, then store the cookie the server returns and send it with your later requests.
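As a rough illustration of that flow, here is a minimal Python sketch using only the standard library. The URLs, the JSON body format, and the `email`/`password` field names are assumptions for the example and not Substack's actual API; copy the real values from the login request you see in your own browser.

```python
import json
import urllib.request
from http.cookiejar import CookieJar


def make_session():
    """Build an opener that stores cookies from responses and resends them."""
    jar = CookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    return opener, jar


def build_login_request(login_url, payload, extra_headers=None):
    """Recreate the login POST seen in the browser's Network tab.

    Assumes the site expects a JSON body; some sites use form encoding instead.
    """
    body = json.dumps(payload).encode("utf-8")
    request = urllib.request.Request(login_url, data=body, method="POST")
    request.add_header("Content-Type", "application/json")
    for name, value in (extra_headers or {}).items():
        request.add_header(name, value)
    return request


def fetch_after_login(login_url, payload, page_url, extra_headers=None):
    """Log in, keep the cookies, then request a page that needs the login."""
    opener, _jar = make_session()
    with opener.open(build_login_request(login_url, payload, extra_headers)):
        pass  # any Set-Cookie headers from the response are now in the jar
    with opener.open(page_url) as response:
        return response.read().decode("utf-8", errors="replace")


# Hypothetical usage (placeholder URLs and field names):
# html = fetch_after_login(
#     "https://example.com/api/login",
#     {"email": "me@example.com", "password": "secret"},
#     "https://example.com/home",
#     extra_headers={"User-Agent": "Mozilla/5.0"},
# )
```

The same opener is used for both requests, which is what carries the login cookie over to the second call.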

Hope this helps! Let me know if there are any further questions or issues.

— @samliew

AlbertoCabasVidani · August 6, 2024, 11:13am

Yes, there’s no captcha. How do you inspect the payload to emulate it?

samliew · August 6, 2024, 11:16am

Use the Network tab in your web browser's developer tools.

In Google Chrome, press F12 to open them.

AlbertoCabasVidani · August 6, 2024, 12:46pm

Yes, but how do I find the right call?

samliew · August 6, 2024, 12:48pm

It varies from site to site, so you have to take a look yourself.

Usually it is a request that uses the POST method and goes to an endpoint like /login.
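If scrolling through the list by hand is tedious, one option is to record the login with the Network tab open, export the recording as a HAR file (a JSON format), and filter it with a short script. The sketch below is only a heuristic: the keyword list and the helper names are my own assumptions, and a site may name its login endpoint differently.

```python
import json

# Assumed keywords that often appear in login endpoints; adjust as needed.
LOGIN_HINTS = ("login", "signin", "sign-in", "session", "auth")


def load_har(path):
    """Read a HAR file exported from the browser's Network tab."""
    with open(path, encoding="utf-8-sig") as handle:
        return json.load(handle)


def find_login_candidates(har):
    """Return POST requests whose URL contains a login-like keyword."""
    candidates = []
    for entry in har["log"]["entries"]:
        request = entry["request"]
        if request["method"] != "POST":
            continue
        url = request["url"]
        if any(hint in url.lower() for hint in LOGIN_HINTS):
            # postData is absent when the request had no body
            body = request.get("postData", {}).get("text", "")
            candidates.append({"url": url, "body": body})
    return candidates


# Hypothetical usage with an exported file named "login.har":
# for candidate in find_login_candidates(load_har("login.har")):
#     print(candidate["url"], candidate["body"])
```

Each match shows the URL and the request body, which is the payload you would need to reproduce.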


AlbertoCabasVidani · August 6, 2024, 1:25pm

Ok. I’ll have to scroll through them all again.
