How to scrape URLs in a Google Sheet

The problem: I have a Google Sheet with several URLs in the same column (1 row = 1 URL). All of these URLs are user pages (GitHub user pages).
I would like my scraper to visit each of them and collect data from each user page, but I could not find a way, because I was not able to select the URLs as a selector in the Google Sheet: the scraper does not recognize the URLs as a possible selector. So I converted the Google Sheet into a PDF document. I am now able to select the URLs as a selector! But the scraper still doesn't work. I tried several types of selector (link, pagination; I tried all the different types of pagination selector) and I could not figure out how to make it work... Can you help me? Have you had the same issue?

Sitemap:
{id:"sitemap code"}

I can recommend using pastelink.net. Think about that...

Google uses a lot of Ajax, I believe. These types of sites are harder to scrape. It is easier to just download the file, copy the URLs and put them in pastelink.
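
If copying the URLs by hand gets tedious, the "download the file, copy the URLs" step could probably be scripted. Here is a rough Python sketch, assuming the sheet was downloaded as CSV and the URLs are in the first column. The `extract_urls` helper and the sample data are made up for illustration, they are not part of any scraper tool.

```python
import csv
import io


def extract_urls(csv_text, column=0):
    """Return the http(s) URLs found in one column of CSV text."""
    urls = []
    for row in csv.reader(io.StringIO(csv_text)):
        if len(row) <= column:
            continue  # skip blank or short rows
        cell = row[column].strip()
        # Keep only cells that look like URLs (this also drops a header row)
        if cell.startswith(("http://", "https://")):
            urls.append(cell)
    return urls


# Hypothetical sample standing in for the downloaded sheet (1 row = 1 URL)
sample = "url\nhttps://github.com/octocat\n\nhttps://github.com/torvalds\n"

# Print one URL per line, ready to paste wherever the scraper can read them
for url in extract_urls(sample):
    print(url)
```

For a real file, you would read its contents (for example with `open(path, newline="").read()`) and pass that text in instead of `sample`.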
