Hello,
I was wondering if there is a way to build a scraper with the webscraper.io browser extension, export its sitemap JSON, and then load that JSON into a Python script or Jupyter notebook to run the scraper.
Has anyone done this yet?
Dunno if it's possible with Python, but someone has done it with Node.js: https://www.npmjs.com/package/web-scraper-headless
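On the Python side, the exported sitemap is plain JSON, so reading it is the easy part. Here is a minimal sketch of that first step only, assuming the export has the shape I have seen (`_id`, `startUrl`, and a `selectors` list whose entries carry `id`, `type`, `parentSelectors`, `selector`). The sample sitemap and the helper names `load_sitemap` and `describe` are made up for illustration. It does not fetch any pages; actually running the selectors would still need an HTTP client and a CSS-selector-capable HTML parser on top of this.

```python
import json
from collections import defaultdict

# Hypothetical sitemap, shaped like a webscraper.io export (field names are an assumption).
SITEMAP_JSON = """
{
  "_id": "example-shop",
  "startUrl": ["https://example.com/products"],
  "selectors": [
    {"id": "product", "type": "SelectorElement", "parentSelectors": ["_root"],
     "selector": "div.product", "multiple": true},
    {"id": "name", "type": "SelectorText", "parentSelectors": ["product"],
     "selector": "h2", "multiple": false},
    {"id": "price", "type": "SelectorText", "parentSelectors": ["product"],
     "selector": "span.price", "multiple": false}
  ]
}
"""


def load_sitemap(text):
    """Parse the exported sitemap and group its selectors by parent id."""
    sitemap = json.loads(text)
    children = defaultdict(list)
    for sel in sitemap.get("selectors", []):
        for parent in sel.get("parentSelectors", ["_root"]):
            children[parent].append(sel)
    return sitemap, dict(children)


def describe(children, parent="_root", depth=0, seen=frozenset()):
    """Return indented lines describing the selector tree below `parent`."""
    if parent in seen:
        # A selector can list itself as a parent (e.g. pagination); stop the loop here.
        return []
    seen = seen | {parent}
    lines = []
    for sel in children.get(parent, []):
        lines.append(f"{'  ' * depth}{sel['id']} ({sel['type']}): {sel['selector']}")
        lines.extend(describe(children, sel["id"], depth + 1, seen))
    return lines


sitemap, children = load_sitemap(SITEMAP_JSON)
print(sitemap["startUrl"])
print("\n".join(describe(children)))
```

In a notebook you would replace `SITEMAP_JSON` with the text you exported from the extension. The parent/child grouping is the piece a real runner would walk: apply each `_root` selector to the start page, then apply its children inside each matched element.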