Hi guys,
I cannot scrape this https://www.qualit-enr.org/annuaire
Can someone help ? Any idea why i cannot find a way to scrap this page ?
Thanks
Hi guys,
I cannot scrape this https://www.qualit-enr.org/annuaire
Can someone help ? Any idea why i cannot find a way to scrap this page ?
Thanks
Only Element attribute seems to work here. Try this sitemap which will get all the region Urls and titles. You'll need to prepend the base Url later, but that should be trivial. I used Page load delay: 7000
{"_id":"forum-qualit-en","startUrl":["https://www.qualit-enr.org/annuaire"],"selectors":[{"id":"Region wrappers","type":"SelectorElement","parentSelectors":["_root"],"selector":"div > svg[id='france'] > g[id]","multiple":true,"delay":0},{"id":"a url","type":"SelectorElementAttribute","parentSelectors":["link wrappers"],"selector":"_parent_","multiple":false,"extractAttribute":"href","delay":0},{"id":"a title","type":"SelectorElementAttribute","parentSelectors":["link wrappers"],"selector":"_parent_","multiple":false,"extractAttribute":"title","delay":0},{"id":"link wrappers","type":"SelectorElement","parentSelectors":["Region wrappers"],"selector":"a","multiple":true,"delay":0}]}