Help with this website

I have a lot of problem scraping this website :

Url: Clubfinder

I don't know how to navigate through all the pages in the 4 drop down lists. What I want to retrieve on the webpages reached is :

  • Name of the Club
  • E-Mail

Can somebody help me ?

Hi,

You can start with this sitemap to traverse through all the links:

{"_id":"lions-ch","startUrl":["https://www.lions.ch/fr/ueber-uns/distrikte-clubs/clubfinder.html"],"selectors":[{"id":"links","linkType":"linkFromAttributes","multiple":true,"parentSelectors":["_root"],"selector":".tx_lionsbase_piclubselector option","type":"SelectorLink"},{"id":"name","multiple":false,"parentSelectors":["links"],"regex":"","selector":"h1","type":"SelectorText"},{"id":"e-mail","multiple":false,"parentSelectors":["links"],"regex":"","selector":"tr:contains('E-Mail') td a,tr:contains('E-mail') td a, tr:contains('Email') td a","type":"SelectorText"}]}

Some of the pages have a different layout, thus fine-tuning will be necessary.

1 Like

Many thanx ! It's perfect !