Hi team,
I'm trying to scrape exhibitor data (specifically website links) from this Ungerboeck-based site:
I created a sitemap that uses a pagination selector for the “Next” button, collects the exhibitor links from div.booth-title a, and then grabs the website URL from each exhibitor's detail page.
This method works fine on other sites with a similar structure, but on this one:
- Pagination doesn’t trigger new pages in the scraper
- The links aren’t being followed
- No data is being collected
{"_id":"hallerickson","startUrl":["[Momentus":{"id":"Exhibitors","paginationType":"auto","parentSelectors":["_root","Exhibitors"],"selector":"div.booth-title","type":"SelectorPagination"},{"id":"Weblink","linkType":"linkFromHref","multiple":false,"parentSelectors":["Exhibitors"],"selector":".contact-item a","type":"SelectorLink"}]}
I suspect it might be due to JavaScript rendering or how the site loads data via AJAX. Is there any known workaround to scrape such Ungerboeck portals with Web Scraper?
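To test the JavaScript/AJAX theory outside of Web Scraper, here is a rough Python sketch of the check I have in mind. It downloads the raw HTML without running any JavaScript and counts the elements carrying the booth-title class. The helper names (count_class_in_html, fetch_html) and the User-Agent header are my own placeholders, not anything from Web Scraper or the Ungerboeck portal.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class ClassCounter(HTMLParser):
    """Count start tags whose class attribute contains a given class name."""

    def __init__(self, class_name):
        super().__init__()
        self.class_name = class_name
        self.count = 0

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        classes = (dict(attrs).get("class") or "").split()
        if self.class_name in classes:
            self.count += 1


def count_class_in_html(html, class_name="booth-title"):
    """Return how many elements in the HTML string carry class_name."""
    parser = ClassCounter(class_name)
    parser.feed(html)
    return parser.count


def fetch_html(url):
    """Plain HTTP GET (hypothetical helper): no JavaScript is executed."""
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")
```

Calling count_class_in_html(fetch_html(url)) with the exhibitor list URL and getting 0 would suggest the listing is injected after page load, although it would not rule out other causes such as the request being blocked or the content sitting inside an iframe.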
Would really appreciate any guidance or sample sitemaps.
Thanks in advance!