Hello guys, long story short, I'm trying to scrap items from this website https://eventup.com/venues/chicago-il/ and I get NULL in some columns. Check the picture & the sitemap below.
Sitemap:
{"_id":"chicago","startUrl":["https://eventup.com/venues/chicago-il/?capacity_type=standing&page=[1-20]"],"selectors":[{"id":"Name","type":"SelectorLink","parentSelectors":["_root"],"selector":"div.property-overlay a.btn","multiple":true,"delay":0},{"id":"Category","type":"SelectorText","parentSelectors":["Name"],"selector":"div.jss143.jss167 h5.jss235","multiple":false,"regex":"","delay":0},{"id":"ADDRESS","type":"SelectorText","parentSelectors":["Name"],"selector":"p.jss235 span","multiple":false,"regex":"","delay":0},{"id":"name","type":"SelectorText","parentSelectors":["Name"],"selector":"h1.jss235","multiple":false,"regex":"","delay":0}]}
I allowed more time for Request interval & page load delay but I still get some NULL results
You had some CSS Selectors that weren't as ideal. Try this sitemap
{"_id":"chicago","startUrl":["https://eventup.com/venues/chicago-il/?capacity_type=standing&page=[1-20]"],"selectors":[{"id":"Name","type":"SelectorLink","parentSelectors":["_root"],"selector":"div.property-overlay a.btn","multiple":true,"delay":0},{"id":"Category","type":"SelectorText","parentSelectors":["Name"],"selector":".jss247","multiple":false,"regex":"","delay":0},{"id":"ADDRESS","type":"SelectorText","parentSelectors":["Name"],"selector":".jss293:first","multiple":false,"regex":"","delay":0},{"id":"name","type":"SelectorText","parentSelectors":["Name"],"selector":"H1","multiple":false,"regex":"","delay":0}]}
Thanks man ! looks good so far ! could you please tell me what you did exactly to make it scrap better ? thanks
I just selected better CSS selectors. The one you had chosen weren't consistent across pages