I'm trying to extract the name, phone number and website of each business listed on thomsonlocal.com for a particular trade. I've run a test on one page and successfully extracted all three. The problem is that the output comes as a column of names, followed by a column of numbers, and then a column of web addresses.
I assume the first name corresponds to the first number, but not every record has a website, so I can't reliably match the web addresses to the names by position.
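To illustrate the mismatch, here is a minimal Python sketch using made-up data. The names, numbers and URLs are placeholders rather than real listings, and `pair_by_position` is a hypothetical helper. It pairs three separately scraped columns by index, which is roughly what I'd have to do by hand with the current output:

```python
from itertools import zip_longest

# Made-up sample data, not real listings.
# Three businesses, but the second one has no website,
# so the scraped "site" column has only two entries.
names = ["Business A", "Business B", "Business C"]
numbers = ["01000 000001", "01000 000002", "01000 000003"]
sites = ["http://a.example", "http://c.example"]


def pair_by_position(names, numbers, sites):
    """Join three separately scraped columns by index (hypothetical helper)."""
    return list(zip_longest(names, numbers, sites, fillvalue=None))


records = pair_by_position(names, numbers, sites)
for record in records:
    print(record)
```

In this sketch Business B ends up with Business C's website and Business C ends up with none, which is why I'd prefer the scraper to return each listing as a single record.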
I can extract both the name and the number with a text selector, but I need to grab the website with an element attribute selector that reads the href.
So I'd like to know: is it possible to extract all three together as one record? My current sitemap is below.
{"_id":"thomson","startUrl":["https://www.thomsonlocal.com/electricians/UK/?Page=2"],"selectors":[{"id":"name","type":"SelectorText","parentSelectors":["_root"],"selector":"h2.businessName","multiple":true,"regex":"","delay":0},{"id":"number","type":"SelectorText","parentSelectors":["_root"],"selector":"div.phoneCont","multiple":true,"regex":"","delay":0},{"id":"site","type":"SelectorElementAttribute","parentSelectors":["_root"],"selector":"li.listingHeadLink.website a","multiple":true,"extractAttribute":"href","delay":0}]}