I am trying to scrape a directory. Everything looks fine when I validate the data by clicking "Data Preview".
However, when I run the scraper, each row contains only business_name or only business_name_english, never both.
Following is an example CSV in which each row has only one of the two fields:
| business_name | business_name_english |
|---|---|
| 대한항공 여승무원 동우회 | |
| 한인 시니어 탁구협회 | |
| CHONBUK NATIONAL UNIVERSITY | |
| 토론토 다운타운 통신원 - 정준일 | |
| 토론토통신원 - 깊은맛 욜싸 | |
| Kangwnondo People's Club | |
| WHI MOON HIGH SCHOOL | |
But when I delete either the business_name or the business_name_english selector, the remaining selector retrieves all of its data without skipping anything.
I have tried over 20 times, starting from scratch each time, and nothing helps.
Please help.
URL: http://www.budongsancanada.com/WebPage.aspx?pageid=10
Sitemap:
{"_id":"budongsan","startUrl":["http://www.budongsancanada.com/WebPage.aspx?pageid=10"],"selectors":[{"id":"category","type":"SelectorLink","parentSelectors":["_root"],"selector":"ul.YPCGList a.rounded5","multiple":true,"delay":0},{"id":"sub-category","type":"SelectorLink","parentSelectors":["category"],"selector":"div.YPCategoryContainer a.rounded5","multiple":true,"delay":0},{"id":"business_name","type":"SelectorText","parentSelectors":["sub-category"],"selector":"span.YPTitle","multiple":true,"regex":"","delay":0},{"id":"business_name_english","type":"SelectorText","parentSelectors":["sub-category"],"selector":"span.YPETitle","multiple":true,"regex":"","delay":0}]}
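
One restructuring that might be worth trying, offered as a sketch rather than a confirmed fix: when two sibling text selectors are both set to `"multiple":true`, Web Scraper tends to emit a separate row for each match instead of pairing them. A common workaround is to add an Element selector that matches one listing at a time and make both name selectors single-valued children of it. The `listing` id and the `REPLACE_WITH_LISTING_WRAPPER` selector below are placeholders and are not taken from the page; the real value would be whatever element encloses one `span.YPTitle` together with its `span.YPETitle`.

```json
{
  "_id": "budongsan",
  "startUrl": ["http://www.budongsancanada.com/WebPage.aspx?pageid=10"],
  "selectors": [
    {"id": "category", "type": "SelectorLink", "parentSelectors": ["_root"], "selector": "ul.YPCGList a.rounded5", "multiple": true, "delay": 0},
    {"id": "sub-category", "type": "SelectorLink", "parentSelectors": ["category"], "selector": "div.YPCategoryContainer a.rounded5", "multiple": true, "delay": 0},
    {"id": "listing", "type": "SelectorElement", "parentSelectors": ["sub-category"], "selector": "REPLACE_WITH_LISTING_WRAPPER", "multiple": true, "delay": 0},
    {"id": "business_name", "type": "SelectorText", "parentSelectors": ["listing"], "selector": "span.YPTitle", "multiple": false, "regex": "", "delay": 0},
    {"id": "business_name_english", "type": "SelectorText", "parentSelectors": ["listing"], "selector": "span.YPETitle", "multiple": false, "regex": "", "delay": 0}
  ]
}
```

With this layout, each `listing` element should produce one row holding both names, assuming the page really does wrap each pair of spans in a common element.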