As part of a competitor analysis, I am trying to extract all product titles and prices from different categories on online pharmacy sites. The category pages are paginated. It looks like my sitemap is scraping duplicate records. From what I have read online, this could be an issue with the element selector, but I would appreciate expert advice on this.
URL: https://asteronline.com/personal-care/nail-care.html (the site is slow)
Sitemap:
{
  "_id": "aster_1",
  "startUrl": [
    "https://asteronline.com/personal-care/nail-care.html"
  ],
  "selectors": [
    {
      "id": "pagination",
      "paginationType": "clickMore",
      "parentSelectors": [
        "_root",
        "pagination"
      ],
      "selector": ".product-list-container .pages-items a",
      "type": "SelectorPagination"
    },
    {
      "delay": 0,
      "id": "parent",
      "multiple": true,
      "parentSelectors": [
        "pagination"
      ],
      "selector": "li.product",
      "type": "SelectorElement"
    },
    {
      "delay": 0,
      "id": "title",
      "multiple": false,
      "parentSelectors": [
        "parent"
      ],
      "regex": "",
      "selector": "a.product-item-link",
      "type": "SelectorText"
    },
    {
      "delay": 0,
      "id": "price",
      "multiple": false,
      "parentSelectors": [
        "parent"
      ],
      "regex": "",
      "selector": "div.price-box",
      "type": "SelectorText"
    }
  ]
}
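
To put a number on the duplicates, a rough check along these lines might help. This is only a sketch: it assumes the scraped data has been exported as CSV with `title` and `price` columns (names assumed from the selector ids in the sitemap), and the helper `find_duplicates` and the sample rows are hypothetical, not real output from the site.

```python
import csv
import io
from collections import Counter


def find_duplicates(csv_text, key_fields=("title", "price")):
    """Return {key_tuple: count} for rows whose key fields occur more than once."""
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = Counter(
        # Missing or empty cells are treated as empty strings.
        tuple((row.get(field) or "").strip() for field in key_fields)
        for row in reader
    )
    return {key: n for key, n in counts.items() if n > 1}


# Hypothetical sample standing in for an exported CSV file.
sample = """title,price
Nail Clipper,10.00
Nail File,5.00
Nail Clipper,10.00
"""

duplicates = find_duplicates(sample)
for key, n in duplicates.items():
    print(f"{key} appears {n} times")
```

With a real export, the file contents could be read with `open(path, newline="", encoding="utf-8").read()` and passed in the same way. If every product shows up the same number of times, that would point at the pagination or element selector visiting the same page repeatedly rather than at the site listing a product twice.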