Scrape a PrestaShop site

Please help with this page: https://kraftdele.info. I need to download all products, descriptions, photos, attributes, categories and subcategories. I am a complete novice. Thank you.

Url: https://kraftdele.info/

Sitemap:
{id:"https://kraftdele.info/1_pl_0_sitemap.xml"}

Hi @stamp

Well, this should help you get started:

{"_id":"kraftdele","startUrl":["https://kraftdele.info/38-agregaty-jednofazowe"],"selectors":[{"id":"product-wrapper","type":"SelectorElement","parentSelectors":["_root","pagination"],"selector":"ul.product_list > li","multiple":true,"delay":0},{"id":"product-name","type":"SelectorText","parentSelectors":["product-wrapper"],"selector":"a.product-name","multiple":false,"regex":"","delay":0},{"id":"product-link","type":"SelectorLink","parentSelectors":["product-wrapper"],"selector":"a.product-name","multiple":false,"delay":0},{"id":"product-price","type":"SelectorText","parentSelectors":["product-wrapper"],"selector":"span[itemprop='price']","multiple":false,"regex":"","delay":0},{"id":"pagination","type":"SelectorLink","parentSelectors":["_root","pagination"],"selector":"#pagination_next_bottom a:nth(0)","multiple":true,"delay":0},{"id":"produt-image","type":"SelectorImage","parentSelectors":["product-wrapper"],"selector":"img","multiple":false,"delay":0}]}

As a complete novice, you should check out these resources too:

* Installation | Web Scraper Documentation
* Web Scraper Tutorials
* Web Scraper "How to" video tutorials

Hope it helps! :smiley:

Hi viesturs, thank you very much. I tried to modify it to download everything, but without success. I watched the videos, but it still doesn't work for me. The complete code would help me the most, and that is how I would learn. I am also willing to pay; I am getting desperate. :frowning: Thank you.

Do you know how many products there are? If there are more than 5 thousand, the extension might not be enough and you will probably need to use Web Scraper's Cloud Scraper.

Hi viesturs, thank you for the answer. There are about 1000 products. The most I have managed to get is 800, because I can't reach the rest without going through the subcategories. Thank you.

@stamp Try this then:

{"_id":"kraftdwele","startUrl":["https://kraftdele.info/38-agregaty-jednofazowe"],"selectors":[{"id":"product-wrapper","type":"SelectorElement","parentSelectors":["pagination","categories","sub-categories","sub-sub-categories"],"selector":"ul.product_list > li","multiple":true,"delay":0},{"id":"product-name","type":"SelectorText","parentSelectors":["product-wrapper"],"selector":"a.product-name","multiple":false,"regex":"","delay":0},{"id":"product-link","type":"SelectorLink","parentSelectors":["product-wrapper"],"selector":"a.product-name","multiple":false,"delay":0},{"id":"product-price","type":"SelectorText","parentSelectors":["product-wrapper"],"selector":"span[itemprop='price']","multiple":false,"regex":"","delay":0},{"id":"pagination","type":"SelectorLink","parentSelectors":["pagination","categories","sub-categories","sub-sub-categories"],"selector":"#pagination_next_bottom a:nth(0)","multiple":true,"delay":0},{"id":"produt-image","type":"SelectorImage","parentSelectors":["product-wrapper"],"selector":"img","multiple":false,"delay":0},{"id":"categories","type":"SelectorLink","parentSelectors":["_root"],"selector":".tree > li > a","multiple":true,"delay":0},{"id":"sub-categories","type":"SelectorLink","parentSelectors":["categories"],"selector":".tree > li:has(span.OPEN) > ul > li > a","multiple":true,"delay":0},{"id":"sub-sub-categories","type":"SelectorLink","parentSelectors":["sub-categories"],"selector":".tree > li:has(span.OPEN) > ul > li:has(span.OPEN) > ul > li > a","multiple":true,"delay":0}]}

viesturs, you are awesome. I don't know how to thank you. It went through every category, subcategory and sub-subcategory and downloaded the products. But how do we make it open each product and download the short description, the long description, the photos, the price and the name? Right now it is all confusing and complicated for me. Help me with this and then I will just study and study. Thank you very much.
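One way to approach this, as far as I understand Web Scraper: any selector whose parent is a link selector is evaluated on the page that link opens. So the product details can be added as children of `product-link`. Below is a minimal Python sketch that adds such child selectors to a sitemap and prints JSON you can paste into "Import Sitemap". The helper `add_detail_selectors`, the new selector ids, and all the CSS selectors in `DETAIL_SELECTORS` are my assumptions (they are typical for a default PrestaShop 1.6 theme, not verified against kraftdele.info), so check each one on a real product page first.

```python
import copy
import json

# Hypothetical CSS selectors for a default PrestaShop 1.6 product page.
# Every selector here is an assumption; verify each in the browser inspector.
DETAIL_SELECTORS = [
    # (id, Web Scraper selector type, CSS selector, multiple)
    ("detail-name", "SelectorText", "h1[itemprop='name']", False),
    ("detail-price", "SelectorText", "#our_price_display", False),
    ("short-description", "SelectorText", "#short_description_content", False),
    ("long-description", "SelectorText", "section.page-product-box div.rte", False),
    ("detail-images", "SelectorImage", "#thumbs_list img", True),
]


def add_detail_selectors(sitemap, link_id="product-link"):
    """Return a copy of `sitemap` with detail selectors nested under `link_id`."""
    result = copy.deepcopy(sitemap)  # leave the caller's sitemap untouched
    existing_ids = {selector["id"] for selector in result["selectors"]}
    if link_id not in existing_ids:
        raise ValueError(f"sitemap has no selector with id {link_id!r}")
    for sel_id, sel_type, css, multiple in DETAIL_SELECTORS:
        if sel_id in existing_ids:
            continue  # do not add the same selector twice
        entry = {
            "id": sel_id,
            "type": sel_type,
            # The parent is the link selector, so this runs on the product page.
            "parentSelectors": [link_id],
            "selector": css,
            "multiple": multiple,
            "delay": 0,
        }
        if sel_type == "SelectorText":
            entry["regex"] = ""  # same shape as the text selectors in this thread
        result["selectors"].append(entry)
    return result


# Trimmed copy of the sitemap from this thread, just enough to demonstrate.
EXAMPLE_SITEMAP = {
    "_id": "kraftdele",
    "startUrl": ["https://kraftdele.info/38-agregaty-jednofazowe"],
    "selectors": [
        {"id": "product-wrapper", "type": "SelectorElement",
         "parentSelectors": ["_root"], "selector": "ul.product_list > li",
         "multiple": True, "delay": 0},
        {"id": "product-link", "type": "SelectorLink",
         "parentSelectors": ["product-wrapper"], "selector": "a.product-name",
         "multiple": False, "delay": 0},
    ],
}

if __name__ == "__main__":
    # Paste the printed JSON into Web Scraper's "Import Sitemap" dialog.
    print(json.dumps(add_detail_selectors(EXAMPLE_SITEMAP)))
```

To use it on the full sitemap, replace `EXAMPLE_SITEMAP` with the complete JSON from the post above (loaded with `json.loads`). Note that an image selector with `"multiple": true` will probably give one result row per image, and the thumbnail list may only hold small versions of the photos, so you might need a different selector for full-size images.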

Hi there,

Sorry to hijack this thread, but I have the same issue as stamp:
How can I get the same data for this site (also PrestaShop): https://www.cupcake.pt ?
The problem here is that the pagination looks like "1,2,3,...,64 >", as in this example: https://www.cupcake.pt/13-sinto-me-sexy

How can I fetch all items from this page?

Thank you in advance
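For numbered pagination, a common approach is the same trick used in the sitemaps above: a link selector named `pagination` that lists itself as a parent, but pointed at all the page-number links instead of only the "next" link. As far as I know, Web Scraper does not visit the same URL twice, so overlapping page links should not cause a loop. Here is a rough Python sketch that builds such a sitemap. The function `build_paginated_sitemap`, the sitemap id, and the CSS selectors (`ul.pagination li a`, `ul.product_list > li`, `a.product-name`) are assumptions on my part, carried over from the kraftdele sitemap and not checked against cupcake.pt.

```python
import json


def build_paginated_sitemap(sitemap_id, start_url, page_link_css):
    """Build a Web Scraper sitemap dict that follows numbered page links."""
    pagination = {
        "id": "pagination",
        "type": "SelectorLink",
        # Listing "pagination" as its own parent makes the scraper keep
        # following page links found on every page it opens.
        "parentSelectors": ["_root", "pagination"],
        "selector": page_link_css,
        "multiple": True,
        "delay": 0,
    }
    product_wrapper = {
        "id": "product-wrapper",
        "type": "SelectorElement",
        # Products are collected on the first page and on every later page.
        "parentSelectors": ["_root", "pagination"],
        "selector": "ul.product_list > li",  # assumed, as in the sitemap above
        "multiple": True,
        "delay": 0,
    }
    product_name = {
        "id": "product-name",
        "type": "SelectorText",
        "parentSelectors": ["product-wrapper"],
        "selector": "a.product-name",  # assumed, as in the sitemap above
        "multiple": False,
        "regex": "",
        "delay": 0,
    }
    return {
        "_id": sitemap_id,
        "startUrl": [start_url],
        "selectors": [pagination, product_wrapper, product_name],
    }


if __name__ == "__main__":
    sitemap = build_paginated_sitemap(
        "cupcake",  # hypothetical sitemap id
        "https://www.cupcake.pt/13-sinto-me-sexy",
        "ul.pagination li a",  # hypothetical selector for the page links
    )
    # Paste the printed JSON into Web Scraper's "Import Sitemap" dialog.
    print(json.dumps(sitemap))
```

If the page-number selector does not match on the real site, the "next" arrow alone (the ">" link) should also work with the same self-referencing parent setup, just one page at a time.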

Found it!