Hello,
Two Questions:
-
When using the Start URL as https://www.lazada.co.th/shop-makeup-accessories/?page=[1-10]
Why does the scraping usually start at the highest page number and then go backwards? i.e. page 10,9,8,7,6,5... -
Why does the output data from all multi-page website scrapes appear in random page order?
See example output below:
web-scraper-start-url
https://www.lazada.co.th/shop-makeup-accessories/?page=10
https://www.lazada.co.th/shop-makeup-accessories/?page=8
https://www.lazada.co.th/shop-makeup-accessories/?page=5
https://www.lazada.co.th/shop-makeup-accessories/?page=3
https://www.lazada.co.th/shop-makeup-accessories/?page=1
https://www.lazada.co.th/shop-makeup-accessories/?page=1
https://www.lazada.co.th/shop-makeup-accessories/?page=4
https://www.lazada.co.th/shop-makeup-accessories/?page=8
https://www.lazada.co.th/shop-makeup-accessories/?page=9
https://www.lazada.co.th/shop-makeup-accessories/?page=6
https://www.lazada.co.th/shop-makeup-accessories/?page=4
https://www.lazada.co.th/shop-makeup-accessories/?page=1
https://www.lazada.co.th/shop-makeup-accessories/?page=10
https://www.lazada.co.th/shop-makeup-accessories/?page=8
https://www.lazada.co.th/shop-makeup-accessories/?page=1
- My actual goal is to scrape all products from pages 1-90, however I noticed the scraping stops after about 4-5 pages are scraped. Can anyone get this working for categories on www.lazada.co.th ?
Thanks!