Multiple product images

Hi!
I've spent a few hours but can't get it right. I want the csv to have only one row per product and the image urls (not the tumbnails) in separate columns. Whatever I try I get a new row for each image. Please help!

Url: http://www.crafton.se

Sitemap:
{"_id":"crafton_nyheter","startUrl":["http://www.crafton.se/cgi-bin/ibutik/AIR_ibutik.pl?funk=steg_tva&extra=nyheter"],"selectors":[{"id":"Produktlista","type":"SelectorLink","selector":"div.PT_Bildruta a","parentSelectors":["Visa allt"],"multiple":true,"delay":0},{"id":"Visa allt","type":"SelectorLink","selector":"div.Artgrp_VisaFlerArtiklar a:nth-of-type(3)","parentSelectors":["_root"],"multiple":false,"delay":0},{"id":"produktid","type":"SelectorText","selector":"h1","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"artikelnr","type":"SelectorText","selector":"div#ArtnrFalt","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"produkttext","type":"SelectorText","selector":"div#Produktinfo","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"produkttext2","type":"SelectorHTML","selector":"div#Produktinfo","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"Produktkategori","type":"SelectorText","selector":"a.BreadCrumb:nth-of-type(2)","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"underkategori","type":"SelectorText","selector":"a.BreadCrumbActive","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"pris","type":"SelectorText","selector":"span.PrisBOLD","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"produktsida","type":"SelectorElement","selector":"div.customer","parentSelectors":["Produktlista"],"multiple":false,"delay":0},{"id":"bilder","type":"SelectorElement","selector":"div#Bildkolumn","parentSelectors":["produktsida"],"multiple":false,"delay":0},{"id":"bildlankar","type":"SelectorPopupLink","selector":"div.BildZoomThumbnails:nth-of-type(n+2) img","parentSelectors":["bilder"],"multiple":true,"delay":0}]}

Hi

it is the normal way of proceding of this extension.

You have to retreat with Excel the result of the scraping, but it is not so easy.

Good luck

It's because your popup link selector had multiple option checked. If you want multiple images/items in one row you have to select them with grouped selector or make a separate selector for each image/item.

Here is an updated version of your sitemap with grouped selector:

{"_id":"crafton_nyheter","startUrl":["http://www.crafton.se/cgi-bin/ibutik/AIR_ibutik.pl?funk=steg_tva&extra=nyheter"],"selectors":[{"id":"Produktlista","type":"SelectorLink","selector":"div.PT_Bildruta a","parentSelectors":["Visa allt"],"multiple":true,"delay":0},{"id":"Visa allt","type":"SelectorLink","selector":"div.Artgrp_VisaFlerArtiklar a:nth-of-type(3)","parentSelectors":["_root"],"multiple":false,"delay":0},{"id":"produktid","type":"SelectorText","selector":"h1","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"artikelnr","type":"SelectorText","selector":"div#ArtnrFalt","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"produkttext","type":"SelectorText","selector":"div#Produktinfo","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"produkttext2","type":"SelectorHTML","selector":"div#Produktinfo","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"Produktkategori","type":"SelectorText","selector":"a.BreadCrumb:nth-of-type(2)","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"underkategori","type":"SelectorText","selector":"a.BreadCrumbActive","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"pris","type":"SelectorText","selector":"span.PrisBOLD","parentSelectors":["produktsida"],"multiple":false,"regex":"","delay":0},{"id":"produktsida","type":"SelectorElement","selector":"div.customer","parentSelectors":["Produktlista"],"multiple":true,"delay":0},{"id":"bilder","type":"SelectorGroup","selector":"div.BildZoomThumbnails:nth-of-type(n+2) a","parentSelectors":["produktsida"],"delay":0,"extractAttribute":"href"}]}

Note that if you use grouped selector to extract images you won't be able to download them with image download script.

1 Like

Thank you so much! That will be perfect! To bad about the image downloader but maybe one day it will work :slight_smile: