Scraping the Website

Url: https://www.oeffnungszeitenbuch.de/stadt/Berlin-1.html

I want to scrape the Company and Adress.

The Problem is that export is not in the same Line of the Table.

Here a Screenshot: https://www.bilder-upload.eu/bild-b7294c-1558035732.png.html

Sitemap:
{"_id":"berlin","startUrl":["https://www.oeffnungszeitenbuch.de/stadt/Berlin-1.html"],"selectors":[{"id":"Name","type":"SelectorText","parentSelectors":["_root"],"selector":"div:nth-of-type(4) u","multiple":true,"regex":"","delay":0},{"id":"Adress","type":"SelectorText","parentSelectors":["_root"],"selector":"div:nth-of-type(4) div.cboxAddr","multiple":true,"regex":"","delay":0}]}

This happens because both of your selectors have been checked as 'Multiple', which causes the rows to misalign.

This will do the trick:

{"_id":"berlin","startUrl":["https://www.oeffnungszeitenbuch.de/stadt/Berlin-1.html"],"selectors":[{"id":"Name","type":"SelectorText","parentSelectors":["element"],"selector":"u","multiple":false,"regex":"","delay":0},{"id":"Adress","type":"SelectorText","parentSelectors":["element"],"selector":"div.cboxAddr","multiple":false,"regex":"","delay":0},{"id":"element","type":"SelectorElement","parentSelectors":["_root"],"selector":".cboxcontent","multiple":true,"delay":0}]}

Hello, thank you for your help it works fine :grinning:
Now i have a second question - the pagination is not like in the Video Tuturial because the Side is ending wit .html

I think i have to use a element click and generate them seperate but together they dont work

Your Script:

{"_id":"berlin","startUrl":["https://www.oeffnungszeitenbuch.de/stadt/Berlin-1.html"],"selectors":[{"id":"Name","type":"SelectorText","parentSelectors":["element"],"selector":"u","multiple":false,"regex":"","delay":0},{"id":"Adress","type":"SelectorText","parentSelectors":["element"],"selector":"div.cboxAddr","multiple":false,"regex":"","delay":0},{"id":"element","type":"SelectorElement","parentSelectors":["_root"],"selector":".cboxcontent","multiple":true,"delay":0}]}

My Pagination Script:

{"_id":"dfg","startUrl":["https://www.oeffnungszeitenbuch.de/stadt/Berlin-1.html"],"selectors":[{"id":"berlin78","type":"SelectorElementClick","parentSelectors":["_root"],"selector":"a:nth-of-type(8)","multiple":true,"delay":"5","clickElementSelector":"a:nth-of-type(8)","clickType":"clickOnce","discardInitialElements":"do-not-discard","clickElementUniquenessType":"uniqueText"}]}

You need to use the Link selector for the pagination in this one:

{"_id":"dfg","startUrl":["https://www.oeffnungszeitenbuch.de/stadt/Berlin-1.html"],"selectors":[{"id":"berlin78","type":"SelectorElement","parentSelectors":["_root","pagination"],"selector":"div.cboxinnerL","multiple":true,"delay":""},{"id":"pagination","type":"SelectorLink","parentSelectors":["_root","pagination"],"selector":"a:contains('Nächste Seite')","multiple":true,"delay":0},{"id":"name","type":"SelectorText","parentSelectors":["berlin78"],"selector":"u","multiple":false,"regex":"","delay":0}]}