Different Domains with different selectors

Hi,

is it possible to scrape different domains with different selecors?
For excample: I want to scrape the table on website A und the table on website B. The problem is the different structure. So I need to implement different/specific selectors for the different pages. Is this viable?

A: https://allegegenpistor.wdr2.de/tipptrend.php
B: https://www.sattipp.de/spt/bl-17/27/

Hey!

You can create either few selectors to pick data according to a website, or merge them into each other, but separated by a comma to save space in end results.

Example:
Selector 1 - will pick field named, for ex. Name, on first website, but will return 'null' on second website
Selector 2 - will pick field named, for ex. surname, on second website, but will return 'null' on first website

You can create a mixed selector, with both Selector 1 and Selector 2 inside, but separated by a comma, so it will pick 1st available field. That means, if you reach website 2 and, if two selectors were there, 1st would return null, but instead it will just skip non-existent field and pick second, thus rendering end results much more usable.

As for different websites, you can add any in your Metadata. (Menu -> Sitemap -> Metadata)

Okay, thx for helping again. I trying it since a hour and I have some problems with this website:

https://www.kickform.de/bundesliga-tipps/

The first table ("TOP-PROGNOSEN / 1. SPIELTAG) doestn seem to be a real table; so I cant select it with the table-selector. If i select the elements with "text", it doesnt combine the 4 elements in a row (for example the first row Bayern, 2:1, Hoffenheim, 24.08.2018 20:30 Uhr)....so each scraped element has his own row.

Can you help me again? :frowning:

If you want to group your selectors in one row, you can use Element selector. It will act as a wrapper for a group of elements contained inside. Then all you've left to add is text selectors to pick data.

Here's an example:

{"_id":"kickform","startUrl":["https://www.kickform.de/bundesliga-tipps/"],"selectors":[{"id":"group","type":"SelectorElement","selector":"div.pos li","parentSelectors":["_root"],"multiple":true,"delay":0},{"id":"Team1","type":"SelectorText","selector":"strong.left a","parentSelectors":["group"],"multiple":false,"regex":"","delay":0},{"id":"Team2","type":"SelectorText","selector":"strong.right a","parentSelectors":["group"],"multiple":false,"regex":"","delay":0},{"id":"Result","type":"SelectorText","selector":"div.top span","parentSelectors":["group"],"multiple":false,"regex":"","delay":0}]}

1 Like