Hi,
I need help again with this page: https://www.sattipp.de/spt/bl-18/1/
I want to extract some data from the table there, for example the score in the cell "2:0 [Info]". How can I scrape only "2:0", without the "[Info]"?
Here's a nice trick: extract all the info you want, copy and paste it into Excel, press CTRL+F, choose Replace, type [Info] under "Find what", leave "Replace with" empty, and finally click Replace All. Problem solved.
Thanks for your answer.
I know this "trick".
Isn't it possible to extract a cell without a certain part of its text?
I don't think it's possible in that case
Hi!
To pick only the numbers when the results all follow the same pattern (scores of the form number:number), you can use a regex in your selector:
\d+.+\d
Open your score selector and enter the regex above to pick the numbers without [Info].
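If you want to try the pattern outside the browser first, here is a minimal sketch in Python. WebScraper itself runs on JavaScript regex, but for this simple pattern Python's re module behaves the same way. The helper name extract_score and the second sample text are made up for illustration.

```python
import re

# Pattern from the post above. The sample cell texts are illustrative.
SCORE_PATTERN = re.compile(r"\d+.+\d")


def extract_score(cell_text: str) -> str:
    """Return the first match of the pattern, or "" if nothing matches."""
    match = SCORE_PATTERN.search(cell_text)
    return match.group(0) if match else ""


print(extract_score("2:0 [Info]"))   # 2:0
print(extract_score("10:2 [Info]"))  # 10:2
```

A cell with no digits at all simply yields an empty string here.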
P.S. looks like funny looking meme haha
Super, it works! But: Why? What does "\d+.+\d" mean?
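\d stands for any digit character; the + after it means "one or more", so \d+ picks up the whole run of digits.
. stands for any single character, so .+ matches one or more characters of any kind.
The final \d forces the match to end on a digit, which is why the trailing " [Info]" is left out.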
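To see which part of the text each piece of the pattern picks up, you can wrap the pieces in capture groups. This is only a sketch for experimenting (the groups and the name split_pieces are my additions, not something WebScraper needs). It also shows a limitation: .+ is greedy, so if another digit appears later in the cell, the match stretches up to that last digit.

```python
import re

# Same pattern, with each piece in its own capture group so it can be inspected.
PIECES = re.compile(r"(\d+)(.+)(\d)")


def split_pieces(cell_text: str) -> tuple:
    """Return what \\d+, .+ and the final \\d matched, or () if there is no match."""
    match = PIECES.search(cell_text)
    return match.groups() if match else ()


print(split_pieces("2:0 [Info]"))    # ('2', ':', '0')
# Hypothetical cell with a later digit: the greedy .+ runs up to the last digit.
print(split_pieces("2:0 [Info 7]"))  # ('2', ':0 [Info ', '7')
```

If your cells can contain other digits, a stricter pattern such as \d+:\d+ would be a safer choice.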
You can test your regex on https://regex101.com/ before using it.
Please keep in mind that WebScraper does not support any regex flags, e.g. the global flag.
At the moment, the effect of the global flag can only be achieved by using WebScraper together with the Tampermonkey extension.
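To illustrate what the missing global flag means in practice: as far as I understand it, without that flag you only get the first match in the text. The sketch below mimics this in Python, where re.search returns the first match and re.findall behaves like a global search. The stricter pattern \d+:\d+ and the sample text with two scores are assumptions for the demo.

```python
import re

# Hypothetical text containing two scores in one cell.
text = "2:0 [Info] 1:1 [Info]"

# Stricter pattern (an assumption, not the one from the post above).
pattern = re.compile(r"\d+:\d+")


def first_match(s: str) -> str:
    """Behaves like a regex without the global flag: only the first hit."""
    m = pattern.search(s)
    return m.group(0) if m else ""


def all_matches(s: str) -> list:
    """Behaves like a regex with the global flag: every hit."""
    return pattern.findall(s)


print(first_match(text))  # 2:0
print(all_matches(text))  # ['2:0', '1:1']
```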
Super, Thank you very much!
I have another question (sorry!):
Another website (1) has a table where the football results are split across 3 cells. I want to scrape them into 1 cell. Is that also possible with regex?
The results are split across separate table data cells there. You can use an Element selector to pick up the rows and add Text selectors inside it to pick up the results. You don't need regex there.
Example sitemap:
{"_id":"fussballvorhersage","startUrl":["http://www.fussballvorhersage.de/b1/b1_20180817.htm"],"selectors":[{"id":"row","type":"SelectorElement","selector":"tr:nth-of-type(6) tr","parentSelectors":["_root"],"multiple":true,"delay":0},{"id":"count1","type":"SelectorText","selector":"td:nth-of-type(4)","parentSelectors":["row"],"multiple":false,"regex":"","delay":0},{"id":"colon","type":"SelectorText","selector":"td:nth-of-type(5)","parentSelectors":["row"],"multiple":false,"regex":"","delay":0},{"id":"count2","type":"SelectorText","selector":"td:nth-of-type(6)","parentSelectors":["row"],"multiple":false,"regex":"","delay":0}]}
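The sitemap above still gives you three columns (count1, colon, count2). If you want them in one cell, one option is to join them after exporting. Here is a minimal sketch in Python, assuming each scraped row ends up as a dictionary keyed by the selector ids. The sample rows are made up, and join_score is a hypothetical helper name.

```python
def join_score(row: dict) -> str:
    """Glue the three scraped cells together into one score string."""
    keys = ("count1", "colon", "count2")  # selector ids from the sitemap above
    # Missing or empty cells become "", and stray whitespace is trimmed.
    return "".join((row.get(key) or "").strip() for key in keys)


# Made-up rows in the shape the export is assumed to have.
rows = [
    {"count1": "2", "colon": ":", "count2": "1"},
    {"count1": " 0", "colon": ":", "count2": "0 "},
]

for row in rows:
    print(join_score(row))
```

The same join can of course be done with a spreadsheet formula if you prefer to stay in Excel.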
Great, thanks!
Back to regex: "3:1 [Info]"
Scraping the data without "[Info]" works fine.
Now I want to extract the single digits.
First I extracted the first one (3) with (\d); it works!
But how do I scrape only the second one (1), without [Info]?
Edit: I got it: "[^d] "
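For anyone reading later, a short note on why that works: [^d] means "any one character except the letter d" (it is not the same as \d), and it is followed by a space. In "3:1 [Info]" the first character that is followed by a space is the 1, so the match is "1 ", including the trailing space. The sketch below checks this in Python and also tries an alternative with a lookbehind, (?<=:)\d+, which is my own suggestion. I have not tested the lookbehind inside WebScraper, and whether it works there depends on the browser's regex engine.

```python
import re

cell = "3:1 [Info]"  # sample text from the post above


def second_digit_original(text: str) -> str:
    """Pattern from the post: a character that is not 'd', then a space."""
    m = re.search(r"[^d] ", text)
    return m.group(0) if m else ""


def second_digit_lookbehind(text: str) -> str:
    """Alternative (assumption): digits that come directly after a colon."""
    m = re.search(r"(?<=:)\d+", text)
    return m.group(0) if m else ""


print(repr(second_digit_original(cell)))    # '1 '  (note the trailing space)
print(repr(second_digit_lookbehind(cell)))  # '1'
```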