Hi Guy's, newbie / beginner here… I am trying to scrape the horse racing tips data from https://gg.co.uk/tips/today
I can extract the horse, time and course data and exports the data into csv just fine.
What I would like to add to this is the forecast odds data that are shown in the morning. The "data element" is then updated during the day to show the race result and odds.
When I add the odds element, I do get the data but the structure is all wrong.
What would be the best way to get the odds/results alongside the currently extracted data.
This import will get the time, course and horses name.
Thanks in advance to anyone that can help
Tim
UPDATE - This sitemap nearly works, but it doesn’t get the results in the correct column though. I don't seem able to select the elements correctly.
{"_id":"gg-results","startUrl":["https://gg.co.uk/tips/02-jan-2020"],"selectors":[{"id":"ele1","type":"SelectorElement","parentSelectors":["_root"],"selector":"td:nth-of-type(2)","multiple":true,"delay":0},{"id":"time","type":"SelectorText","parentSelectors":["ele1"],"selector":"a.winning-post","multiple":false,"regex":"","delay":0},{"id":"horse","type":"SelectorText","parentSelectors":["ele1"],"selector":"a.horse","multiple":false,"regex":"","delay":0},{"id":"ele2","type":"SelectorElement","parentSelectors":["_root"],"selector":"td.tips-price","multiple":true,"delay":0},{"id":"result","type":"SelectorText","parentSelectors":["ele2"],"selector":"parent","multiple":false,"regex":"","delay":0}]}
Any help is much appreciated..