Hi Community,
I'm trying to scrape the archive of a newspaper. To do that, I need to scrape each article's name, then open the article and scrape its content. Since there are many articles, I need to move on to the next page of articles once the articles on the current page have been scraped. With help from this forum I was able to get this far (see below), but the scraper just skips from page to page without scraping the articles. Any ideas on how to solve this?
Thanks so much in advance!
URL: https://www.sueddeutsche.de/archiv/politik/2020 (Archiv – Politik Nachrichten – 2020 – Sueddeutsche.de)
Sitemap:
{"_id":"sz2020neu","startUrl":["https://www.sueddeutsche.de/archiv/politik/2020"],"selectors":[{"id":"artikel","type":"SelectorElement","parentSelectors":["neueseite"],"selector":"em > ","multiple":true,"delay":0},{"id":"inhalt","type":"SelectorText","parentSelectors":["artikel"],"selector":"p.css-13wylk3","multiple":true,"regex":"","delay":0},{"id":"neueseite","type":"SelectorLink","parentSelectors":["_root","neueseite"],"selector":".arrow a","multiple":true,"delay":0}]}