Apologize for the dumb question in advance but this seems like it should be easy to scrape but because their HTML is organized poorly I'm having trouble.
I would like the blog name and URLs of the list of 50 on this site
Url: https://blogging.com/top-bloggers/
I mostly select what I want manually by clicking but sometimes, as occurs here, it grabs stuff I don't want. Is there a way to deselect elements or is there instruction on how to write out the selector? I know I want the first <a href>
tag under the first<p>
tag of every <h2>
tag.
Sitemap:
{"_id":"blogs","startUrl":["https://blogging.com/top-bloggers/"],"selectors":[{"id":"Name","type":"SelectorText","selector":"h2","parentSelectors":["_root"],"multiple":true,"regex":"","delay":0},{"id":"url","type":"SelectorText","selector":"p:nth-of-type(n+6) a:nth-of-type(1)","parentSelectors":["_root"],"multiple":true,"regex":"","delay":0}]}