I am trying to apply this approach ...
... onto my situation. Here's my current sitemap:
{"_id":"esp","startUrl":["https://www.paginasamarillas.es/search/apartamento/all-ma/barcelona/all-is/barcelona/all-ba/all-pu/all-nc/1"],"selectors":[{"id":"page","parentSelectors":["_root","page"],"paginationType":"auto","selector":".pagination a","type":"SelectorPagination"},{"id":"wrapper","parentSelectors":["page"],"type":"SelectorElement","selector":".item-ip div.box","multiple":true,"delay":0},{"id":"str","parentSelectors":["wrapper"],"type":"SelectorText","selector":"span[itemprop='streetAddress']","multiple":false,"delay":0,"regex":""},{"id":"sip","parentSelectors":["wrapper"],"type":"SelectorText","selector":"span[itemprop='postalCode']","multiple":false,"delay":0,"regex":""}]}
A cannot simply apply "a.contains" as suggested in the shared post, due to the favicon instead of an '>' char.
I wonder how to scrape these address components. I tried different pagination types, only resulting in scraping multiple addresses on a single page.
How should I approach this?