HELP! Scraper does find correct links but only checks the last one?

Hi all,

I am trying to scrape some info from the following site: Search Results

I managed to get the scraper to expand the list and find the desired links (namely the clickable company names). Now I want to scrape some info from the page behind every company link (for simplicity, let's say I want the company name from each company page), but the scraper returns the name of the last company for all company links! Does anyone know how to solve this?

{"_id":"sapcompaniesdenmark","startUrl":["Search Results p","clickType":"clickMore","discardInitialElements":"do-not-discard","clickElementUniquenessType":"uniqueCSSSelector"},{"id":"SelectCompanyLinkInsideWrapper","type":"SelectorLink","parentSelectors":["ExpandListAndSelectCompanyWrappers"],"selector":".search-result__head a","multiple":false,"delay":0},{"id":"SaveName","type":"SelectorText","parentSelectors":["SelectCompanyLinkInsideWrapper"],"selector":".partner-details section:nth-of-type(1) header","multiple":false,"regex":"","delay":0}]}

Hi @qwerty, your sitemap did not work when I tried to copy it, but I hope I managed to figure out what you were after! :wink:

{"_id":"partneredge-sap-com","startUrl":[""],"selectors":[{"id":"company-wrapper","type":"SelectorElementClick","parentSelectors":["_root"],"selector":"","multiple":true,"delay":"1200","clickElementSelector":"button.btn__show-more","clickType":"clickMore","discardInitialElements":"discard-when-click-element-exists","clickElementUniquenessType":"uniqueHTML"},{"id":"company-name","type":"SelectorText","parentSelectors":["company-wrapper"],"selector":".search-result__head a","multiple":false,"regex":"","delay":0}]}

He doesn't need that. He wants to click the link and scrape the data which is on the company page like contact name, email, etc.

List > Show more + click on company name > get data from the linked page.
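To make that chain concrete, here is a rough Python sketch that builds the three-level selector tree and prints the parent path. The selector ids come from the sitemap in the first post and the "show more" button selector from the second one; the wrapper's type and its CSS selector are assumptions (the original sitemap is cut off at that point), and `WRAPPER_CSS_PLACEHOLDER` and `parent_chain` are purely hypothetical names. It only illustrates the structure and is not a confirmed fix for the problem.

```python
import json

# Assumption: the wrapper is a SelectorElementClick, as in the second sitemap.
# "WRAPPER_CSS_PLACEHOLDER" is hypothetical; the real selector is not shown.
sitemap = {
    "_id": "sapcompaniesdenmark",
    "startUrl": [""],  # left empty: the real URL is not shown in the thread
    "selectors": [
        {
            "id": "ExpandListAndSelectCompanyWrappers",
            "type": "SelectorElementClick",
            "parentSelectors": ["_root"],
            "selector": "WRAPPER_CSS_PLACEHOLDER",
            "multiple": True,
            "clickElementSelector": "button.btn__show-more",
            "clickType": "clickMore",
        },
        {
            "id": "SelectCompanyLinkInsideWrapper",
            "type": "SelectorLink",
            "parentSelectors": ["ExpandListAndSelectCompanyWrappers"],
            "selector": ".search-result__head a",
            "multiple": False,
        },
        {
            "id": "SaveName",
            "type": "SelectorText",
            "parentSelectors": ["SelectCompanyLinkInsideWrapper"],
            "selector": ".partner-details section:nth-of-type(1) header",
            "multiple": False,
        },
    ],
}


def parent_chain(sitemap, selector_id):
    """Walk parentSelectors from the given selector up to _root."""
    by_id = {s["id"]: s for s in sitemap["selectors"]}
    path = [selector_id]
    while path[-1] != "_root":
        path.append(by_id[path[-1]]["parentSelectors"][0])
    return list(reversed(path))


chain = parent_chain(sitemap, "SaveName")
print(" > ".join(chain))
print(json.dumps(sitemap))  # single-line JSON, like the sitemaps pasted above
```

The first printed line shows the order in which the selectors depend on each other: wrapper, then link, then the text on the linked page.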

I've tried, but I couldn't get it to work.

@Asad Oh, I see. I tested it with 2 of the company starting URLs and it refuses to go to the next link.

@qwerty It seems the issue lies in the link itself: because of the "#" symbol, the extension has difficulties proceeding further.
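One possible reading of the "#" remark, and this is an assumption rather than something confirmed in this thread: everything after "#" is a URL fragment, so links that differ only in that part all point to the same underlying page address. A small Python sketch with made-up example.com URLs (the real company links are not shown here) illustrates this:

```python
from urllib.parse import urldefrag

# Hypothetical company links; only the part after "#" differs.
links = [
    "https://example.com/partners#/profile/1001",
    "https://example.com/partners#/profile/1002",
    "https://example.com/partners#/profile/1003",
]


def strip_fragment(url: str) -> str:
    """Return the URL without its '#...' fragment."""
    return urldefrag(url).url


# Collect the distinct addresses that remain once fragments are dropped.
bases = {strip_fragment(link) for link in links}
print(bases)
```

If a tool compares or loads pages by the address without the fragment, all three links above look like one and the same page, which would be consistent with getting the same company for every link. Whether the extension actually behaves that way is not established here.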

After trying different combinations in the Cloud Scraper Environment, I managed to get the result you were probably after.

Thanks, both, for your effort! @viesturs, do the things you did in Cloud Scraper translate into the final sitemap? Or how can I replicate your findings? Because it seems you have indeed found what I am looking for!

Thanks for your help in advance!