I'm compiling a list of contacts for substance abuse facilities. Psychology Today has a great repository with many contacts. I was able to scrape their comprehensive list of about 39k contacts. This returned name, location, and phone number. The issue is I need email and URL for each contact. The site itself offers email contact through a form that obscures the actual email address from view. It does the same for the counselors websites. I'm able to create a link selector to follow each website link, but pulling the actual URL is proving difficult once I hit their sites, since they are all structured differently. Anyone have a workaround for this?
https://www.psychologytoday.com/us/therapists/addiction/
Once at this link click on a state, then a provider, then the "website" link to see what I mean.
Sitemap:
{"_id":"pt_add_w_site","startUrl":["https://www.psychologytoday.com/us/therapists/addiction"],"selectors":[{"id":"State Pagination","type":"SelectorLink","parentSelectors":["_root"],"selector":".us_region_list a","multiple":true,"delay":0},{"id":"ind_state_pagination","type":"SelectorLink","parentSelectors":["State Pagination","ind_state_pagination"],"selector":"a.pager-page","multiple":true,"delay":0},{"id":"Ind Link","type":"SelectorLink","parentSelectors":["State Pagination","ind_state_pagination"],"selector":"a[itemprop='url']","multiple":true,"delay":0},{"id":"Name","type":"SelectorText","parentSelectors":["Ind Link"],"selector":".hidden-sm-down h1","multiple":false,"regex":"","delay":0},{"id":"Phone","type":"SelectorText","parentSelectors":["Ind Link"],"selector":"[itemprop='telephone'] a","multiple":false,"regex":"","delay":0},{"id":"city","type":"SelectorText","parentSelectors":["Ind Link"],"selector":".hidden-sm-down span[itemprop='addressLocality']","multiple":false,"regex":"","delay":0},{"id":"state","type":"SelectorText","parentSelectors":["Ind Link"],"selector":".hidden-sm-down span[itemprop='addressRegion']","multiple":false,"regex":"","delay":0},{"id":"web_link","type":"SelectorLink","parentSelectors":["Ind Link"],"selector":"a.btn-md.btn-default","multiple":false,"delay":0}]}