I'm trying to scrape dates from a website with lab results, however recently the website changed and now there are two dates in a single cell, like this:
'Aug 19, 2021
(Tested: Aug 18, 2021)'
I need either the first one, or the latter, but I can't have both because that messes up further scripts.
How should I update my DATE selector?
Than you in advance for any help
Url: DrugsData.org: Results : Lab Test Results for Recreational Drugs
Sitemap:
{"_id":"drugsdata","startUrl":["https://drugsdata.org/search.php?search_field=adv&name=&field_test=&substance1=&substance2=&id=&color=&colorexact=0&city=&source=&m1=&y1=&m2=&y2=&state=&country=9999&sold_as_ecstasy=both&on_hold=both"],"selectors":[{"id":"wrapper","type":"SelectorElement","parentSelectors":["_root"],"selector":"table#MainResults","multiple":false,"delay":0},{"id":"BADGE","type":"SelectorText","parentSelectors":["rows selector"],"selector":"td:nth-of-type(2), th[rowspan]:nth-of-type(2)","multiple":false,"regex":"","delay":0},{"id":"TITLE 1/2","type":"SelectorText","parentSelectors":["rows selector"],"selector":"td:nth-of-type(3), th#Substance","multiple":false,"regex":"","delay":0},{"id":"TITLE 2/2","type":"SelectorText","parentSelectors":["rows selector"],"selector":"th#Amounts, td:nth-of-type(4)","multiple":false,"regex":"","delay":0},{"id":"DATE","type":"SelectorText","parentSelectors":["rows selector"],"selector":"th:nth-of-type(4), td:nth-of-type(5)","multiple":false,"regex":"","delay":0},{"id":"link","type":"SelectorLink","parentSelectors":["rows selector"],"selector":".Tablet a","multiple":false,"delay":0},{"id":"TEXT 2/2","type":"SelectorText","parentSelectors":["rows selector"],"selector":"th:nth-of-type(5), td:nth-of-type(6)","multiple":false,"regex":"","delay":0},{"id":"img","type":"SelectorImage","parentSelectors":["rows selector"],"selector":".Tablet img","multiple":false,"delay":0},{"id":"rows selector","type":"SelectorElement","parentSelectors":["wrapper"],"selector":"tbody tr","multiple":true,"delay":0}]}
