Hi all, I'm trying to sort through around 300000 links from a website and scrape all of the ones that contain certain keywords (~500), so far I've managed to just scrape 70 at random and just look at them. Problem is, the exported data doesn't show the titles of the links, only some random garbled numbers, which doesn't help me at all.
How can I make it so the exported data has the titles of the links somewhere attached? And most importantly, is there a way to exclude all other links from the results apart from the ones with the keywords I want?
This is what I have so far
{"_id":"ant1","startUrl":["https://antenna.gr/embed/[309900-309976]"],"selectors":[{"id":"vid","parentSelectors":["_root"],"sitemapXmlMinimumPriority":0.1,"sitemapXmlUrlRegex":"","sitemapXmlUrls":["https://www.antenna.gr/sitemap.xml","https://www.antenna.gr/shows-sitemap","https://www.antenna.gr/videos-sitemap"],"type":"SelectorSitemapXmlLink"},{"delay":0,"extractAttribute":"","id":"video","multiple":true,"parentSelectors":["_root"],"selector":".bmpui-ui-hugeplaybacktogglebutton div","type":"SelectorElementAttribute"}]}