Outer HTML capturing without regex

Hi @iconoclast this one is for you:)

I am looking to capture the Outer HTML that is associate with the Company name in the title.

As can see the company name "SearchBrothers" falls outside the tags within the the

tag.
I found a workaround using regex to select everything after the comma but I know there was a simpler way to do this without regex. Any help?

image

Sitemap:
{"_id":"smxeast","startUrl":["https://marketinglandevents.com/smx/east/speakers/"],"selectors":[{"id":"Element Click","type":"SelectorElementClick","parentSelectors":["_root"],"selector":"div.modal.in div.modal-body","multiple":true,"delay":0,"clickElementSelector":"div.spkrbox > a:nth-of-type(2),.close","clickType":"clickOnce","discardInitialElements":false,"clickElementUniquenessType":"uniqueText"},{"id":"Name","type":"SelectorText","parentSelectors":["Element Click"],"selector":"h4.modal-title","multiple":false,"regex":"","delay":0},{"id":"Title","type":"SelectorText","parentSelectors":["Element Click"],"selector":"em","multiple":false,"regex":"","delay":0},{"id":"Bio","type":"SelectorText","parentSelectors":["Element Click"],"selector":"div.col-md-12 div.col-md-12","multiple":false,"regex":"","delay":0},{"id":"Company","type":"SelectorText","parentSelectors":["Element Click"],"selector":"div.col-md-10 p","multiple":false,"regex":"\s(\w+)$","delay":0},{"id":"Twitter","type":"SelectorElementAttribute","parentSelectors":["Element Click"],"selector":"a.icon:nth-of-type(1)","multiple":false,"extractAttribute":"href","delay":0},{"id":"linkedin","type":"SelectorElementAttribute","parentSelectors":["Element Click"],"selector":"a.icon:nth-of-type(2)","multiple":false,"extractAttribute":"href","delay":0}]}

Hi!

Unfortunately it's not possible to pick company name without regex, cause otherwise you'll also get a lot of white space:
image

IMO it's easier if you add a link selector to profiles, and then just add text selector to pick company name clearly.
Selector would be p a (set as Text selector to pick text instead of Link)

P.S. and if you're using 0.3.8 there's a new feature to use lookbehind using regex

0.3.8 [Feature] In a recent release Chrome added lookbehind to regex engine. Now you can write regex like this (?<=sku: ).+. This will extract 12345 from sku: 12345

P.P.S. and it's inner html not outer

Hmm... I guess that's why my attempt to google CSS selector for Outer Html didn't provide any solutions. Thanks for the info. Regex ended up working fine and I am happy about lookbehind being added