Regex to extract specific part of HTML

Hi,

With help from the forum i have extracted a part of the HTML source.
From that source, i would like to narrow it down to the digits after the "gtin" part, the length varies:

{ "@context": "http://schema.org/", "@type": "Product", "name": "Kids Smoothie", "image": "https://osuma.dk/custgfx/5038862683005.jpg", "description": "Smoothie med Jordbær, Solbær & Hindbær ", "brand": "Innocent", "gtin13": "5038862683005", "offers": { "@type": "Offer", "url": "https://osuma.dk/butik/produkt/26692", "price":"9.50", "priceCurrency": "DKK" } }

i am pretty sure it is possible for the trained regex expert, but i need help :slight_smile:

Thanks
Kenneth

If the "gtin" is always in digits, you can try:

(?<=gtin13": ")\d+

1 Like

Brilliant !!

Thanks for the help