Regex to extract specific part of HTML

Kenneth_Johansen · January 10, 2020, 8:30pm

Hi,

With help from the forum i have extracted a part of the HTML source.
From that source, i would like to narrow it down to the digits after the "gtin" part, the length varies:

{ "@context": "http://schema.org/", "@type": "Product", "name": "Kids Smoothie", "image": "https://osuma.dk/custgfx/5038862683005.jpg", "description": "Smoothie med Jordbær, Solbær & Hindbær ", "brand": "Innocent", "gtin13": "5038862683005", "offers": { "@type": "Offer", "url": "https://osuma.dk/butik/produkt/26692", "price":"9.50", "priceCurrency": "DKK" } }

i am pretty sure it is possible for the trained regex expert, but i need help

Thanks
Kenneth

leemeng · January 11, 2020, 1:07am

If the "gtin" is always in digits, you can try:

(?<=gtin13": ")\d+

Kenneth_Johansen · January 11, 2020, 5:58pm

Brilliant !!

Thanks for the help