I'm interested in extracting a relational table from a Wikipedia infobox. For example, consider the page https://en.wikipedia.org/wiki/Star_Wars:_Episode_IX. On the right side of the page there is an infobox that, converted to CSV, would look like this:
key,value
Directed By,J. J. Abrams
Produced By,Kathleen Kennedy
Produced By,J. J. Abrams
Produced By,Michelle Rejwan
etc.
Now, AFAIK webscraper would have trouble generating the duplicate "Produced By" rows (in the HTML, "Produced By" is a single row header). So any other result that could be easily converted into the above would be great. The page is extraordinarily well labeled, but getting the right "nesting" or matching elements into the selector is tricky.
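
To make the target output concrete, here is a rough sketch of the flattening I have in mind, using only Python's standard library. The HTML in SAMPLE is a simplified stand-in I wrote by hand (an assumption: each row has a th header and a td whose values may be li items; the real page has more nesting and attributes), and the names InfoboxParser, infobox_rows and to_csv are made up for illustration.

```python
import csv
import io
from html.parser import HTMLParser

# Simplified stand-in for the infobox markup (assumed structure, not the real page).
SAMPLE = """
<table class="infobox">
<tr><th scope="row">Directed By</th><td>J. J. Abrams</td></tr>
<tr><th scope="row">Produced By</th><td><ul>
<li>Kathleen Kennedy</li>
<li>J. J. Abrams</li>
<li>Michelle Rejwan</li>
</ul></td></tr>
</table>
"""


class InfoboxParser(HTMLParser):
    """Collects (key, value) pairs, repeating the row header for each value."""

    def __init__(self):
        super().__init__()
        self.rows = []     # resulting (key, value) pairs
        self._key = None   # header text of the current row
        self._cell = None  # "th" or "td" while inside such a cell
        self._buf = []     # text fragments of the current th / li / td
        self._items = []   # values taken from <li> elements in the current td

    def _text(self):
        # Join fragments and collapse runs of whitespace.
        return " ".join("".join(self._buf).split())

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._key = None
        elif tag in ("th", "td"):
            self._cell, self._buf, self._items = tag, [], []
        elif tag == "li" and self._cell == "td":
            self._buf = []  # start a fresh value for this list item

    def handle_data(self, data):
        if self._cell:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "th" and self._cell == "th":
            self._key, self._cell = self._text(), None
        elif tag == "li" and self._cell == "td":
            self._items.append(self._text())
        elif tag == "td" and self._cell == "td":
            # Use the list items if there were any, else the whole cell text.
            values = self._items or [self._text()]
            if self._key is not None:
                self.rows.extend((self._key, v) for v in values if v)
            self._cell = None


def infobox_rows(html):
    parser = InfoboxParser()
    parser.feed(html)
    return parser.rows


def to_csv(rows):
    out = io.StringIO()
    writer = csv.writer(out)
    writer.writerow(["key", "value"])
    writer.writerows(rows)
    return out.getvalue()


if __name__ == "__main__":
    print(to_csv(infobox_rows(SAMPLE)))
```

Run against SAMPLE, this emits one row per value, so "Produced By" appears three times. I have not tried it on the live page, where cells separated by br tags or containing nested tables would need extra handling.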