Airtable help scrape

Hi, I can't create any scrape from this popup.

This popup is called from here.


Url: https://airtable.com/shrpj2r4Kjc4YoMu4/tbl8m95GiuWehnIiT?blocks=hide

Sitemap:
{"_id":"layoffs","startUrl":["https://airtable.com/shrpj2r4Kjc4YoMu4/tbl8m95GiuWehnIiT?blocks=hide"],"selectors":[{"id":"page1","type":"SelectorElement","parentSelectors":["_root"],"selector":"div#hyperbaseContainer","multiple":false,"delay":0},{"id":"comp1","type":"SelectorLink","parentSelectors":["page1"],"selector":".firstRow a","multiple":true,"delay":0},{"id":"popup","type":"SelectorElement","parentSelectors":["comp1"],"selector":".scrollContainer div.body","multiple":false,"delay":0},{"id":"comp name","type":"SelectorText","parentSelectors":["popup"],"selector":"div:nth-of-type(1) .cell div.col-12","multiple":false,"regex":"","delay":0},{"id":"status","type":"SelectorText","parentSelectors":["popup"],"selector":"div.flex-auto.truncate","multiple":false,"regex":"","delay":0}]}

Hello Norm,

You probably won't be able to perform this scrape using webscraper.io. I think your best bet would be an OCR scraping method. I know UI Kanto has OCR scraping capabilities, but I'm not sure about webscraper.io.

I'm not sure why you'd want to scrape that popup; you should be able to export the full list. The data in that pop-up is stored in each company's row anyway.

Please let me know how I can export this data.

I've been struggling with this too. Since many layoff lists are in Airtable, I am kind of motivated to figure it out.

I did find a low-tech way to do it.

Go to Print view and "print" it to a PDF. Once you have the PDF, Adobe can save/convert it to an Excel file. It's not 100%, but it got me 98% of what I wanted.

It was also VERY slow to even get to the print view, I imagine because of how many lines this document has.

If you can't make it work, let me know and I'll go through the steps and see if I can get it into XLS or CSV for you.

Try Zamzar for PDF-to-Excel conversion, and many other file types.

https://www.zamzar.com/convert/pdf-to-xls/


Hey Norm, found a solution.
I know I am a bit late to the party but here you go!
Airtable lets you step through records using the keyboard combination Ctrl + Shift + .
Holding this down, I got all the record URLs added to my browser history.
I then exported them using [Chrome History View](https://www.nirsoft.net/utils/chrome_history_view.html), which gives you the links in a format that you can process in Excel.
In Excel I adjusted their format (wrapping each URL in double quotes, so https://airtable.... becomes "https://airtable...") and delimited them with commas.
After that, I put them into the sitemap as starting URLs through the sitemap JSON import.
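
If you'd rather skip the Excel step, the quoting and comma-delimiting can be scripted. This is only a rough sketch, not something from the steps above: it assumes you have the exported history URLs as plain text, one per line, and the function name `build_sitemap` and the example record URLs are made up for illustration. It filters to Airtable links, drops duplicates, and writes them into the sitemap's `startUrl` list.

```python
import json


def build_sitemap(sitemap_json: str, history_urls: list[str],
                  prefix: str = "https://airtable.com/") -> str:
    """Return sitemap JSON whose startUrl list holds the de-duplicated history URLs.

    `prefix` is an assumption: it keeps only URLs that start with the Airtable host.
    """
    sitemap = json.loads(sitemap_json)
    seen = set()
    start_urls = []
    for url in history_urls:
        url = url.strip()
        # Keep Airtable links only, in first-seen order, without repeats.
        if url.startswith(prefix) and url not in seen:
            seen.add(url)
            start_urls.append(url)
    sitemap["startUrl"] = start_urls
    return json.dumps(sitemap)


if __name__ == "__main__":
    # Hypothetical inputs: a trimmed-down sitemap and placeholder record URLs.
    original = '{"_id":"layoffs","startUrl":[],"selectors":[]}'
    exported = [
        "https://airtable.com/shrEXAMPLE/tblEXAMPLE/recAAA",
        "https://airtable.com/shrEXAMPLE/tblEXAMPLE/recAAA",
        "https://airtable.com/shrEXAMPLE/tblEXAMPLE/recBBB",
    ]
    print(build_sitemap(original, exported))
```

Paste the printed JSON into the sitemap import. Since the start URLs then point straight at the record pages, the selectors probably need to be adjusted so the popup selector hangs off `_root` instead of the link selector, but I haven't verified that on this table.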

I hope it doesn't seem chaotic.
It was my first time doing it and it works.