Airtable help scrape

Hi, can't create any scrape from this popup which

this popup is called from here


Url: https://airtable.com/shrpj2r4Kjc4YoMu4/tbl8m95GiuWehnIiT?blocks=hide

Sitemap:
{"_id":"layoffs","startUrl":["https://airtable.com/shrpj2r4Kjc4YoMu4/tbl8m95GiuWehnIiT?blocks=hide"],"selectors":[{"id":"page1","type":"SelectorElement","parentSelectors":["_root"],"selector":"div#hyperbaseContainer","multiple":false,"delay":0},{"id":"comp1","type":"SelectorLink","parentSelectors":["page1"],"selector":".firstRow a","multiple":true,"delay":0},{"id":"popup","type":"SelectorElement","parentSelectors":["comp1"],"selector":".scrollContainer div.body","multiple":false,"delay":0},{"id":"comp name","type":"SelectorText","parentSelectors":["popup"],"selector":"div:nth-of-type(1) .cell div.col-12","multiple":false,"regex":"","delay":0},{"id":"status","type":"SelectorText","parentSelectors":["popup"],"selector":"div.flex-auto.truncate","multiple":false,"regex":"","delay":0}]}

Hello Norm,

you probably won't be able to perform a scrape using https://webscraper.io/ - I think your best bet would be using an OCR scraping method. I know UI Kanto has OCR scraping capabilities, but not sure about https://webscraper.io/

I'm not sure why you'd want to scrape from that, you should be able to export the full list. The data in that pop-up is hosted in each row specific to that company anyways.

please let me know how i can export this data?

I've been struggling with this also, since many layoff lists are in airtable, I am kind of motivated to figure it out.

I did find a low-tech way to do it.

Go to Print view and "print it" to a PDF. Once you have it in PDF, adobe can save/convert it to an excel file. It's not 100% but got me 98% of what I wanted.

It also was VERY slow in even getting to the print view, I imagine due to how many lines this document is.

If you can't make it work, let me know and I'll go through the steps and see if I can get it into XSL or CSV for you.
I

Try Zamzar for PDF-to-Excel conversion, and many other file types.

https://www.zamzar.com/convert/pdf-to-xls/

1 Like

Hey Norm, found a solution.
I know I am a bit late to the party but here you go!
Airtables gives you the ability to scroll through records using the keyboard combination Ctrl + Shift +.
Holding this down I got all the URLs added to my history.
I then exported them using [https://www.nirsoft.net/utils/chrome_history_view.html](http://Chrome History View) which gives you the links in a format that you can process in excel.
In excel I adjusted their format (instead of https://airtable.... to be like "https://airtable..." and delimitated them with commas.
After that, I put them into the sitemap as starting URLs through the import JSON.

I hope it doesn't seem chaotic.
It was my first time doing it and it works.